summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorRaymond Hettinger <python@rcn.com>2003-12-12 13:13:47 (GMT)
committerRaymond Hettinger <python@rcn.com>2003-12-12 13:13:47 (GMT)
commitfeb78c94fa444a75b912ca355c3c80d7453f03f0 (patch)
treeadcee8530db89d03316317b7fcdede9788663968
parent1f84ed0d8a1f23efde85ec9b68557b87dbe5d3e3 (diff)
downloadcpython-feb78c94fa444a75b912ca355c3c80d7453f03f0.zip
cpython-feb78c94fa444a75b912ca355c3c80d7453f03f0.tar.gz
cpython-feb78c94fa444a75b912ca355c3c80d7453f03f0.tar.bz2
Expand the groupby() example to:
* show that it is typically used with sorted data, * highlight commonalities with SQL's groupby and Unix's uniq, * demonstrate valid uses for the default identity function, * add some excitement by suggesting the range of possibilities.
-rw-r--r--Doc/whatsnew/whatsnew24.tex15
1 files changed, 15 insertions, 0 deletions
diff --git a/Doc/whatsnew/whatsnew24.tex b/Doc/whatsnew/whatsnew24.tex
index 05b06b5..e0e2c9c 100644
--- a/Doc/whatsnew/whatsnew24.tex
+++ b/Doc/whatsnew/whatsnew24.tex
@@ -294,6 +294,21 @@ return consecutive runs of odd or even numbers.
>>>
\end{verbatim}
+Like its SQL counterpart, \function{groupby()} is typically used with
+sorted input. The logic for \function{groupby()} is similar to the
+\UNIX{} \code{uniq} filter which makes it handy for eliminating,
+counting, or identifying duplicate elements:
+
+\begin{verbatim}
+>>> word = 'abracadabra'
+>>> [k for k, g in groupby(list.sorted(word))]
+['a', 'b', 'c', 'd', 'r']
+>>> [(k, len(list(g))) for k, g in groupby(list.sorted(word))]
+[('a', 5), ('b', 2), ('c', 1), ('d', 1), ('r', 2)]
+>>> [k for k, g in groupby(list.sorted(word)) if len(list(g)) > 1]
+['a', 'b', 'r']
+\end{verbatim}
+
\item A new \function{getsid()} function was added to the
\module{posix} module that underlies the \module{os} module.
(Contributed by J. Raynor.)