summaryrefslogtreecommitdiffstats
path: root/Doc
diff options
context:
space:
mode:
authorRaymond Hettinger <rhettinger@users.noreply.github.com>2019-03-12 07:43:27 (GMT)
committerGitHub <noreply@github.com>2019-03-12 07:43:27 (GMT)
commitfc06a192fdc44225ef1cc879f615a81931ad0a85 (patch)
tree3a6eb28aac5ebf320cb665fa53a9a08cf75fbcc0 /Doc
parent3e936431e23b424b1e4665e8165c245924f0ab02 (diff)
downloadcpython-fc06a192fdc44225ef1cc879f615a81931ad0a85.zip
cpython-fc06a192fdc44225ef1cc879f615a81931ad0a85.tar.gz
cpython-fc06a192fdc44225ef1cc879f615a81931ad0a85.tar.bz2
bpo-35892: Fix mode() and add multimode() (#12089)
Diffstat (limited to 'Doc')
-rw-r--r--Doc/library/statistics.rst38
-rw-r--r--Doc/whatsnew/3.8.rst8
2 files changed, 38 insertions, 8 deletions
diff --git a/Doc/library/statistics.rst b/Doc/library/statistics.rst
index 81119da..97e1c3a 100644
--- a/Doc/library/statistics.rst
+++ b/Doc/library/statistics.rst
@@ -37,7 +37,7 @@ Averages and measures of central location
These functions calculate an average or typical value from a population
or sample.
-======================= =============================================
+======================= ===============================================================
:func:`mean` Arithmetic mean ("average") of data.
:func:`fmean` Fast, floating point arithmetic mean.
:func:`harmonic_mean` Harmonic mean of data.
@@ -45,8 +45,9 @@ or sample.
:func:`median_low` Low median of data.
:func:`median_high` High median of data.
:func:`median_grouped` Median, or 50th percentile, of grouped data.
-:func:`mode` Mode (most common value) of discrete data.
-======================= =============================================
+:func:`mode` Single mode (most common value) of discrete or nominal data.
+:func:`multimode` List of modes (most common values) of discrete or nomimal data.
+======================= ===============================================================
Measures of spread
------------------
@@ -287,12 +288,12 @@ However, for reading convenience, most of the examples show sorted sequences.
.. function:: mode(data)
- Return the most common data point from discrete or nominal *data*. The mode
- (when it exists) is the most typical value, and is a robust measure of
- central location.
+ Return the single most common data point from discrete or nominal *data*.
+ The mode (when it exists) is the most typical value and serves as a
+ measure of central location.
- If *data* is empty, or if there is not exactly one most common value,
- :exc:`StatisticsError` is raised.
+ If there are multiple modes, returns the first one encountered in the *data*.
+ If *data* is empty, :exc:`StatisticsError` is raised.
``mode`` assumes discrete data, and returns a single value. This is the
standard treatment of the mode as commonly taught in schools:
@@ -310,6 +311,27 @@ However, for reading convenience, most of the examples show sorted sequences.
>>> mode(["red", "blue", "blue", "red", "green", "red", "red"])
'red'
+ .. versionchanged:: 3.8
+ Now handles multimodal datasets by returning the first mode encountered.
+ Formerly, it raised :exc:`StatisticsError` when more than one mode was
+ found.
+
+
+.. function:: multimode(data)
+
+ Return a list of the most frequently occurring values in the order they
+ were first encountered in the *data*. Will return more than one result if
+ there are multiple modes or an empty list if the *data* is empty:
+
+ .. doctest::
+
+ >>> multimode('aabbbbccddddeeffffgg')
+ ['b', 'd', 'f']
+ >>> multimode('')
+ []
+
+ .. versionadded:: 3.8
+
.. function:: pstdev(data, mu=None)
diff --git a/Doc/whatsnew/3.8.rst b/Doc/whatsnew/3.8.rst
index 9cd5a3a..ad86917 100644
--- a/Doc/whatsnew/3.8.rst
+++ b/Doc/whatsnew/3.8.rst
@@ -282,6 +282,9 @@ Added :func:`statistics.fmean` as a faster, floating point variant of
:func:`statistics.mean()`. (Contributed by Raymond Hettinger and
Steven D'Aprano in :issue:`35904`.)
+Added :func:`statistics.multimode` that returns a list of the most
+common values. (Contributed by Raymond Hettinger in :issue:`35892`.)
+
Added :class:`statistics.NormalDist`, a tool for creating
and manipulating normal distributions of a random variable.
(Contributed by Raymond Hettinger in :issue:`36018`.)
@@ -591,6 +594,11 @@ Changes in the Python API
* The function :func:`platform.popen` has been removed, it was deprecated since
Python 3.3: use :func:`os.popen` instead.
+* The :func:`statistics.mode` function no longer raises an exception
+ when given multimodal data. Instead, it returns the first mode
+ encountered in the input data. (Contributed by Raymond Hettinger
+ in :issue:`35892`.)
+
* The :meth:`~tkinter.ttk.Treeview.selection` method of the
:class:`tkinter.ttk.Treeview` class no longer takes arguments. Using it with
arguments for changing the selection was deprecated in Python 3.6. Use