summaryrefslogtreecommitdiffstats
path: root/Modules
diff options
context:
space:
mode:
authorRaymond Hettinger <python@rcn.com>2013-07-28 09:34:42 (GMT)
committerRaymond Hettinger <python@rcn.com>2013-07-28 09:34:42 (GMT)
commit662908b5e536308713ffb4bcbe3a7a8f250e4003 (patch)
treef558d5b52c35a612db104639e462617fead513fd /Modules
parent16b10c64d719fed995ecfe01a1f18e4e184d0659 (diff)
downloadcpython-662908b5e536308713ffb4bcbe3a7a8f250e4003.zip
cpython-662908b5e536308713ffb4bcbe3a7a8f250e4003.tar.gz
cpython-662908b5e536308713ffb4bcbe3a7a8f250e4003.tar.bz2
Restore the data block size to 62.
The former block size traded away good fit within cache lines in order to gain faster division in deque_item(). However, compilers are getting smarter and can now replace the slow division operation with a fast integer multiply and right shift. Accordingly, it makes sense to go back to a size that lets blocks neatly fill entire cache-lines. GCC-4.8 and CLANG 4.0 both compute "x // 62" with something roughly equivalent to "x * 9520900167075897609 >> 69".
Diffstat (limited to 'Modules')
-rw-r--r--Modules/_collectionsmodule.c9
1 files changed, 6 insertions, 3 deletions
diff --git a/Modules/_collectionsmodule.c b/Modules/_collectionsmodule.c
index dc31cbc..26d8783 100644
--- a/Modules/_collectionsmodule.c
+++ b/Modules/_collectionsmodule.c
@@ -10,11 +10,14 @@
/* The block length may be set to any number over 1. Larger numbers
* reduce the number of calls to the memory allocator, give faster
* indexing and rotation, and reduce the link::data overhead ratio.
- * Ideally, the block length should be a power-of-two for faster
- * division/modulo computations during indexing.
+ *
+ * Ideally, the block length will be set to two less than some
+ * multiple of the cache-line length (so that the full block
+ * including the leftlink and rightlink will fit neatly into
+ * cache lines).
*/
-#define BLOCKLEN 64
+#define BLOCKLEN 62
#define CENTER ((BLOCKLEN - 1) / 2)
/* A `dequeobject` is composed of a doubly-linked list of `block` nodes.