1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
|
Pre-notes
The cutting of the channel system is not as clean as I would like
it to be, simply because cisco has the special need of a channel
system trimmed down to the std* channels, without complete removal.
I am not sure that I have removed the maximum amount of C Api's
and functions possible for this specific configuration.
A first step in rationalizing this section would be NO_CHANNELS
to remove the I/O system completely, and then NO_NONSTDCHAN
for minial exposure of channels. NO_FILEEVENTS is orthogonal
to NO_NONSTDCHAN. Drivers are possible only if not NO_CHANNELS,
but can be disabled separately. The standard channels need the
"file" driver (currently not disable-able), should use #ifdef's
to ensure integrity.
=> Would be interesting to have a configuration tool which
is able to express and enforce these constraints.
=> The linux core configuration uses the domain specific
language CML2 (Eric Raymond, written in Python).
! Investigate possible usage of SourceNavigator as
basic for parsing the Tcl core. Use custom tools to
follow dependencies between structures and functions.
(What-If tools: What if I exclude this function/struct,
what else can be removed, or requires this).
Also: What are the leaf functions in the system ...
! Mapping help: Associate functions with functional areas
and see how the areas relate, how much can be removed
whenever an area is excluded ...
------------------------------------------------------------------------
------------------------------------------------------------------------
------------------------------------------------------------------------
Shrinking the core.
Filesystem
Shrinking the usage of stack
Large static arrays on the stack
Look for #define's, check usage, create #defines if necessary
DString !! (initial dstring data in structure!)
RE's ?
NRE1
== running a stack test of the full test suite for a build is 1.5 hours ==
== something for the evening and the night ==
Document methodology of testing stack
Macros
TCL_NO_<feature> to deactivate/cut feature
MODULAR_TCL activates all TCL_NO_<feature> macros
------------------------------------------------------------------------
Cut 1
The cut currently restricts itself to the UNIX and GENERIC parts.
No changes in Win* and Mac areas.
channel system
- no sockets TCL_NO_SOCKETS /
- no serial/tty TCL_NO_TTY /
- no pipes TCL_NO_PIPES /
- no pid command TCL_NO_PIDCMD /
- channel system provides TCL_NO_NONSTDCHAN /
only std* channels [x]
- no channel copying TCL_NO_CHANNELCOPY /
- no [read]ing TCL_NO_CHANNEL_READ /
- no [eof] [/] TCL_NO_CHANNEL_EOF /
- no channel set/get cfg TCL_NO_CHANNEL_CONFIG [+] /
- no [fblocked] TCL_NO_CHANNEL_BLOCKED [/] /
- no fileevents TCL_NO_FILEEVENTS [=] /
filesystem
- disable filesystem TCL_NO_FILESYSTEM [%] /*
- disable load'ing TCL_NO_LOADCMD /
master/slave interpreters
- disable slave interp TCL_NO_SLAVEINTERP /*
- disable command aliases TCL_NO_CMDALIASES /*
[*] Access from the C level is not removed.
[x] Implies that no .rc can be read during unix init.
Implies that no startup script can be read by tclsh.
Implies NO_SOCKETS, NO_TTY, NO_PIPES
Implies currently 'no "source" cmd' and no loading of encoding files.
In the generic case this functionality can the reimplemented by direct
OS calls without using the channel system. Makes the
implementation platform dependent. As Cisco doesn't want this functionality
we disable them without adding a new implementation.
Implies that channels cannot be moved/shared between master/slave interps.
(seek is removed under the assumption that the std* channels are not seekable)
[/] Tcl_Eof, Tcl_InputBlocked stay because they are required by [gets].
[+] Tcl_SetChannelOption stays, required for initial config of std channels.
[=] Implies no socket servers. Reason: Accept callback for socket server
is done through fev's
[%] Ripping the filesystem intrudes heavily on the startup sequence of the
interpreter as auto_path, package paths, etc. can't be initialized anymore.
This also cuts into the initialization of encodings.
Given that encodings will be changed later to not use UTF internally this
is no big deal. For Cisco. Others might want to have 'no fs', but UTF.
We have to check that the startup sequence is still operational.
Given that without a FS loading of encoding from files is
impossible the loss of initialization is again not so big a deal.
------------------------------------------------------------------------
------------------------------------------------------------------------
------------------------------------------------------------------------
Handling of stub table when cutting features:
1. Disable all functions for the feature, from the bottom up to
the top (script level command). This includes full disabling
of stub functions too.
The bottom-up approach enforces link errors in the higher
levels and thus allows us to use the compiler to find all
relevant places where we have to cut.
Cutting stub functions is essential to find everything.
2. Go through the functions causing link errors in tclStubInit.o
== stub functions. Add variants which are empty, return errors
etc. and compile these when the feature is disabled.
** Changed **
Add suppressor definitions to "tcl*.decls" and regen the code.
------------------------------------------------------------------------
------------------------------------------------------------------------
------------------------------------------------------------------------
Future:
Implement a mechanism for 'tcl.decls'
which allows the definition of (static, loadable) sub packages.
So that the stub table is minimally initialized and
sub packages initialize their slots when loaded.
------------------------------------------------------------------------
------------------------------------------------------------------------
------------------------------------------------------------------------
Cut 2, Working on the stacksize.
* Spliced the NRE1 engine by Miguel Sofer into
the core. Stack testing the testsuite show
an average saving of 4 K stack space.
* Reducing amount of characters directly stored in
a DString structure from 200 to 1. Average
savings when going through testsuite:
* Going through #defines in headers and sources
to identify more locations placing data on the
stack.
New controlling macros.
#define TCL_NO_RECURSE /* enables the NRE modifications */
set by default
Tcl_ExternalToUtfDString is in trouble for TCL_DSTRING_STATIC_SIZE=1 I
guess TDSS < UTF_MAX is trouble because the function does not check
before attempting the first conversion. ... Ok, SZ=25 is ok for the
testsuite. This doesn't mean that it is ok in real life, but encoding
is cut of for cisco, so we can screw this here. Keep in mind for
later.
1,4,5,10,17 fail
21,25 ok
#define TCL_FMT_STATIC_FLOATBUFFER_SZ 320
#define TCL_FMT_STATIC_VALIDATE_LIST 16
#define TCL_FOREACH_STATIC_ARGS 9
#define TCL_FOREACH_STATIC_LIST_SZ 4
#define TCL_FOREACH_STATIC_VARLIST_SZ 5
#define TCL_RESULT_APPEND_STATIC_LIST_SZ 16
#define TCL_MERGE_STATIC_LIST_SZ 20
#define TCL_PROC_STATIC_CLOCALS 20
#define TCL_PROC_STATIC_ARGS 20
#define TCL_INVOKE_STATIC_ARGS 20
#define TCL_EVAL_STATIC_VARCHARS 30
#define TCL_STATS_COUNTERS 10
#define TCL_LSORT_STATIC_MERGE_BUCKETS 30
-DTCL_FMT_STATIC_FLOATBUFFER_SZ=320 -DTCL_FMT_STATIC_VALIDATE_LIST=16 -DTCL_FOREACH_STATIC_ARGS=9 -DTCL_FOREACH_STATIC_LIST_SZ=4 -DTCL_FOREACH_STATIC_VARLIST_SZ=5 -DTCL_RESULT_APPEND_STATIC_LIST_SZ=16 -DTCL_MERGE_STATIC_LIST_SZ=20 -DTCL_PROC_STATIC_CLOCALS=20 -DTCL_PROC_STATIC_ARGS=20 -DTCL_INVOKE_STATIC_ARGS=20 -DTCL_EVAL_STATIC_VARCHARS=30 -DTCL_STATS_COUNTERS=10 -DTCL_LSORT_STATIC_MERGE_BUCKETS=30
cut_dstring ...
-DTCL_FMT_STATIC_FLOATBUFFER_SZ=0 -DTCL_FMT_STATIC_VALIDATE_LIST=0 -DTCL_FOREACH_STATIC_ARGS=0 -DTCL_FOREACH_STATIC_LIST_SZ=0 -DTCL_FOREACH_STATIC_VARLIST_SZ=0 -DTCL_RESULT_APPEND_STATIC_LIST_SZ=0 -DTCL_MERGE_STATIC_LIST_SZ=0 -DTCL_PROC_STATIC_CLOCALS=0 -DTCL_PROC_STATIC_ARGS=0 -DTCL_INVOKE_STATIC_ARGS=0 -DTCL_EVAL_STATIC_VARCHARS=0 -DTCL_STATS_COUNTERS=0 -DTCL_LSORT_STATIC_MERGE_BUCKETS=0
------------------------------------------------------
General look through the code for static buffers
on the stack.
tclAlloc /ok
tclAsync /ok
tclBasic Tcl_CallWhenDeleted 32+INT_SPACE
Tcl_ExprString TCL_DOUBLE_SPACE
tclBinary /ok
tclClock /ok
tclCmdAH StoreStatData TCL_INTEGER_SPACE
Tcl_FormatObjCmd
...(Obj)Cmd functions often hold quite a lot of state in local
variables.
For exact measurements we have to instrument the C code with
additional (macroized) function calls to record exact sizes for every
invoked C function. Automatic instrumentation is difficult. Could
instrument the dispatchers first (where commands are invoked) to get
stack sizes for bigger blocks of execution (command + utility
functionality called by it).
-----------------------------------------------------------------------------------------------
A big structure is 'CompileEnv'. Instead of trying to reduce its size
it might be better to allocate the whole structure of the heap.
#define TCL_COMPENV_ON_HEAP /* Allocate temp. CompileEnv structs off the heap */
Stack measure
@ TclSetByteCodeFromAny ../../src/tcl834_stkr/unix/../generic/tclCompile.c 300 = 2036
@ TclCompileByteCodesForExpr ../../src/tcl834_stkr/unix/../generic/tclExecute.c 6022 = 2008
On Heap
@ TclSetByteCodeFromAny ../../src/tcl834_stkr/unix/../generic/tclCompile.c 300 = 100
@ TclCompileByteCodesForExpr ../../src/tcl834_stkr/unix/../generic/tclExecute.c 6022 = 68
-----------------------------------------------------------------------------------------------
Ditto Tcl_Parse
@ TclCompileSetCmd ../../src/tcl834_stkr/unix/../generic/tclCompCmds.c 1618 = 640
@ TclCompileIncrCmd ../../src/tcl834_stkr/unix/../generic/tclCompCmds.c 1356 = 636
On Heap
@ TclCompileSetCmd ../../src/tcl834_stkr/unix/../generic/tclCompCmds.c 1619 = 268
@ TclCompileIncrCmd ../../src/tcl834_stkr/unix/../generic/tclCompCmds.c 1356 = 264
-----------------------------------------------------------------------------------------------
TclObjInterpProc
832, 792, 752
=> STATIC_CLOCALS 20 ==> 716 bytes accounted for.
=> STATIC_CLOCALS is of help and changing it does not crash the interp.
TclInvokeStringCommand
TCL_INVOKE_STATIC_ARGS => 20 x char* = 80
|