Topic 09-discussion.md
Technical Choices
Modularity and Granularity
In an ideal world, a program should only load the libraries it needs. Penlight is intended to work in situations where an extra 100Kb of bytecode could be a problem. It is straightforward but tedious to load exactly what you need:
local data = require 'pl.data' local List = require 'pl.List' local array2d = require 'pl.array2d' local seq = require 'pl.seq' local utils = require 'pl.utils'
This is the style that I follow in Penlight itself, so that modules don't mess with the global environment; also, stringx.import()
is not used because it will update the global string table.
But require 'pl'
is more convenient in scripts; the question is how to ensure that one doesn't load the whole kitchen sink as the price of convenience. The strategy is to only load modules when they are referenced. In 'init.lua' (which is loaded by require 'pl'
) a metatable is attached to the global table with an __index
metamethod. Any unknown name is looked up in the list of modules, and if found, we require it and make that module globally available. So when tablex.deepcompare is encountered, looking up tablex causes 'pl.tablex' to be required. .
Modifying the behaviour of the global table has consequences. For instance, there is the famous module strict which comes with Lua itself (perhaps the only standard Lua module written in Lua itself) which also does this modification so that global variiables must be defined before use. So the implementation in 'init.lua' allows for a 'not found' hook, which 'pl.strict.lua' uses.
But the strategy is worth the effort: the old 'kitchen sink' 'init.lua' would pull in about 260K of bytecode, whereas now typical programs use about 100K less, and short scripts even better - for instance, if they were only needing functionality in utils .
There are some functions which mark their output table with a special metatable, when it seems particularly appropriate. For instance, tablex.makeset creates a Set , and seq.copy creates a List . But this does not automatically result in the loading of pl.Set and pl.List ; only if you try to access any of these methods. In 'utils.lua', there is an exported table called stdmt
:
stdmt = { List = {}, Map = {}, Set = {}, MultiMap = {} }
If you go through 'init.lua', then these plain little 'identity' tables get an __index
metamethod which forces the loading of the full functionality. Here is the code from 'list.lua' which starts the ball rolling for lists:
List = utils.stdmt.List
List.__index = List
List._name = "List"
List._class = List
The 'load-on-demand' strategy helps to modularize the library. Especially for more casual use, require 'pl'
is a good compromise between convenience and modularity.
In this current version, I have generally reduced the amount of trickery involved. Previously, Map was defined in pl.class ; now it is sensibly defined in pl.Map ; pl.class only contains the basic class mechanism (and returns that function.) For consistency, List is returned directly by require 'pl.List'
(note the uppercase 'L'), Also, the amount of module dependencies in the non-core libraries like pl.config have been reduced.
Defining what is Callable
'utils.lua' exports function_arg
which is used extensively throughout Penlight. It defines what is meant by 'callable'. Obviously true functions are immediately passed back. But what about strings? The first option is that it represents an operator in 'operator.lua', so that '<' is just an alias for operator.lt .
We then check whether there is a function factory defined for the metatable of the value.
(It is true that strings can be made callable, but in practice this turns out to be a cute but dubious idea, since all strings share the same metatable. A common programming error is to pass the wrong kind of object to a function, and it's better to get a nice clean 'attempting to call a string' message rather than some obscure trace from the bowels of your library.)
The other module that registers a function factory is pl.func . Placeholder expressions cannot be directly calleable, and so need to be instantiated and cached in as efficient way as possible.
(An inconsistency is that utils.is_callable does not do this thorough check.)