8 - Shared Memory Speed Tables

Client-server speed tables can involve a noticable performance hit, as a sizable amount of Tcl code gets executed to make the remote speed table function like a local one.

While they're still pretty fast, server actions are inherently serialized because of the single-threaded access model afforded using standard Tcl fileevent actions within the Tcl event model.

When the speed table resides on the same machine as the client, and particularly in this era of relatively inexpensive multiprocessor systems, it would be highly desirable for a client to be able to access the speed table directly through shared memory, bypassing the server entirely.

This work was undertaken in the summer of '07 by Peter da Silva. The goal was to provide a way for same-server clients to access the speed table through shared memory while retaining the ability to build and use speed tables without using shared memory at all.

Tricky synchronization issues surfaced immediately. For instance, what should we do if a row gets changed or added while a search is being performed? We don't want to completely lock out access to the table during a search. Thus we have to really deal with database updates during searches, which raise referential integrity issues and garbage collection / dangling pointer issues. Many searches, such as ones involving results sorting, collecting a set of pointers to the rows that have matched. Those rows cannot disappear behind search's back.

Also the code was already in heavy production with tables containing tens of millions of rows. This work had to be rock solid or it wouldn't be usable.

To simplify the problem, we decided to funnel writes through the client/server mechanism and only allows reads and searches to occur through shared memory.

Our approach is to maintain metadata about in-progress searches in shared memory and have a cycle number that increases as the database is updated. When a search begins, the client copies the current cycle number to a word in shared memory allocated for it by the server. As normal activity causes rows to be modified, updated. or deleted by the server the cycle they were modified on is stored in the row. If rows (or any other shared memory object, such as strings) are deleted, they are added to a garbage pool along with the current cycle, but not actually freed for reuse until the server garbage collects them on a later cycle.

If the client detects that a row it's examining has been modified since it started its search, it restarts the search operation. The server makes sure to update pointers within shared memory in an order such that the client will never step into a partially modified structure. This allows the whole operation to proceed without explicit locks, so long as pointer and cycle updates are atomic and ordered.

Garbage collection is performed by locating deleted memory elements that have a cycle number is lower than the cycle number of any client currently performing a search.

Note: All shared tables must be part of the same C Extension.

New methods to the meta-table (class) command:

create name master list

Creates a new master table based on the parameters in the list:

name tableName

The shared memory segment has a small write-once symbol table that is used to locate individual ctables and other objects in shared mem.

file fileName (required)

Multiple tables can be mapped in the same file, distinguished by the ctable name or the name provided in the "name" option.

size bytes

Used to create the initial size of the file, or if it's already mapped it checks if it's at least this big. The size is assumed to be measured in bytes, unless a suffix such as "K", "M", or "G" is present to indicate kilobytes, megabytes, or gigabytes, respectively. If this parameter is not specified then the default size of 4M is assumed.

flags flagList

The only shared memory flags implemented are sync/nosync (default nosync) and core/nocore (default core).

panic boolean

Allow the handling of some types of shared-memory exhaustion errors to be treated as fatal panics (true) or errors (false) that can be caught. (default true)

minfree bytes

Minimum free space allowed in shared memory. If there is less than this many bytes free, an error will be thrown. Default is 10% of the table size, up to 8M.

create tableName reader list

The list provided is collected from the master table (already opened in another process) through the attach command (below).

This attaches to an existing shared memory segment based on the information in the list, then searches for the reader cycle tagged by the process ID provided to attach, and creates a reader-mode ctable. This table contains a pointer to the master ctable in shared memory, data copied from the master, and other bookkeeping elements.

New speed table methods:

attach pid

Only valid for a master shared table, Creates a structure in the shared memory segment for the process pid, and returns a list of parameters that will be passed to the reader to tell it how to connect to the shared memory segment.

getprop
getprop propName ?propName?...

With no names, returns a key-value list of properties of the ctable, whatever is needed for reflection.

Currently the properties available are type, extension, and key. These are needed for reflection on shared tables.

With names, returns a list of only those properties.

share
share command ?options?

The "share" extension actually stands apart from the ctable extension and Tcl. It provides the shared memory segments and handles memory allocation from the segments. It's unlikely to be useful to use the explicit share form except in internal debugging.

The following commands are meaningful for ctable shares:

names

Returns a list of named objects in the share. These are not necessarily ctables, they may be string variables or objects created by other libraries.

set name string ?name? ?string? (master only)

Sets a shared string variable, for passing additional environment or context to readers. Multiple variables can be set in a single command.

get name

Gets the value of a single shared string variable that was defined with "set".

multiget name ?name?

Gets the values of multiple shared string variables that were defined with "set". Returns a list of each of the values.

info

Returns some internal information about the share in a key-value list. The data includes size, flags, name, whether you're the creator (master), and filename.

pools

Returns a list of information for fixed size memory pools in the shared segment. There will be at least two pools, the garbage pool (containing elements that have been freed but are still in use by at least one reader) and one pool the size of a ctable row is set up for each ctable. For each pool it will return the size of elements that the pool will manage, how many elements in each chunk of elements allocated at once, the total number of chunks allocated, and the number of free elements: {element_size elements_per_chunk chunks free_elements}

pool element_size elements_per_chunk max_chunks (master only)

Creates a pool for objects of element_size bytes, allocated in elements_per_chunk chunks, up to a maximum of max_chunks. If max_chunks is zero it will extend to pool to the limit of the shared segment if necessary.

free

Returns an estimate of the free memory in the share. This estimate does not count pools, and is calculated by iterating over the free list and the garbage collection pool and adding up the total size of each free or dirty block.

STAPI support for shared memory tables:

package require st_shared
::stapi::connect shared://port/table ?options?

Options:

Connect to a ctable on localhost as a ctable_server client, and then open a parallel shared memory client for the same ctable. These connections are hidden behind a STAPI wrapper, so all ctable commands can be used: shared memory will be used for read-only "search" commands, and the ctable_server TCP connection will be used for all other commands.

Server example:

top_brands_nokey_m create m master file sharefile.dat
   [...]
::ctable_server::register sttp://*:1616/master m
   [...]
if !$tcl_interactive { ::ctable_server::serverwait }

This is just like a normal ctable server, except that the ctable itself is a shared memory master table.

Client example:

package require st_shared
   [...]

# Connect to the server, using the shared ctable
# extension created by the server in the directory
# "build". Returns a stapi object.
set r [::stapi::connect shared://1616/master -build build]

# This command is performed using shared memory.
$r search -compare {{= name phred}} -key k -code {puts $k}

# This command is performed using TCP
$r set fred $row

# Close the reader and disconnect from the server.
$r destroy