scheduler - zmailer transport queue scheduler daemon
scheduler [ -divFHnQSVW ] [ -f configfile ] [ -E newentsmax ] [ -L logfile ] [ -l statisticslog ] [ -N transp- maxfno ] [ -P postoffice ] [ -p channel/host-pair ] [ -R maxforkfreq ] [ -q rendezvous ]
The scheduler daemon manages the delivery processing of messages in ZMailer.
The router(8) creates message control files in the POSTOFFICE/transport directory. These refer to the original message files in the POSTOFFICE/queue directory.
The scheduler reads each message control file from POSTOFFICE/transport, translates the contained message and destination information into internal data structures, and unlinks the message control file.
Based on scheduling, priority, and execution information read from a configuration file, the scheduler arranges to execute Transport Agents relevant to the queued messages.
At the time scheduled for a particular transport agent invocation, the scheduler will start a transport agent (or use one from idle-pool), and tell it one by one which message control files to process. When all the destination addresses in a message have been processed, the scheduler performs error reporting tasks if any, and then deletes the message control file in POSTOFFICE/transport and the original message file in POSTOFFICE/queue. All message delivery is actually performed by Transport Agents, which are declared in a configuration file for the scheduler. Each transport agent is executed with the same current directory as the scheduler. scheduler-trans- porter interaction protocol is described latter in this man-page.
The standard output of each transport agent are destination address delivery reports; either successful delivery, unsuccessful delivery, or deferral of the address. Each report uses byte offsets in the message control file to refer to the address. Reports may also include a comment line which will be displayed in the scheduler's own reports.
Two types of reports are produced:
1. |
Error messages caused by unsuccessful delivery of a message are appended to its mes- sage control file. Occasionally, for example, when all addresses have been processed, the scheduler generates an error message to the error return address of the message (usually the original sender). |
|
2. |
The scheduler binds itself to a well-known TCP/IP port (MAILQ, TCP port 174) on startup. Any connections to this port are processed synchronously in the scheduler at points in the execution where the state is internally consistent. The scheduler simply dumps its internal state in a terse format to the TCP stream. It is expected that the client program will reconstruct the data structures sufficiently to give a user a good idea of what the scheduler thinks the world looks like. The mailq(1) program serves this purpose. |
|
Invoking scheduler without any argument will start it as a daemon.
-d -E newentsmax |
run as a daemon, usually used after -v to log daemon activity in great detail. when globbing new tasks from the directory, pick only first ``newentsmax'' of them, and leave rest for a new scan run. |
|
-f configfile -F -H -HH |
overrides the default configuration file MAILSHARE/scheduler.cf. Freeze -- don't actually run anything, just do queue scanning. (For debug purposes..) Use multi-level hashing at the spool directories. This will efficiently reduce the lengths of the scans at the directories to find some arbitary file in them. One `H' means "single level hashing", two `HH's mean "dual level hashing". ``Hash'' is directory which name is single upper case alphabet (A-Z). When existing, ZENV variable SCHEDULERDIRHASH overrides the `H' option. |
|
-i -L logfile -l statisticslog |
run interactively, i.e., not as a daemon. overrides the default log file location LOGDIR/scheduler. starts the appending of delivery statistics information (ASCII form) into given file. No default value. |
|
-M [1|2] |
Version of the mailq protocol this server runs; essentially a test option, as existence of PARAMauthfile=".." assignment at the scheduler.conf file turns the protocol into version 2. |
-N transmaxfno |
sets how many filehandles are allocated for the scheduler's started children (if the system has adjustable resources.) |
|
-n |
Toggles configuration flag `default_full_content', which defines what will be DSN RET parameter assumed value in case the originator didn't supply that parameter. Default behaviour is similar to RET=FULL, while usage of this option is equivalent of RET=HDRS. This option does not override originator supplied DSN RET parameter value. |
|
-p channel/host |
A debug-type option for running selectively some thread under a single instance of the scheduler. Use this with options: -v |
|
-P postoffice -q rendezvous |
specifies an alternate POSTOFFICE directory. on machines without TCP/IP networking, the rendezvous between scheduler and mailq(1) is done using a well-known named pipe. This option overrides the default location for this special file, either RENDEZVOUS or /usr/tmp/.mailq.text. |
|
-Q |
The ``Q''-mode, don't output the old style data into the queue querier, only the new-style one. |
|
-S |
Synchronous startup mode, scans all jobs at the directory before starting even the first transporter. |
|
-v -W |
be verbose about activity, and do not detach as a daemon. be used in conjunction with -v to delay verbose logging start until after all the files have |
|
-V |
print version message and run interactively. |
The scheduler configuration file consists of a set of clauses. Each clause is selected by the pattern it starts with. The patterns for the clauses are matched, in sequence, with the channel/host string for each recipient address. When a clause pattern matches an address, the parameters set in the clause will be applied to the scheduler's pro- cessing of that address. If the clause specifies a command, the clause pattern matching sequence is terminated. This is a clause:
local/* |
|||||
A clause consists of: |
interval=10s |
- A selection pattern (in shell style) that is matched against the channel/host string for an
address.
- 0 or more variable assignments or keywords (described below).
There are several possible PARAM-assignments starting at column 0, more of them below.
If the selection pattern does not contain a '/', it is assumed to be a channel pattern and the host pattern is assumed to be the wildcard '*'.
The components of a clause are separated by whitespace. The pattern introducing a clause must start in the first column of a line, and the variable assignments or keywords inside a clause must not start in the first column of a line. This means a clause may be written both compactly all on one line, or spread out with an assignment or key- word per line.
If the clause is empty (i.e., consists only of a pattern), then the contents of the next non-empty clause will be used. The typical configuration file will contain the following clauses:
- a clause matching all addresses (using the pattern */*) that sets up default values. - a clause matching the local delivery channel (usually local). - a clause matching the deferred delivery channel (usually hold). |
- a clause matching the error reporting channel (usually error). - clauses specific to the other channels known by the router, for example, smtp and uucp. |
The actual names of these channels are completely controlled by the router configuration file.
Empty lines, and lines whose first non-whitespace character is '#', are ignored.
Variable values may be unquoted words or values or doublequoted strings. Intervals (delta time) are specified using a concatenation of: numbers suffixed with 's', 'm', 'h', or 'd' modifiers designating the number as a second, minute, hour, or day value. For example: 1h5m20s.
The known variables and keywords, and their typical values and semantics are:
interval (1m) |
specifies the primary retry interval, which determines how frequently a transport agent should be scheduled for an address. The value is a delta time specification. This value, and the retries value mentioned below, are combined to determine the interval between each retry attempt. |
|
idlemax (3x interval) |
When a transport agent runs out of jobs, they are moved to ``idle pool'', and it a TA spends more than idlemax time in there, it is terminated. |
|
expiry (3d) |
specifies the maximum age of an address in the scheduler queue before a repeatedly deferred address is bounced with an expiration error. The actual report is produced when all addresses have been processed. |
retries (1 1 2 3 5 8 13 21 34)
specifies the retry interval policy of the scheduler for an address. The value must be a sequence of positive integers, these being multiples of the primary interval before a retry is scheduled. The scheduler starts by going through the sequence as an address is repeatedly deferred. When the end of the sequence is reached, the scheduler will jump into the sequence at a random spot and continue towards the end. This allows various retry strate- gies to be specified easily: brute force (or "jackhammer"): |
maxta (0) |
if retrying an address would cause the number of simultaneously active transport agents to exceed the specified value, the retry is postponed. The check is repeated frequently so the address may be retried as soon as possible after the scheduled retry interval. If the value is 0, a value of 1000 is used instead. Keep in mind that all running transport agents will keep open two pipe(2) file-handle, and thus system-wide limits may force lower maximum, than 1000. On a system with max 256 open files, this would most likely succeed at 120. |
|
maxchannel (0) |
if retrying an address would cause the number of simultaneously active transport agents processing mail for the same channel to exceed the specified value, the retry is postponed. The check is repeated frequently so the address may be retried as soon as possible after the scheduled retry interval. If the value is 0, a value of 1000 is used instead. |
|
maxring (0) |
Recipients are groupped into ``threads'', and similar threads are groupped into ``thread- rings'', where same transport agent can be switched over from one recipient to another. This defines how many transport agents can be running at any time at the ring. |
|
maxthr (1) |
This limits the number of parallel transport agents within each thread; that is, using higher value than default ``1'' will allow running more than one TA for the jobs at the thread. Do note that running more than one TA in parallel may also require lowering OVERFEED value. (E.g. having a queue of 30 messages will not benefit from more TAs, unless they all get something to process. Having OVERFEED per default at 150 will essentially feed |
overfeed (150) |
whole queue to one TA, others are not getting any.) This tells how many job specifiers to feed to the TA when the TA process state is ``STUFFING'' Because the scheduler is a bit sluggish to spin around to spot active TAs, it does make sense to feed more than one task to a TA, and then wait for the results. |
|
skew (5) |
is the maximum number of retries before the retry time is aligned to a standard boundary (seconds since epoch, modulo primary interval). The lower this number (1 is lowest), the faster the alignment is done. The purpose of this alignment is to ensure that eventually a single transport agent invokation will be able to process destination addresses that arrived randomly to the scheduler. |
|
user (root) |
is the user id of a transport agent processing the address. The value is either numeric (a uid) or an account name. |
|
group (daemon) |
is the group id of a transport agent processing the address. The value is either numeric (a |
command (smtp -srl ${LOGDIR}/smtp $host)
is the command line used to start a transport agent to process the address. The program pathname is specified relative to the MAILBIN/ta directory. The string "$channel" is replaced by the current matched channel, and "$host" is replaced by the current matched host, from the destination address, and "${LOGDIR}" substitutes ZENV variable LOGDIR value there. It is strongly recommended that the $host is not to be used on a command definition, as it limits the usability of idled transporter. |
queueonly |
a clause with queueonly flag does not auto-start at the arrival of a message, instead it must be started by means of smtpserver(8) command ETRN thru an SMTP connection. To have message expiration working, following additional entries are suggested: interval=1h retries="24" |
For example, this is a complete configuration file:
The first clause (*/*) |
# Default values sets up default values for all addresses. There is no command specification, |
so clause |
matching will continue after address have picked up the parameters set here.
The third clause (error) has an implicit host wildcard of '*', so it would match the same as specifying error/* would have.
The fifth clause (smtp/*.toronto.edu) has no further components so it selects the components of the following non- empty clause (the sixth).
Both the fifth and sixth clauses are specific to address destinations within the TORONTO.EDU and UTORONTO.CA organization (the two are parallel domains). At most 10 deliveries to the smtp channel may be concurrently active, and at most 2 for all possible hosts within TORONTO.EDU. If $host is mentioned in the com- mand specification, the transport agent will only be told about the message control files that indicate SMTP
delivery to a particular host. The actual host is picked at random from the current choices, to avoid systematic errors leading to a deadlock of any queue.
The scheduler can assign several of its internal parameters by having variable assignments beginning at column 0, and beginning with "PARAM" text:
PARAMmailqpath = "UNIX:/path/to/pf_unix/mailq/socket"
PARAMmailqpath = "TCP:mailq"
PARAMmailqpath = "TCP:174"
These define two different type of possible socket addresses for "mailq" protocol; |
PARAMautfile = "/path/to/scheduler.auth"
Location of MAILQv2 autentication control file
PARAMglobal-report-interval = 15m
Interval by which all permanent reports accumulated into a message are reported by;
sends out early reports of delivery failures, and does not force to wait for maximum queue
timeout in case the message has more than once recipient.
A message control file contains all the information needed by delivery programs like scheduler and the transport agents. It is a terse presentation of the router's decisions, along with some useful reference information.
The message control file consists of a number of fields. All fields start in the first column (i.e., at the beginning of the file or just after a newline), and most fields extend to the end of line. The one exception is the message header field which extends till a double-newline terminator. For all but this message header field, the second column is reserved for a tag byte. This position is used to lock the field and to indicate the status of past processing of the field. For example, the success or failure of delivery to a recipient address is indicated by a '+' or means the field has not been processed, or that processing has been deferred. A '~' indicates the field is locked because some transport agent is currently processing delivery for the address. The known field names and tags are defined in <mail.h>. For all the recipient addresses, there is 6 characters space for transport-process process-id so that a quickly restarted scheduler will not do double-delivery on some slowly running transporter.
The following fields are mandatory:
i |
the name of the message file in the POSTOFFICE/queue directory and of the message con- trol file in the POSTOFFICE/transport directory. 'i' is a mnemonic for 'inode' since mes- sage file names must be unique on the system and therefore are named by their inode num- ber. |
|
o |
the byte offset of the message body in the original message. |
The following fields will frequently exist:
e l |
is the return address for error messages, in a form that can be put in a To: header line. is a string identifying this message in log entries. Typically the message id of the message |
|
n N |
is the delivery-status-notification environment id data for the message. are the delivery-status-notification entries for each recipient address. |
The following fields will occasionally appear:
x |
is the log identification string (usually a message id) of an obsoleted message. The sched- uler will purge any such identified message after running sanity checks. |
|
v |
is the name of a file that the delivery system can appended log information to. This would appear as the result of running sendmail -v or Mail -v. Since all programs need to refer to the same file, on mail clusters it is recommended that this be a relative path naming a file within the POSTOFFICE directory hierarchy. |
A message control file must contain at least one address "group". Each group consists of a sender address field, one or more recipient address fields, and a message header that goes along with these. An address field is a string containing a space-separated 4-tuple (quad) as follows:
channel host |
is the name of the delivery channel for this address. This must be a contiguous word. is the name of the next destination host for this address. This too must be a contiguous |
user is the address to be handed to the destination host for further delivery. This string may con-
tain space. It is distinguishable because the last component cannot contain spaces.
privilege is the numeric uid representing the privileges associated with this address.
The address group components are:
s r R |
is a sender address field. is a recipient address field. is an XOR recipient address field. The first element is a tag (a class number) to identify col- lections of recipient addresses which are equivalent (and therefore mutually exclusive). This is followed by an address field as described above. |
|
N m |
are the delivery-status-notification parameters for the previous recipient. is the message header for this address group. |
After one or more of these address groups, the error messages for addresses are appended to the message con- trol file. This is done by the scheduler as it receives error reports from transport agents.
d |
is a diagnostic field, consisting of a message control file byte offset indicating an address, |
For example, this is a typical message control file (it is a snapshot taken while a transport agent was running):
i 15582 |
||
m |
The transport agent interface follows master-slave -model, where the TA informs the scheduler that it is ready for the work, and then the scheduler sends it one job description, and awaits for diagnistics. Once the job is finished, the TA notifies the scheduler, that it is ready for a new job.
A short sample session looks like this:
(start the transport agent) Normal diagnostic output is of the form: |
id/offset\tnotarydata\tstatus message
where id is the inode number of the message file, offset is a byte offset within its control file where the address being reported on is kept, notarydata is a Ctrl-A separated triplet is delivery-status-notification information for the message, status is one of ok, error, deferred, or retryat, and the message is descriptive text associated with the report. The text is terminated by a linefeed. Any other format (as might be produced by subprocesses) is passed to standard output for logging in the scheduler log. The retryat response will assume the first word of the text is a numeric parameter, either an incremental time in seconds if prefixed by +, or otherwise an absolute time in sec- onds since epoch.
The exit status is a code from <sysexits.h>.
The statistics log reports condenced performance oriented information in following format:
timestamp fileid dt1 dt2 state $channel/$host
812876190 where the fields are: |
90401-2 |
0 |
5 |
ok usenet/- |
timestamp fileid dt1 dt2 state $channel/$host |
The original spoolfile ctime (creation time) stamp in decimal. Spoolfile name after the router has processed it The time difference from spoolfile ctime to scheduler control file creation by the router The time difference from scheduler file ctime to the delivery that is logged on What happened? Values: ok, error, expiry Where/how it was processed |
Upon accepting a TCP connection on the MAILQ port (TCP port 174), the scheduler dumps data to the TCP stream in the following format and immediately closes the connection:
The TCP stream syntax is:
version id\n |
The first line (all bytes up to an ASCII LF character, octal 12) is used to identify the syntax of all bytes following the line terminator LF. The first 8 characters of the first line are "version " as a check that this is indeed a MAILQ port server that has been reached, the remaining bytes are the real data format identification. The data is inter- preted according to that format until the terminating connection close.
Format identifiers should be registered with the author. The only one currently defined is "zmailer 1.0". For that data format, the syntax of the data following the first LF is:
Vertices:\n |
|||
Where: |
\n is an ASCII linefeed a "contiguous string" is a sequence of printable |
For example, here is sample output from connecting to the MAILQ port:
version zmailer 1.0 |
311680:64722; 151,331#128.100.8.4: Null read! (will retry)
312192:63471; 152#128.89.0.93: connect: Connection timed out (will retry)
Channels:
smtp:>311424>311680>312192
Hosts:
scg.toronto.edu:>311424
mv04.ecf.toronto.edu:>311680 |
This is sufficient information to be able to reconstruct the transport queues as seen by the scheduler process, and to find more information than what is shown here by actually looking up the message control and data files referred to.
The MAILQv2 protocol is interactive autenticating protocol, unlike its predecessor (v1). The system begins with a greeting telling version, and then giving one line of challenge to be used in subsequent autentication command:
version zmailer 2.0\n
MAILQ-V2-CHALLENGE: 942665308.906504.3\n
Protocol commands are:
AUTH username hexauthenticator
The "login" of the mailq session. The hexauthenticator is lowercase hexadecimal charac- ter printout of MD5 checksum ran over the catenate of the challenge string (without its end- ing newline character), and the user's password. This algorithm is essentially the same what APOP scheme uses. |
SHOW SNMP Implements 'mailq -QQQ'. SHOW QUEUE SHORT SHOW QUEUE THREADS |
Implements 'mailq -Q'.
SHOW THREAD channel host
Reports details usable to implement mailq-v1 like interface. The details are TAB separated 1) filepath under $POSTOFFICE/transport/ |
ETRN etrn string Supports ETRN-cluster subsystem at smtpserver.
KILL MSG spoolid Unimplemented. KILL THREAD channel host Responses are written out to same socket in POP-like manner: AUTH .... \n |
+OK or -LOGIN FAILED \n SHOW SNMP\n |
.\n
If the output text contains a dot at the beginning of the line, it is duplicated in SMTP (and POP) style.
Of various commands, the "SHOW" class implements multiple text-line outputs, others only "+OK" (or "-ERR...").
For autenticating MAILQv2 protocol users, system can use PARAMauthfile="/path/to/file.auth" PARAM- assignment to identify file containing the data, and with the file to authenticate and parametrize what user can do thru the MAILQv2 port.
# |
in it!) |
# - Enabled attributes (tokens, space separated) |
channel host" |
# ETRN "ETRN etrn_string" |
LOGDIR MAILBIN MAILSHARE PATH POSTOFFICE |
defines location of log files. Example: LOGDIR=/var/log/mail Defines where executable transport-agent binaries exist under $MAILBIN/ta/ directory. Location of scheduler configuration files What PATH environment variable to give to transport-agent subprograms. defines directory where all POSTOFFICE functions are under. |
SCHEDULERDIRHASH
Carries a numeric value of ``1'' or ``2'' (if defined at all), which will then override possible |
SYSLOGFLG |
Existence of ``c'' or ``C'' character in value string enables syslogging of some events as seen by the scheduler. |
|
ZCONFIG |
Gives location of zmailer.conf |
|
SIGHUP: close and reopen the stdout/stderr log file.
SIGTERM: exit cleanly.
SIGQUIT: |
exit cleanly, but at first order transporter childs to shut down, and collect their status |
|
SIGALRM: SIGUSR1: SIGUSR2: |
check pending work. reread the scheduler configuration file. dump state information to the rendezvous file. |
/etc/zmailer.conf |
router(8), mailq(1)
RFC 822 |
The basic Internet email format specification |
Several extended SMTP facilities are implemented: RFC 1341/1521/2045 MIME specification (body, formats) |
This program authored and copyright by:
Rayan Zachariassen <rayan@cs.toronto.edu>
A plenty of changes and several real bugfixes by:
Matti Aarnio <mea@nic.funet.fi>