grefer(1) | User Commands | grefer(1) |
grefer - process bibliographic references for groff
grefer |
[-bCenPRS] [-a n] [-B field.macro] [-c fields] [-f n] [-i fields] [-k field] [-l range-expression] [-p database-file] [-s fields] [-t n] [file ...] |
grefer |
--help |
grefer |
-v |
grefer |
--version |
The GNU implementation of refer is part of the groff(1) document formatting system. grefer is a gtroff(1) preprocessor that prepares bibilographic citations by looking up keywords specified in a roff(7) input document, obviating the need to type such annotations, and permitting the citation style in formatted output to be altered independently and systematically. It copies the contents of each file to the standard output stream, except that it interprets lines between .[ and .] as citations to be translated into groff input, and lines between .R1 and .R2 as instructions regarding how citations are to be processed. Normally, grefer is not executed directly by the user, but invoked by specifying the -R option to groff(1). If no file operands are given on the command line, or if file is “-”, the standard input stream is read.
Each citation specifies a reference. The citation can specify a reference that is contained in a bibliographic database by giving a set of keywords that only that reference contains. Alternatively it can specify a reference by supplying a database record in the citation. A combination of these alternatives is also possible.
For each citation, grefer can produce a mark in the text. This mark consists of some label which can be separated from the text and from other labels in various ways. For each reference it also outputs groff(7) language commands that can be used by a macro package to produce a formatted reference for each citation. The output of grefer must therefore be processed using a suitable macro package, such as me, mm, mom, or ms. The commands to format a citation's reference can be output immediately after the citation, or the references may be accumulated, and the commands output at some later point. If the references are accumulated, then multiple citations of the same reference will produce a single formatted reference.
The interpretation of lines between .R1 and .R2 as prepreocessor commands is a feature of GNU refer. Documents making use of this feature can still be processed by AT&T refer just by adding the lines
.de R1 .ig R2 ..
grefer generates .lf requests so that file names and line numbers in messages produced by commands that read grefer output will be correct; it also interprets lines beginning with .lf so that file names and line numbers in the messages and .lf lines that it produces will be accurate even if the input has been preprocessed by a command such as gsoelim(1).
The bibliographic database is a text file consisting of records separated by one or more blank lines. Within each record fields start with a % at the beginning of a line. Each field has a one character name that immediately follows the %. It is best to use only upper and lower case letters for the names of fields. The name of the field should be followed by exactly one space, and then by the contents of the field. Empty fields are ignored. The conventional meaning of each field is as follows:
For all fields except %A and %E, if there is more than one occurrence of a particular field in a record, only the last such field will be used.
If accent strings are used, they should follow the character to be accented. This means that an ms document must call the .AM macro when it initializes. Accent strings should not be quoted: use one \ rather than two. Accent strings are an obsolescent feature of the me and ms macro packages; modern documents should use groff special character escape sequences instead; see groff_char(7).
Citations have a characteristic format.
.[opening-text flags keywords fields .]closing-text
The opening-text, closing-text, and flags components are optional. Only one of the keywords and fields components need be specified.
The keywords component says to search the bibliographic databases for a reference that contains all the words in keywords. It is an error if more than one reference is found.
The fields components specifies additional fields to
replace or supplement those specified in the reference. When references are
being accumulated and the keywords component is non-empty, then
additional fields should be specified only on the first occasion that a
particular reference is cited, and will apply to all citations of that
reference.
The opening-text and closing-text components specify strings to be used to bracket the label instead of those in the bracket-label command. If either of these components is non-empty, the strings specified in the bracket-label command will not be used; this behavior can be altered using the [ and ] flags. Leading and trailing spaces are significant for these components.
The flags component is a list of non-alphanumeric characters each of which modifies the treatment of this particular citation. AT&T refer will treat these flags as part of the keywords and so will ignore them since they are non-alphanumeric. The following flags are currently recognized.
An advantage of using the [ and ] flags rather than including the brackets in opening-text and closing-text is that you can change the style of bracket used in the document just by changing the bracket-label command. Another is that sorting and merging of citations will not necessarily be inhibited if the flags are used.
If a label is to be inserted into the text, it will be attached to the line preceding the .[ line. If there is no such line, then an extra line will be inserted before the .[ line and a warning will be given.
There is no special notation for making a citation to multiple references. Just use a sequence of citations, one for each reference. Don't put anything between the citations. The labels for all the citations will be attached to the line preceding the first citation. The labels may also be sorted or merged. See the description of the <> label expression, and of the sort-adjacent-labels and abbreviate-label-ranges commands. A label will not be merged if its citation has a non-empty opening-text or closing-text. However, the labels for a citation using the ] flag and without any closing-text immediately followed by a citation using the [ flag and without any opening-text may be sorted and merged even though the first citation's opening-text or the second citation's closing-text is non-empty. (If you wish to prevent this, use the dummy character escape sequence \& as the first citation's closing-text.)
Commands are contained between lines starting with .R1 and .R2. Recognition of these lines can be prevented by the -R option. When a .R1 line is recognized any accumulated references are flushed out. Neither .R1 nor .R2 lines, nor anything between them, is output.
Commands are separated by newlines or semicolons. A number sign (#) introduces a comment that extends to the end of the line, but does not conceal the newline. Each command is broken up into words. Words are separated by spaces or tabs. A word that begins with a (neutral) double quote (") extends to the next double quote that is not followed by another double quote. If there is no such double quote, the word extends to the end of the line. Pairs of double quotes in a word beginning with a double quote collapse to one double quote. Neither a number sign nor a semicolon is recognized inside double quotes. A line can be continued by ending it with a backslash “\”; this works everywhere except after a number sign.
Each command name that is marked with * has an associated negative command no-name that undoes the effect of name. For example, the no-sort command specifies that references should not be sorted. The negative commands take no arguments.
In the following description each argument must be a single word; field is used for a single upper or lower case letter naming a field; fields is used for a sequence of such letters; m and n are used for a non-negative numbers; string is used for an arbitrary string; file is used for the name of a file.
.[ $LIST$ .]
If string is omitted, it will default to AP; if field is also omitted it will default to X. Only one field can be an annotation.
bracket-label \*([. \*(.] ", "
date-as-label D.+yD.y%a*D.-y
et-al " et al" 2 3
join-authors " and " ", " ", and "
Label expressions can be evaluated both normally and tentatively. The result of normal evaluation is used for output. The result of tentative evaluation, called the tentative label, is used to gather the information that normal evaluation needs to disambiguate the label. Label expressions specified by the date-as-label and short-label commands are not evaluated tentatively. Normal and tentative evaluation are the same for all types of expression other than @, *, and % expressions. The description below applies to normal evaluation, except where otherwise specified.
The above expressions are listed in order of precedence (highest first); & and | have the same precedence.
Each reference starts with a call to the macro ]-. The string [F will be defined to be the label for this reference, unless the no-label-in-reference command has been given. There then follows a series of string definitions, one for each field: string [X corresponds to field X. The register [P is set to 1 if the P field contains a range of pages. The [T, [A and [O registers are set to 1 according as the T, A and O fields end with any of .?! (an end-of-sentence character). The [E register will be set to 1 if the [E string contains more than one name. The reference is followed by a call to the ][ macro. The first argument to this macro gives a number representing the type of the reference. If a reference contains a J field, it will be classified as type 1, otherwise if it contains a B field, it will be type 3, otherwise if it contains a G or R field it will be type 4, otherwise if it contains an I field it will be type 2, otherwise it will be type 0. The second argument is a symbolic name for the type: other, journal-article, book, article-in-book, or tech-report. Groups of references that have been accumulated or are produced by the bibliography command are preceded by a call to the ]< macro and followed by a call to the ]> macro.
--help displays a usage message, while -v and --version show version information; all exit afterward.
Other options are equivalent to grefer commands.
The B option has command equivalents with the addition that the file names specified on the command line are processed as if they were arguments to the bibliography command instead of in the normal way.
grefer uses temporary files. See the groff(1) man page for details of where such files are created.
In label expressions, <> expressions are ignored inside .char expressions.
We can illustrate the operation of grefer with a sample bibliographic database containing one entry and a simple roff document to cite that entry.
$ cat > my-db-file %A Daniel P.\& Friedman %A Matthias Felleisen %C Cambridge, Massachusetts %D 1996 %I The MIT Press %T The Little Schemer, Fourth Edition $ refer -p my-db-file Read the book .[ friedman .] on your summer vacation. <Control+D> .lf 1 - Read the book\*([.1\*(.] .ds [F 1 .]- .ds [A Daniel P. Friedman and Matthias Felleisen .ds [C Cambridge, Massachusetts .ds [D 1996 .ds [I The MIT Press .ds [T The Little Schemer, Fourth Edition .nr [T 0 .nr [A 0 .][ 2 book .lf 5 - on your summer vacation.
The foregoing shows us that grefer (a) produces a label “1”; (b) brackets that label with interpolations of the “[.” and “.]” strings; (c) calls a macro “]-”; (d) defines strings and registers containing the label and bibliographic data for the reference; (e) calls a macro “][”; and (f) uses the lf request to restore the line numbers of the original input. As discussed in subsection “Macro interface” above, it is up to the document or a macro package to employ and format this information usefully. Let us see how we might turn groff_ms(7) to this task.
$ REFER=my-db-file groff -R -ms .LP Read the book .[ friedman .] on your summer vacation. Commentary is available.\*{*\*} .FS \*{*\*} Space reserved for penetrating insight. .FE
ms's automatic footnote numbering mechanism is not aware of grefer's label numbering, so we have manually specified a (superscripted) symbolic footnote for our non-bibliographic aside.
“Some Applications of Inverted Indexes on the Unix System”, by M. E. Lesk, 1978, AT&T Bell Laboratories Computing Science Technical Report No. 69.
gindxbib(1), glookbib(1), lkbib(1)
2 July 2023 | groff 1.23.0 |