Every gismu has from two to five rafsi, each of a different form, but each such rafsi represents only one gismu. It is valid to use any of the rafsi forms in building lujvo – whichever the reader or listener will most easily understand, or whichever is most pleasing – subject to the rules of lujvo making. There is a scoring algorithm which is intended to determine which of the possible and legal lujvo forms will be the standard dictionary form (see Section 4.12).
Each gismu always has at least two rafsi forms; one is the gismu itself (used only at the end of a lujvo), and one is the gismu without its final vowel (used only at the beginning or middle of a lujvo). These forms are represented as CVC/CV or CCVCV (called “the 5-letter rafsi”), and CVC/C or CCVC (called “the 4-letter rafsi”) respectively. The dashes in these rafsi form representations show where other rafsi may be attached to form a valid lujvo. When lujvo are formed only from 4-letter and 5-letter rafsi, known collectively as “long rafsi”, they are called “unreduced lujvo”.
Some examples of unreduced lujvo forms are:
In addition to these two forms, each gismu may have up to three additional short rafsi, three letters long. All short rafsi have one of the forms CVC, CCV, or CVV. The total number of rafsi forms that are assigned to a gismu depends on how useful the gismu is, or is presumed to be, in making lujvo, when compared to other gismu that could be assigned the rafsi.
For example, zmadu (“more than”) has the two short rafsi zma and mau (in addition to its unreduced rafsi zmad and zmadu), because a vast number of lujvo have been created based on zmadu, corresponding in general to English comparative adjectives ending in “-er” such as “whiter” (Lojban labmau). On the other hand, bakri (“chalk”) has no short rafsi and few lujvo.
There are at most one CVC-form, one CCV-form, and one CVV-form rafsi per gismu. In fact, only a tiny handful of gismu have both a CCV-form and a CVV-form rafsi assigned, and still fewer have all three forms of short rafsi. However, gismu with both a CVC-form and another short rafsi are fairly common, partly because more possible CVC-form rafsi exist. Yet CVC-form rafsi, even though they are fairly easy to remember, cannot be used at the end of a lujvo (because lujvo must end in vowels), so justifying the assignment of an additional short rafsi to many gismu.
The intention was to use the available “rafsi space”- the set of all possible short rafsi forms – in the most efficient way possible; the goal is to make the most-used lujvo as short as possible (thus maximizing the use of short rafsi), while keeping the rafsi very recognizable to anyone who knows the source gismu. For this reason, the letters in a rafsi have always been chosen from among the five letters of the corresponding gismu. As a result, there are a limited set of short rafsi available for assignment to each gismu. At most seven possible short rafsi are available for consideration (of which at most three can be used, as explained above).
Here are the only short rafsi forms that can possibly exist for gismu of the form CVC/CV, like sakli. The digits in the second column represent the gismu letters used to form the rafsi.
CVC | 123 | -sak- |
CVC | 124 | -sal- |
CVV | 12'5 | -sa'i- |
CVV | 125 | -sai- |
CCV | 345 | -kli- |
CCV | 132 | -ska- |
(The only actual short rafsi for sakli is -sal-.)
For gismu of the form CCVCV, like blaci, the only short rafsi forms that can exist are:
CVC | 134 | -bac- |
CVC | 234 | -lac- |
CVV | 13'5 | -ba'i- |
CVV | 135 | -bai- |
CVV | 23'5 | -la'i- |
CVV | 235 | -lai- |
CCV | 123 | -bla- |
(In fact, blaci has none of these short rafsi; they are all assigned to other gismu. Lojban speakers are not free to reassign any of the rafsi; the tables shown here are to help understand how the rafsi were chosen in the first place.)
There are a few restrictions: a CVV-form rafsi without an apostrophe cannot exist unless the vowels make up one of the four diphthongs ai, ei, oi, or au; and a CCV-form rafsi is possible only if the two consonants form a permissible initial consonant pair (see Section 4.1). Thus mamta, which has the same form as salci, can only have mam, mat, and ma'a as possible rafsi: in fact, only mam is assigned to it.
Some cmavo also have associated rafsi, usually CVC-form. For example, the ten common numerical digits, which are all CV form cmavo, each have a CVC-form rafsi formed by adding a consonant to the cmavo. Most cmavo that have rafsi are ones used in composing tanru.
The term for a lujvo made up solely of short rafsi is “fully reduced lujvo”. Here are some examples of fully reduced lujvo:
In addition, the unreduced forms in Example 4.27 and Example 4.28 may be fully reduced to:
As noted above, CVC-form rafsi cannot appear as the final rafsi in a lujvo, because all lujvo must end with one or two vowels. As a brivla, a lujvo must also contain a consonant cluster within the first five letters – this ensures that they cannot be mistaken for compound cmavo. Of course, all lujvo have at least six letters since they have two or more rafsi, each at least three letters long; hence they cannot be confused with gismu.
When attaching two rafsi together, it may be necessary to insert a hyphen letter. In Lojban, the term “hyphen” always refers to a letter, either the vowel y or one of the consonants r and n. (The letter l can also be a hyphen, but is not used as one in lujvo.)
The y-hyphen is used after a CVC-form rafsi when joining it with the following rafsi could result in an impermissible consonant pair, or when the resulting lujvo could fall apart into two or more words (either cmavo or gismu).
Thus, the tanru pante tavla (“protest talk”) cannot produce the lujvo patta'a, because tt is not a permissible consonant pair; the lujvo must be patyta'a. Similarly, the tanru mudri siclu (“wooden whistle”) cannot form the lujvo mudsiclu; instead, mudysiclu must be used. (Remember that y is not counted in determining whether the first five letters of a brivla contain a consonant cluster: this is why.)
The y-hyphen is also used to attach a 4-letter rafsi, formed by dropping the final vowel of a gismu, to the following rafsi. (This procedure was shown, but not explained, in Example 4.27 to Example 4.31.)
The lujvo forms zunlyjamfu, zunlyjma, zuljamfu, and zuljma are all legitimate and equivalent forms made from the tanru zunle jamfu (“left foot”). Of these, zuljma is the preferred one since it is the shortest; it thus is likely to be the form listed in a Lojban dictionary.
The r-hyphen and its close relative, the n-hyphen, are used in lujvo only after CVV-form rafsi. A hyphen is always required in a two-part lujvo of the form CVV-CVV, since otherwise there would be no consonant cluster.
An r-hyphen or n-hyphen is also required after the CVV-form rafsi of any lujvo of the form CVV-CVC/CV or CVV-CCVCV since it would otherwise fall apart into a CVV-form cmavo and a gismu. In any lujvo with more than two parts, a CVV-form rafsi in the initial position must always be followed by a hyphen. If the hyphen were to be omitted, the supposed lujvo could be broken into smaller words without the hyphen: because the CVV-form rafsi would be interpreted as a cmavo, and the remainder of the word as a valid lujvo that is one rafsi shorter.
An n-hyphen is only used in place of an r-hyphen when the following rafsi begins with r. For example, the tanru rokci renro (“rock throw”) cannot be expressed as ro'ire'o (which breaks up into two cmavo), nor can it be ro'irre'o (which has an impermissible double consonant); the n-hyphen is required, and the correct form of the hyphenated lujvo is ro'inre'o. The same lujvo could also be expressed without hyphenation as rokre'o.
There is also a different way of building lujvo, or rather phrases which are grammatically and semantically equivalent to lujvo. You can make a phrase containing any desired words, joining each pair of them with the special cmavo zei. Thus,
is the exact equivalent of brivla (but not necessarily the same as the underlying tanru bridi valsi, which could have other meanings.) Using zei is the only way to get a cmavo lacking a rafsi, a cmevla, or a fu'ivla into a lujvo:
Example 4.41 is particularly noteworthy because the phrase that would be produced by removing the zeis from it doesn't end with a brivla, and in fact is not even grammatical. As written, the example is a tanru with two components, but by adding a zei between by. and livgyterbilma to produce
the whole phrase would become a single lujvo. The longer lujvo of Example 4.43 may be preferable, because its place structure can be built from that of bilma, whereas the place structure of a lujvo without a brivla must be constructed ad hoc.
Note that rafsi may not be used in zei phrases, because they are not words. CVV rafsi look like words (specifically cmavo) but there can be no confusion between the two uses of the same letters, because cmavo appear only as separate words or in compound cmavo (which are really just a notation for writing separate but closely related words as if they were one); rafsi appear only as parts of lujvo.