cross-posted from: https://hexbear.net/post/3947641

Hii, idk why I'm writing this cuz there's probably 0 Lojban speakers here but... idk you may like my special interest writing lol and i just had this urge to start writing this. Also wanna preface this by saying this is mostly composed of my opinions (which I will try to justify ofc) and I'm just some unqualified nerd

Soo, for the last few weeks I've been learning this constructed language called "Lojban". The name Lojban is technically an exonym created by English speakers and some Lojban speakers will use it even while speaking Lojban but a better "native" name is probably "jbobau" or "banjubu'o", the latter for use if someone is pilled on algorithmically deriving Lojban words for countries, cultures, and languages from their ISO codes (I think I am lol, but I think it's also okay to have one of our own words for ourselves and use the algorithmically derived stuff for everything else).

Lojban is a in class of constructed languages called "logical languages" and the usual definition is that they are languages for people to communicate anything a natural language could but they are based on logic. Lojban itself comes out of a previous language project called "Loglan" (yes that's literally just the english words for logic and language mashed together, Lojban's name is similar but has a better justification you will see later lol) after its community became disillusioned with the slow progress of its development and that the specification did not even come close to reflecting its real use (not to mention being intended to just encode English with logical structures which many people were also dissatisfied with). Loglan's creator, James Cooke Brown, was actually threatening people with legal action for trying to improve the language by making dictionaries or specifying grammar that they were already using. So a lot of Loglan's community left the project and formed the "Logical Language Group" (the organization that actually defines what Lojban is) and created Lojban to try to improve on Loglan. For a while there was some kind of complicated legal battle over copyright between the LLG and James Cooke Brown, which the LLG won eventually but not before they had already created a new vocabulary for Lojban from scratch to avoid copyright restrictions.

It's weird but I remember hearing about Lojban as a kid cuz I think I was briefly interested in constructed languages (I even had a little one of my own I can barely remember) and cuz I was allowed unlimited access to the internet basically since I could work a computer on my own lol. I was talking with a friend from here and the topic of languages came up and we talked briefly about if a language that people speak could also be used for programming a computer or something and I vaguely remembered Lojban. So that night I looked into it again, for real this time, and discovered it's actually rly cool and I wanted to make a serious attempt at learning it.

Lojban has several "claims to fame", and not all of them are true sadly and I feel like this isn't made clear enough to newcomers. It's very cool and I love it ofc lol, but there are some things about it that I don't like and which I only learned of after I got a decent grasp on the language and the project.


Quick intro to Lojban

In Lojban, the basic structure of a text or any speech ("utterances") is a series of "bridi". A bridi is like a Lojban sentence but the word "bridi" literally means "predication". Like in classical first-order logics and like many English sentences ("I like you", I have the relationship of "liking" you), a bridi is a assertion of a relationship between some things. Bridi are composed are 2 things: a selbri, (a predicate word or basically like a verb) which has a series of "places" numbered x1 through x[however many places the selbri has], and sumti which are like arguments or parameters that fit in the numbered places of the selbri.

So "I like you" can be rendered in Lojban like this: mi [cu] nelci do

"nelci" is a word being used as a selbri which has the English definition of "x1 is fond of/likes/has a taste for x2 (object/state)". On its own it's not very useful ofc, unless you have sumti to pass to it. I used the words "mi" and "do", which are like pronouns for the speaker and the listener, respectively. Mi and do are considered to be in a category of word called "cmavo" [SCHMA vo], which literally means "structure word" and which are small, very commonly used words used for lots of things like filling in the sumti places of selbri. There are a few other pronoun-like words defined by Lojban but those aren't very useful unless we only want to make statements about ourselves! Btw, there is another cmavo called "cu", it means that the word after it is a selbri in a bridi, the reason it's in brackets is cuz you often don't need to specify it. In this case we don't cuz it's obvious where the selbri begins and ends.

Here's one of the most wild features of Lojban that you get to learn about almost immediately when you start learning about it. There are no nouns :3

What you have to do if you want something with the function of a noun to use as a sumti to a selbri is to use a few different mechanisms that are provided for by the cmavo, most commonly the cmavo "le" and "lo" which are similar to the English word "the". They convert a selbri to a sumti like this:

le gerku cu nelci le panka (The dog likes the park, or extremely literally: all of the at-least-one described as a dog liked/likes/will like (tenses are optional :3) all of the at-least-one described as a park)

Notice in this case, the cu was necessary cuz otherwise it would be ambiguous where the first sumti to nelci ends. Lojban LOVES terminators to grammatical structures. Le has a corresponding terminator like a close parenthesis "ku". I could have also replaced the cu with a ku (the former is pronounced like shu btw, ku is pronounced like any English speaker would pronounce it hehe) and it would have also been valid. Most of these terminators can usually be dropped where dropping them wouldn't cause any ambiguity. It seems complicated at first but you do actually get a feel pretty quickly when you can leave out the terminators.

Also notice that "gerku" and "panka" look very similar to nelci. That's cuz they are the same type of word called a "gismu" :3 and you can tell easily where the gismu in a Lojban utterance are cuz they have a defined, regular form. They always have 5 letters, always start with a consonant and end with a single vowel, always contain exactly one consonant pair, and they are are always stressed on the first syllable (which is a consequence of the fact that ALL Lojban words are stressed on their second-to-last syllable).

Gerku and panku literally mean "is a dog" and "is a park" (they have more places for more info but yeh hehe)


Intro to xorlo and bear goo

Oh, you thought the above section was me explaining a language I love so that you could maybe understand it a little better? Actually, it was all a trick to give you the context to just barely grasp the biggest Lojban struggle session that ever happened enough to understand how fundamentally cooked this """"logical"""" language is from a logical perspective :3 (/j)

Actually, I lied in the last section. "le" and "lo" don't work like that anymore. Lojban underwent a period of rapid development within the LLG from its initial creation until the publication of the official specification of the entire language in a red book called "The Complete Lojban Language" in 1997. Then development was mandatorily frozen for 5 years to see how people used this "baseline" standard language. And since then, the development of the language was left to a different committee of LLG members and Lojban users called the BPFK which was dissolved by LLG in 2018 to form the very similar committee called the LFK (I will spare you their Lojban names :3). Technically, LLG has to approve whatever the BPFK and now the LFK do, but the vibe I get is that most of the LLG members who developed the language either left or mostly checked out of further development of the language. I think the LLG bylaws say they have to meet once a year but...... not a lot has happened since 1997 tbh lol

One of the BPFK's first language reforms (maybe the first actually) was a reform called "xorlo".

You know those meanings of le and lo I mentioned earlier? Something that was very important in baseline Lojban was this concept of those le and lo words (called "gadri") having implicit quantifiers.

Like when I say "le gerku" how do you know how many dogs I'm referring to? The answer, in baseline Lojban, is that le has 2 implicit qualifiers which, if not overridden, are this: "ro (all of) le (what is described as) su'o (at least one) gerku (dog)". So, to be exactly clear, what I said was that one or more dog(s) that I'm thinking of like/liked/will like one or more park(s) that I'm thinking of lol. It can't be zero cuz that affects the hypothetical truth value of the bridi (more on that later) cuz there has to be an actually existing liker and liked in order for that bridi to be true. Another important thing is, why did I use le instead of lo? The answer, also in baseline Lojban, is cuz lo refers to things that actually are something, as in: my use of the selbri after "lo" is not restricted by my intentions. That probably sounds like nonsense lol. Cuz it is unless you know the secret thing that no one told me initially and which a lot of other people never learned about le and lo which is that they also specify sets. Before I tell you more on that, I should tell you that lo also has implicit quantifiers like this: "su'o (at least one) lo (things that actually truly are) ro (all) gerku (dog)". So, on set specifying: "le gerku" is ALL MEMBERS of the set of dog(s) THAT I AM THINKING OF and "lo gerku" is at least ONE member of the set of ALL dog(s) IN THE UNIVERSE (of discourse, maybe, lol)

Here is a very good example of the semantics of that from the red book I mentioned:

Example 6.41.

[ro] le ci gerku cu blabi
[All-of] those-described-as three dogs are-white
The three dogs are white.

Example 6.42.

ci lo [ro] gerku cu blabi
Three-of those-which-are [all] dogs are-white
Three dogs are white.

Okay, cool, cool, seems easy right? Just have to remember those implicit quantifiers doggirl-thumbsup

Wait...... aware Lojban is a logical language. Speaking extremely literally, "lo gerku" is those things which dog, as in things that are dogging. Wtf is a thing that dogs and also.... wtf is a set?

Okay okay, what about the gismu "cribe" which means "is a bear". Imagine: you're in a forest with a friend. You and your friend are trying to find lo cribe (a bear, one or more members of the set of things that are bearing). You're walking along and you come across a pile of bear goo (a bear that is a pile of goo, maybe it's a bear in a late stage of decomposition or something). Did you and your friend just find lo cribe??????????????????A???A?A???A/fd/ffd/fdflfmf.f.c? If the goo bears, doesn't that mean it is part of lo cribe?

The problem is, that from a classical first-order formal logical perspective............. human intentions are worthless. But more than that LOJBAN HAS NO COMPLETE, FORMAL DEFINITION OF ANYTHING EXCEPT ITS GRAMMAR AND IT ISN'T EVEN CLEAR ON WHAT A SET IS WHILE USING THEM

So the BPFK nerds argued about this endlessly in one of their sessions and could not come to any kind of conclusion on whether bear goo is lo cribe or not so they decided to drop almost all the semantics described above of le and lo despite the fact that no human speaker could come across bear goo and describe it as lo cribe rather than le cribe (as in goo that is bearing in speaker's idea of the goo) unless they were trying to make a smuglord point about it being lo cribe to a committee and we got the "xorlo" reform. Now you use lo for everything unless you have some specific things in mind and then you use le. And these have no qualifiers unless you add them so "lo gerku" could mean a dog, all dogs, or some dogs and "le gerku" means some unspecified amount of things I'm thinking of, each of which I describe to you as a dog.

Btw, I didn't even get into the plural quantification thing, as in there are predicates that can only be satisfied by groups of things and not by the things themselves and vice versa. Like one person cannot "gather" and a group of people cannot eat, the people in the group each eat

If you wanna melt your brain some more with that, here you go: https://mw.lojban.org/papri/gadri:_an_unofficial_commentary_from_a_logical_point_of_view :3

Also, Lojban has set and "mass" gadri like le and lo that do explicitly specify masses (a group of "individual" (that has its own bad definition) things that has the same properties of the individual things which compose it and may have other properties as well) and sets (a group of things that has entirely different properties from the individual things that compose it). Like one way to interpret that is that sets of bears have cardinality but they don't have "bearness" and vice versa for sets) which, as a result, can be used to specify the stuff about plural quantification of variables in predicates (see the above link for info about "collectivity" and "distributivity")

If this all sounds really confusing, that's because it is and I'm still trying to figure out what Lojban means to me. Initially I was using xorloified Lojban cuz it's an official reform (but technically not a "finished" official reform or something that is permanently binding lol, idek, it's been 20 years lmao) and I hadn't yet realized Lojban is in the process of splitting into multiple dialects cuz, to me it seems like, people who cared about the logical aspects of Lojban are now dissatisfied with the lack of progress on better formalism and the whole language actually and xorlo and similar BPFK reforms that make the language easier to speak while reducing its semantic richness and the people who don't really care about logic in Lojban hate that stuff anyway and are cool with dropping most of it. I read some of the arguments for and against xorlo and I am using non-xorlo Lojban for now cuz I wasn't convinced by the bear goo arguments. It doesn't matter that much for communication most of the time, I feel like, usually, xorlo people will just miss out on some of the meaning of what a xorlo non-user will be saying but otherwise it's okay. Although, sometimes I legit cannot understand wtf IRC users are saying at all lol cuz their dialect is so far from the baseline Lojban I've been learning and also I cannot make their utterances parse sometimes in the computer parsers cuz its so far away from baseline lol


Lojban myths and truths

  1. Lojban is a logical language

This is true, as long as you can agree on what lo logical language ku means :3 Tbh, when I went in I rly was expecting SOME kind of real formal system behind it before I found out that Lojban is a language to speak some logic but not rly a language to do logic, I guess. Most of the parts are there but not much has materialized (yet?)

There are some attempts to formalize Lojban more, this new one is particularly impressive tbh, even though I think the approach is kinda wrong: https://brismu.systems/ (brismu: a relational interpretation of Lojban)

  1. Lojban is an unambiguous language

This is true syntactically. There are real formal definitions of Lojban grammar out there based on BNF grammars and parsing expression grammers which is VERY VERY VERY NICE lol. It really helps to learn the language cuz if you're doing something grammatically wrong a parser will reject it. Semantically, I think you can be even more vague and ambiguous in Lojban (like with tanru, which are metaphors based on combining multiple gismu) than probably most natural languages but you can also be painfully, beautifully non-ambiguous with your meaning :3

So.... it's mixed

  1. Lojban is somewhat culturally neutral

It's hard/impossible for me to say exactly cuz I was a monolingual English speaker before learning Lojban and I live in amerikkka which I think is where most of Lojban's creators are from. It does have some very nice features though. Like there is no set word order, I didn't mention it before but you can move around the sumti places and selbri in pretty much any configuration you want so likee if you don't speak a SVO language like I'm speaking now you can totally just do whatever you want and still be understood. Also, the parts which make up the gismu which form the root words of the language are algorithmically sourced from the world's biggest languages weighted by how many speakers they have. So Mandarin is the most important source for the gismu

But at the same time Lojban's phonology (how it sounds) and especially orthography (how it's written, like with the Latin script) is quite European, although there are much less used writing systems for Lojban floating around. I should mention though that the language, while it has a very European phonology, was carefully designed so that speakers who might have trouble making some of the more difficult or culturally specific sounds can still be understood. Tbh that would be worth a post of its own but you can find details in the first few chapters of the Red Book if you're interested


On perfection

Am getting tired now so I'm gonna finish this up before I get too tired to post it and then forget it but...

I don't want anyone to put off of trying Lojban out cuz of the bear goo stuff or anything else I mentioned. It's genuinely a very cool language project, it's just unfinished imo even if most of its original creators are done with it

There are a lot of things in Lojban I wish we had in English, like spoken tone indicators, evidential indicators (cmavo that attach to things to specify how you came to know or believe something), actually working grammar checkers, etc etc etc

Would encourage you to look into Lojban if any of this stuff interests you. You'll just have to figure out some/a lot of it on your own but if you figure out a good interpretation maybe we should all start speaking it hehe :3

I hope you liked reading, sorry if it's messy I kinda just threw this together, little energy for more elaborating rn

    • PaX [comrade/them, they/them]
      hexagon
      ·
      edit-2
      3 days ago

      tony-cheer

      I love this :3

      It works but you might have more clarity imo saying: "le cribe cu ckaji le ka pitxavyterkubli" (The bear has the quality of planar-6-sided-ness). I forget what this is called but the implied meaning of tanru and lujvo are changed by conversions of the gismu involved right? My take on that lujvo (which I made using the program "jvocuhadju" which is a component of the "jbofihe" parser which implements the official lujvo making algorithm) was made from the tanru "plita xa te kubli" cuz the x1 of kubli is the thing that is a cube/polygon/regular polyhedron but I wanted the meaning of the x3 place for surfaces/sides. I'm still very new to all this ofc. Also ofc, lujvo have defined meanings unlike tanru but I'm just going off of the rafsi of yours since no dictionary has that hehe :3

      Ofc, you could also just say "le cribe cu kubli li re li xa" for "the bear is a polygon of 2 dimensions and 6 sides"

      Are you a Lojban speaker?

      Edit: tbh, this polygon thing reminds me of the cultural gismu problem where inherently there's no way to have a complete set of all polygons defined in the language and that does kinda suck if you want a short way to state types of polygons or whatever, rather than using.... like.... a description sumti based on kubli with some sumti attached (another way to state the above lol, maybe better). I just looked in the dictionary though and someone has been cooking some experimental cmavo up that describe polygons based on Schlafli symbols or something, pretty cool but not familiar with the theory

      • NephewAlphaBravo [he/him]
        ·
        edit-2
        2 days ago

        I just stole a "word" for hexagon from here, so maybe it's an old way to construct it. Or more likely it's because I've never heard of lojban before your posts and wasn't 100% certain I even put the "sentence" in the right order lol

        Lojban is really funny to learn about when the other languages I speak are german and computer programming, the way the words are defined like function calls with arguments so you wind up making these huge extremely-literal compound words like you get in german

        Any idea what the "vy" in "xavypitkubli" is doing there by the way? I gathered that the whole thing means (six)(??)(abbreviated form of "plane")(regular shape) but couldn't figure out what that second bit is doing in there. "Pentagon" had the same thing with the word for "five" being followed by an extra "m"

        • PaX [comrade/them, they/them]
          hexagon
          ·
          edit-2
          2 days ago

          Sorry, I accidentally hit the post button on the other reply lol

          I just stole a "word" for hexagon from here

          Ohhh, I seee. Is weird cuz that word isn't in the dictionary (I like https://la-lojban.github.io/sutysisku/lojban/index.html and https://vlasisku.lojban.org/) yet that page says the definitions are from la vlasisku lol. Maybe it got deleted later or something

          Or more likely it's because I've never heard of lojban before your posts and wasn't 100% certain I even put the "sentence" in the right order lol

          Actually pretty cool that you are already making valid bridi with lujvo selbri then :3

          Even though I didn't know that lujvo I could guess its meaning and place structure from it's components

          Btw, this parser has been the most vital tool for me while learning this language hehe: https://lojban.github.io/ilmentufa/camxes.html

          The fact that the grammar is truly unambiguous is sooo cool to me, rly helps learning in a systematic way (I have always struggled with learning other languages cuz of all the vibes-bases rules with a trillion exceptions). Now I can just try to write something and have the computer tell me if what I said was at least syntactically correct and if it parses like I intended

          Lojban is really funny to learn about when the other languages I speak are german and computer programming, the way the words are defined like function calls with arguments so you wind up making these huge extremely-literal compound words like you get in german

          Yesss, hehehe. Tbh when I started learning Lojban and even now it kinda feels like a "thought programming language" lol. I don't know any other people-languages but I always did like the German language word-compounding features so I did/do similar things in English even now lol.

          Also, when I first started learning Lojban I was making up lots of compound words but now I try to do it less cuz like while lujvo have defined meanings and place structures in the dictionary, you can also just make them up on the spot and hope people will understand you. There is another structure called a "tanru" and these are actually the source of lujvo. It's like a kind of compound word metaphor that works like this

          Let's say there's uhhhh, a really fast dog and you wanna describe this fast dog as existing hehe. The gismu (Lojban root word) "gerku" means just "x1 is a dog/canine of species/breed x2" but it doesn't describe this dog of having the quality of "fast". So what you can is actually make a tanru with the word "sutra" that means "x1 is fast/swift/quick/hastes/rapid at doing/being/bringing about x2 (event/state)":

          imagine I pointed to the fast dog while I said the following bridi

          ti [cu] sutra gerku
          This is a fast-type-of dog
          

          Tanru are made up of multiple brivla (words that can serve as selbri which are the gismu, lujvo, and cmene (these are Lojban names)) that, by default, left-associate to modify the words to the right. The final place structure of the tanru (which is basically a compound selbri) is determined by its last component (like in the above case: "gerku") There's some great examples here if you want to learn more: https://la-lojban.github.io/uncll/romoi/xhtml_section_chunks/section-simple-tanru.html

          Sadly, while they are extremely flexible, tanru are also often kinda vague. Like I didn't specify what action the dog is fast at doing :3 Although, I think at least many people will read that as a fast-moving type of dog hehe

          I actually could fix that and be less ambiguous if I used the cmavo "be" to attach a "abstractor" sumti to sutra while it's inside the tanru I think (I need to go back and reread the section of the CLL about that). Abstractors are like le and lo but for using turning whole bridi into sumti. Here's my attempt with the abstractor "zu'o" which means something like "activity of [bridi]" in brief words:

          ti [cu] sutra be fe le zu'o cadzu be'o gerku
          This [end preceding sumti, begin selbri] fast (attach sumti) (next sumti is in x2 place) the activity of (bridi: walking) (end attached sumti, sometimes you can drop this terminator but not here) dog
          
          This is a (fast-at-the-activity-of-walking)-type-of-dog.
          

          As vague or painfully non-ambiguous as you like :3 I love it hehehe. I also try to avoid making tanru when there are better, clearer ways to accomplish the same thing

          Any idea what the "vy" in "xavypitkubli" is doing there by the way? I gathered that the whole thing means (six)(??)(abbreviated form of "plane")(regular shape) but couldn't figure out what that second bit is doing in there. "Pentagon" had the same thing with the word for "five" being followed by an extra "m"

          There are 2 primary reasons for the "vy" in that lujvo. The first is easier to explain: Lojban gismu (root words) and cmavo (structural words) have these things called "rafsi" associated with them. Each gismu has a few defined rafsi that are like pieces that represent the whole gismu that you can use to build lujvo with. The assigned rafsi (just one in this case) for the number "xa" is "xav". So if you want to put it in a lujvo, you would use xav. And also I think besides it being assigned, part of the reason for it being that way is cuz of the stuff I am about to say lol

          The second reason (that explains the "y") has to do with Lojban word structure. And btw the "y" is a schwa sound like a short "uh". Something very important to spoken Lojban working well is being able to tell where where words begin and end unambiguously. Lojban is supposed to be exactly the same in its written and spoken forms. As in: you can say every written Lojban utterance including what English speakers would normally call "punctuation" and "emoticons or emoji" and every spoken Lojban utterance should be able to be transcribed exactly as if it were always written. English speakers just use vibes and experience to parse out whereotherpeople'swordsbeginandend :3 but Lojban has just a few rules to keep in mind (that were also used to build the entire vocabulary) when you're making your own words like names (cmene) or lujvo, that work together so if you miss or forget some of the rules or are not capable of making certain sounds (like word stress) the other rules will still save you from saying ambiguous utterances even if your utterance is technically not valid according to the formal grammar

          And this is the part of Lojban I'm fuzziest on cuz I haven't had the opportunity to actually speak to someone with Lojban sadly so I haven't practiced or internalized a lot of it but I will try to explain. One of these rules is that Lojban words are separated by either glottal stops or short pauses. Actually the "." character in Lojban means that there is an explicit glottal stop and most speakers separate their sentences with a explicit glottal stop and the new bridi marker "i" like this: ".i". Spaces between words can mean either and I tend to switch between them depending on how fast I'm speaking but my native dialect of English does that "hard attack" thing where I do that even in English so it's natural to me lol

          The reason there is that "y" there is cuz if it wasn't there it would cause the word to be ambiguous about whether it is a single word or 2 words according to the above rule. It's possible that in the time between you vocalizing "xav" and then "pit", a listener could interpret that short pause as a word separator like "xav pitkubli" but because the schwa is there it keeps the word together. It also might violate rules about permissible sounds at the end of words. Something like that. Btw, you usually don't need to remember all of this all the time, part of the official algorithm for making lujvo has a step for inserting "y"s where they would be needed.

          You can also read more about it in the CLL here: https://la-lojban.github.io/uncll/romoi/xhtml_section_chunks/chapter-phonology.html probably way better than I explained hehe