Good Reason

It's okay to be wrong. It's not okay to stay wrong.

Category: language (page 18 of 22)

Where’s Daniel? And what’s that hammering noise?

Been working on a conference paper.

You know what the worst thing is about doing a paper? No, not getting it rejected. Okay, the second worst thing. Not being able to find any research similar to yours.

Actually, it could be a great thing. You could be the genius who has figured out something new that no one’s ever thought of. On the other hand, you could be doing something worthless that no one else wants to do. Or — more terrifyingly — you’re just a sucky researcher who can’t do a literature review, and it’s already been done, and everyone knows it. Except you, you lazy person. Frightening, isn’t it?

So when I get a good result, I always feel elated, but I brace myself. Writing this paper has been a bit like that.

But it’s in the can now. I’ve sent it off to EMNLP, a Very Big and Important Conference. And if you want to see the results of the study, I’ve made a presentation that you can view. You can read a PDF, or you can have a cute Flash animation, if you’d rather.

I got your community standards right here!

I tell ya, that Google’s useful for all kinds of things.

The operator of a porn web site has been brought to trial for violating ‘community standards’. But who knows what ‘community standards’ are? Well, his lawyer has an interesting answer: check out Google Trends and see what the community’s really up to!

In Florida, it turns out that the search term ‘orgy’ is as American as ‘apple pie’.

Except that ‘apple pie’ hits a spike around Thanksgiving, but ‘orgy’ is popular year-round.

But Perth? Looks like we’re just into surfing. New South Wales is a different story. It’s just about Orgy Season over there. Meanwhile, in Victoria, the worrying fisting trend continues unabated.

English First = xenophobia

Americans think that it’s important to learn languages. As long as the language is ‘English’, and the learner is ‘someone else‘.

Americans believe by large majorities that it is more important for newcomers to learn English than it is for their fellow citizens to become bilingual.

The latest Rasmussen Reports national telephone survey reveals that 83% of likely voters place a higher priority on encouraging immigrants to speak English as their primary language.

Just 13% take the opposite view and say it is more important for Americans to learn other languages.

Broken down along party lines, 79% of Republicans and 59% of Democrats reject the idea that all Americans should know multiple languages. Among unaffiliated voters, 68% say their fellow citizens do not need to know a language other than English.

Last fall, a Rasmussen Reports survey found that 77% of Americans believed that employers should be allowed to require employees to speak English while on the job.

We tend to feel very attached to our language. It’s our tool for expressing who we are. We might not even be able to think without it. As a linguist, I’ve noticed that even smart, aware people aren’t very good at examining their language attitudes. That means it’s an area where our darkest urges can pool and simmer. And that means that we can frequently find xenophobia lurking here.

An antidote is to learn to talk the way someone else does. Not only do I want to see official documents available in other languages, I want to see second language instruction promoted more aggressively in schools. It’s not just a way to expand the mind, it’s a way to combat the more pernicious kinds of intolerance.

He’ll bollix up everything, given the chance.

Did John McCain make a reference to testicles in a recent speech?

As a matter of fact, he did.

The United States Supreme Court yesterday rendered a decision which I think is one of the worst decisions in the history of this country.

So now what are we going to do. We are now going to have the courts flooded with so-called, quote, Habeas Corpus suits against the government, whether it be about the diet, whether it be about the reading material. And we are going to be bollixed up in a way that is terribly unfortunate, because we need to go ahead and adjudicate these cases.

Astute readers might have noticed that ‘bollix’ appears to be a variant of ‘bollocks’, which indeed it is.

The Online Etymological Dictionary (that’s the other OED) says it’s a

respelling (euphemistic?) of bollocks, pl. of bollock “testicle,” from O.E. beallucas “testicles,”

As to the content, McCain is arguing that the rule of law shouldn’t be followed because it’s too time-consuming. Much better to just throw people in jail forever because you just know in your heart that they’re guilty.

To which I say, bollocks.

Religious nutter Turing Test

The Turing Test is a classic in AI. On one side of the screen is you, and on the other side there’s either a human typing to you, or a computer generating text to you. A computer system passes the Turing Test if you can’t tell the difference between the computer and a human.

But when the human is a religious ranter, it tends to lower the bar a bit.

So here’s your test. One of the following text blocks is a bit of an email I got today from ‘Günter’, a poor soul trapped in two false beliefs: that supernatural beings exist, and that he can write comprehensible English.

The other block of text was made with a simple Markov chain trained on word trigrams from Günter’s email.

Sample number one:

Every thing, Love is a ground-need, without Love no Feelings are working, no human can find anymore satisfaction no matter what he trays to do it, the highest law of God, and no grace. Not even when somebody used your Authority-shyness like always, it is exactly the same. With Love the Apocalypse is running for ten years. {Glasshouse-effect? Global warming ?}.You can easy scientific prove, it is up to you. Jesus said <I came on earth to bring the Love and only where the Love and how to do Love, try it and you have to feel it. John says <even when you are doing what I say {that only is FAITH} and not only when you know about it>.Jesus said: sacrifice your self <you have to like them in any way, only give what you have<Logo>!

Number two:

That’s why Jesus said the End is near. The human where believing that Love is: cooking a meal; mending socks; squeeze a lemon; give Money, Tender; Fondness or even Sexuality. They filling up whole Libraries with books about Love, only in the explanation of the Old Testament {Torah; Koran; Kamathutra ;} or Jesus was never one interested. Jesus tried to teach Love and how to do it, the highest law of God. Out of the old scriptures he explained; proofing and showed in life what he is talking about. God says in the in the Old Testament; Torah; Koran; Kamathutra; <I m the Love and only the Love and only where the Love is can I be.> don’t make a picture or allegory {don’t compare me with nothing or nobody} of me. Never!!

Well, humans, which is the person, and which is the computer?

Sarcasm experiment

Sarcasm’s a funny thing. You say the opposite of what you mean, but somehow the other person uses their knowledge of the situation and the ordinary meaning of your words to unwind the utterance and decode your intention.

A new study gives some insight into what’s happening brain-wise. Usually it’s the left hemisphere that handles language, but apparently when part of the right hemisphere is knocked out, people become unable to pick up on the paralinguistic cues (like intonational contour) that signal an indirect speech act like sarcasm.

Although people with mild Alzheimer’s disease perceived the sarcasm as well as anyone, it went over the heads of many of those with semantic dementia, a progressive brain disease in which people forget words and their meanings.

“You would think that because they lose language, they would pay close attention to the paralinguistic elements of the communication,” Dr. Rankin said.

To her surprise, though, the magnetic resonance scans revealed that the part of the brain lost among those who failed to perceive sarcasm was not in the left hemisphere of the brain, which specializes in language and social interactions, but in a part of the right hemisphere previously identified as important only to detecting contextual background changes in visual tests.

Context, eh? Can’t understand sarcasm without that.

I’m serious.

How to improve T9

I recently won the contest for ‘Last Person on Earth to Get a Mobile Phone’. First prize was a mobile phone. I like it. It was worth outlasting that guy from the Amazon. He got a toaster, and nowhere to plug it in. Ha.

My phone uses T9, the predictive text algorithm. It was invented in the early 90’s, and don’t you think we would have come up with some improvements in language technology since then? But no, we’re still stuck with it, and every day I text Ms Perfect to tell her that I’ll be ‘good room’ instead of ‘home soon’. ‘Good’ and ‘home’ are textonyms, you probably know, both keyed as 4663. I can change from one to the other by hitting zero, but it irritates of.

Irritates ‘me’, sorry.

I’ve seen very little out there on improving the T9 algorithm, so here are my suggestions.

  • At the very least, correct gibberish words. Even a relevant word like ‘texting’ comes out as ‘textiog’ on my Samsung mobile.
  • Auto completion. When I type a long word like ‘predictive’ or ‘abracadabra’, it should have a way to complete the word for me. If there is one, someone let me know.
  • Long-term memory on training. T9 does try to adapt to your usage. I’ve noticed that if I type the same textonym over and over, changing it to another variant each time, it’ll select the variant automatically on the fourth time. But only for that message. Next message you send, it forgets all your training. What is the point?
  • And this is the big one: Word bigram modelling. Many textonyms could be disambiguated simply by looking at one or two previous words. For example, ‘good’ and ‘home’ are both 4663, but the previous words are very often different. If the previous word is ‘coming’, choose ‘home’. If ‘is’, ‘was’, or an adverb like ‘very’, choose ‘good’. It’s very simple to check this. When I compared ‘home’ and ‘good’ in the Brown Corpus, there were no duplicates in the top 100 lists of words previous. Same with ‘am’ and ‘an’, another pair of textonyms. Which tells me that just looking at the previous word would be enough to disambiguate in the majority of cases. And that means we can stop hitting zero so many times.

Accents

Currently on heavy YouTube rotation is aspiring actress Amy Walker, presenting 21 accents in two and a half minutes.

Accents are interesting. It’s easy to draw a lot of inferences about people from their accent, even when you’re trying not to. Even linguists aren’t immune to some strange attitudes. There I was, enjoying the clip. Then she got to Seattle, and just for a moment in spite of myself, I caught myself thinking, “Well, that one was easy. She was just talking like a normal person there.” Which I know is silly, because everyone has an accent.

So, at what point did her accent seem least marked for you?

Oh, and if you want to play with accents, try Sound Comparisons out of the University of Edinburgh and the Speech Accent Archive courtesy of GMU.

Apes and syntax

In an earlier post, I pointed out why linguists reject the notion that apes (or birds, or dolphins, or other non-humans) can use human language. Sure, they can manipulate symbols to get what they want, just like we do. But human language is unlike anything we see in the animal kingdom.

Ask a linguist what’s the difference, and she’ll probably say it’s a matter of syntax. In a human language, the words have to come in a certain order. ‘John hit Bill’ is different from ‘Bill hit John’. Or if I say “The fluffy bunny exploded,” you have an automatic understanding that ‘fluffy’ and ‘bunny’ have a special relation to each other. They’ve grouped into a structural unit. Non-human communication — and even apes who are taught human language — never shows any syntax of this type.

(If you’re new to this area, here’s a really good article about it. All the major players weigh in, and it’s very readable.)

But beyond the ‘language or not’ issue, there’s an even more interesting discussion. Namely, if it’s not language, what is it?

Linguists break into two camps: There are the linguists who say that human language is something qualitatively different from animal communication. This would be Chomsky et al. They’d say there’s a Language Acquisition Device in the human brain (as yet undiscovered) that no other animal has, and though they may be intelligent and communicate, they’ll never ‘graduate’ to real language use. We humans have the principles of syntax — that all human languages follow — hard-wired natively into our human brains.

Then there’s the other team that say human language is just more complex than animal communication. Maybe there’s a continuum where animal communication can be more or less language-y, and all animals fall short of real language behaviour. Maybe if animals were doing something different, they’d slide up closer to language. Maybe syntax is something a very smart animal can do, and if other animals were smarter, they’d do syntax too. Maybe people use syntax to keep everything straight because talking is so demanding. And so on.

This view is interesting because if we suppose there’s a scale of languageness, we can see how far up the scale animals can go. Which takes us to some interesting work from a while back.

Nonhuman primates are unable to grasp a fundamental grammatical component used in all human languages, researchers at Harvard University and the University of St. Andrews in Scotland reported recently in the journal Science. Their work provides the clearest example to date of a cognitive bottleneck during the evolution of human language, suggesting a sharp limit to animals’ capacity to generate open-ended communication and possible restrictions on other domains of thought.

The experiment was this: they played sounds of a man and a woman speaking nonsense syllables to groups of cotton-topped tamarinds. The actual syllables weren’t important; what mattered was the male/female order of the voices.

One group heard patterns of male and female voices that could be generated by a regular grammar, the simplest kind of pattern generator you can have and still call it syntax. This generates very simple sequences like MFMFMF. Once the monkeys got used to the pattern, the experimenters broke the rules by switching up the sequences. Sure enough, the monkeys noticed; they would turn their heads to the loudspeaker as if to say, “What the?”

But other monkeys got patterns generated by a context-free grammar, one step up in complexity. Here the monkeys would hear patterns like MF, or FFMM, or MMMFFF. When these patterns were broken, the monkeys didn’t even notice, which indicates that these grammars were too complex for them.

Most human languages are a step up even from that, following rules allowed by ‘context-sensitive’ grammars. So, conceptually, the syntax of human language is way beyond the capabilities of even these clever types.

Some animal researchers claim that their African Gray Parrots are understanding them and generating real English sentences. I’d love to see what kind of patterns these birds are capable of. Seems this kind of test could help sort out the difference between simple parroting and real language use.

Australians inexplicably irked about ‘beaver ad’

New limits on taboo terms? Not only can you not use bodily euphemisms, some folks get huffy if you even suggest them.

A tampon company may be forced to cancel TV ads that show an attractive, young woman going about her day with a beaver in tow.

Here’s the ad.

Just think: every woman is walking around with one of those things. Interesting. But it got some people feeling all hot and confused.

The Advertising Standards Bureau in Canberra received a “large number” of complaints as soon as the ad aired on Sunday – the day after International Women’s Day.

I wonder how large a number it was. Wouldn’t you think someone that uptight would be too clueless to even get the reference? On second thought, nah. Anti-sex people have minds like sewers, and it scares them.

What might those two guys on the beach be saying, by the way? My guess was too predictable.

Older posts Newer posts

© 2026 Good Reason

Theme by Anders NorenUp ↑