Good Reason

It's okay to be wrong. It's not okay to stay wrong.

Category: computing (page 5 of 6)

Prodigious texters, those kids.

A factoid about youth and texting from this article.

American teenagers sent and received an average of 2,272 text messages per month in the fourth quarter of 2008, according to the Nielsen Company — almost 80 messages a day, more than double the average of a year earlier.

I’m ignoring the hand-wringing tone of the article — kids will be fine, and I’m glad they’re communicating to each other in writing. But is anyone else rather surprised by that rate of messages? I know I’m an old fart, but even when I was a young fart, I never communicated with friends at that rate. That’s a message every twenty minutes, day and night.

Bad Facebook friend

Cheney people are having a non-stop virtual high school reunion on Facebook. I’m communicating with people that I haven’t seen in 25 years. What a great way to bring people back into your life!

One day not long ago, a guy from my old high school sent me a friend request. To preserve his anonymity, I’ll call him ‘Barry’. Barry was never very popular in school, but I couldn’t say exactly why. He was actually a pretty decent guy, as I found out when I talked to him at a chance supermarket meeting a couple of years after graduation. But not the sort of guy you’d hang out with. You’d say hi in the halls. Maybe if you were socially conscious, you’d hope no one was looking, but you’d feel guilty about it. A very high school feeling.

So when I got Barry’s friend request, I felt that old reflex. It was like Barry had come up to me and said, “Hey, guys! What are you doing?” A brief thought: would people see him on my friend list? Then I came to myself and felt ashamed of my reaction. What was this, high school all over again? I had forgotten he existed for the better part of three decades, but he remembered me, and now here he was, asking for me to be… his friend. And all it required of me was to click.

Had I learned nothing about common human decency in the last 25 years? We weren’t kids anymore — especially not Barry, by the looks of his Facebook photo. But what did that matter? I’m past all that stuff. Yes, Barry, yes! I will be your friend! And just that simply, the pettiness of adolescence was erased in one virtuous act. If Barry had only one friend, it would be me. Even if I was just a Facebook friend.

The next day, Facebook sent me an email. I had a pillow fight request. Barry had somehow hit me with the Eiffel Tower. I ignore these requests from everyone, and so I ignored it from Barry. Over the next two days, Barry hit me with four more objects, and invited me to play backgammon and Scrabulous. I was busy. I changed my email notification preferences.

I don’t hit the ‘Book often, so the next time I logged on, I found that I had been kidnapped twelve times, each time by Barry.

I tried not to feel conflicted as I clicked the ‘Ignore all requests from this friend’ button. Stupid Facebook, bringing people back into my life. Why couldn’t they stay in the past where they belonged? It was high school, all over again.

Evolutionary car

Here’s something fun to leave on your screen for a while. It’s an evolutionary car.


Ouches in 3..2..

The problem here is: what’s the the best wheel (and load) size and position to get a little car across a rugged landscape? Genetic algorithms are good at these kinds of problems. Just release a population of slightly differing individuals into the wild, and let the best performers produce offspring that are sort of like they are. Nifty.

Sine-wave speech

This is very cool. Listen to the first clip. Sounds like twitters and blips.

Then listen to the second clip.

Now go back and listen to the first. It becomes comprehensible once you know what you’re looking for.

This is called ‘sine-wave speech’. When a linguist records your speech using a spectrometer, there are dark patches of high intensity, called formants. Draw the formants using sine waves, and you get the twittery sound that resembles somewhat-but-not-quite speech.

I guess this is yet another example of how perception depends on the knowledge and expectations of the perceiver. Something to remember when I try and understand the voting habits of others.

In what respect, computer?

Someone’s invented the Palin interview generator, so I don’t have to.

It works by Markov chaining, which I’ve had some fun with before (see here and here).

Basically, you pick — let’s say — two words from a corpus (in this case, Palin’s interviews), and use them to choose a likely third word (also from your corpus). Then pick a fourth word using words two and three. Word by word, you assemble whole sentences. Which I think might be how Palin constructs sentences herself, so bonus points for psychological reality.

Here’s a snippet.

Q: What is the role of the US in Iraq and Afghanistan?

Afghanistan will lead to war and it doesn’t have to lead, as I said, to a position like we are at a point, here, seven years later, on the side of the United States. I want you to not lose sight of the earth. That’s not a part of, I guess, that culture. The way that I have understood the world is through education, through books, through mediums that have provided me a lot of perspective on the side of the message that Americans are getting sick and tired of politics as usual, that embracing of the status quo, the politics as usual, that embracing of the status quo, going with the Bush administration that we have to stand for that.

Cut to shot of Katie Couric, brow furrowed.

Is your writing male or female?

There are lots of things in CompLing that I wish I’d thought of. Here’s one: the Gender Genie.

Just paste in some text (perhaps even a blog post from your humble Good Reason host), hit frappé, and it’ll give a guess as to the sex of the author.

I come up as male, as does PZ Myers, and (somewhat interestingly) Andrew Sullivan and TRex from Firedoglake. (Well, I had to check.) The wonderful Digby, had she not outed herself as female, would have been sniffed out by the Genie. Ann from feministing.com comes up male more than once. Walcott‘s running 50-50.

It works much differently than one might expect. It ignores all the manly or non-manly words like ‘explosion’ or ‘needlepoint’, and instead uses a very small set of function words. Each time you use ‘with’, that’s 52 points on the female side. Each use of ‘around’ gives you 42 male points. Women get pronouns: she, me, hers, we. Men get all the determiners: the, a, some, more. Add them up and see which side of the scale is heavier.

I don’t know what we can generalise from this about male/female communication, but it’s very cool.

The 100 most common words in English

Another language game: How many of the 100 most common English words can you name in five minutes?

Of course, the actual words in the list will depend largely on the corpus they’re using. Still, it’s an interesting challenge. As a computational linguist, I must have looked at loads of lists like this, but I only got 50 of them. That was more a function of how fast I could type. You have to bang out one every 3 seconds with no mistakes to get them all.

Hmm. How can I get a hold of their language data? I’d like to know what people thought were the most common words.

Knol debuts

I’m checking out Knol, a new project from Google. It’s a knowledge base that’s sort of a cross between Wikipedia and Amazon ratings. Here the focus is on authorship. Authors write their own Knols, instead of contributing to a group’s effort like some amorphous blob author. That means experts in a field can represent.

I can see how this would avoid some Wikipedia problems.

  • Wiki-vandalism wouldn’t be a problem since you have control over your own Knols.
  • No more pointless and frustrating edit wars on intractable topics.
  • You don’t have to pretend to be neutral.
  • Original primary research would be allowed, unlike on Wikipedia.
  • It also avoids classification problems. Much discussion on Wikipedia concerns whether this article should be merged into that article. In Knol, the user does the classifying with only the search terms. A Knol could pop up under many different searches instead of just appearing in one article.

But is having so many separate authors a good way to arrange the world’s knowledge? One thing I’m finding in my language research is that individual points of view are terrific, if they can be aggregated into some kind of group opinion. Wikipedia does this by forcing people to hash out the issues and decide what content will appear on the page. Knol takes a rating approach, where individual votes from readers will (presumably) cause good articles to float to the top of the search results page. So it’d be like a bit like Wikipedia, except that there would be maybe 30 articles on generative grammar instead of one, and while each individual article might not be as strong as a good solid Wikipedia article, the best would come pretty close.

Here’s an example: I tried looking up ‘atheism’, and got one result. (Yeah, it’s early days.) It was a thoroughly useless page by “United Church of God” about how great God is, blah blah blah. Basically it was an old school web page that someone ported to Knol. But it’s getting downweighted mightily, and as more articles appear, it will probably sink out of sight.

The more I think about this, the more it grows on me. The next task will be some kind of true aggregation, where instead of reading only the best article about (say) the existence of UFO’s, you take the 100 top-rated articles on the topic, and automatically generate a group opinion. Sounds like a job for someone who’s into automatic summarisation and paraphrasing.

I bags it.

Where’s Daniel? And what’s that hammering noise?

Been working on a conference paper.

You know what the worst thing is about doing a paper? No, not getting it rejected. Okay, the second worst thing. Not being able to find any research similar to yours.

Actually, it could be a great thing. You could be the genius who has figured out something new that no one’s ever thought of. On the other hand, you could be doing something worthless that no one else wants to do. Or — more terrifyingly — you’re just a sucky researcher who can’t do a literature review, and it’s already been done, and everyone knows it. Except you, you lazy person. Frightening, isn’t it?

So when I get a good result, I always feel elated, but I brace myself. Writing this paper has been a bit like that.

But it’s in the can now. I’ve sent it off to EMNLP, a Very Big and Important Conference. And if you want to see the results of the study, I’ve made a presentation that you can view. You can read a PDF, or you can have a cute Flash animation, if you’d rather.

I got your community standards right here!

I tell ya, that Google’s useful for all kinds of things.

The operator of a porn web site has been brought to trial for violating ‘community standards’. But who knows what ‘community standards’ are? Well, his lawyer has an interesting answer: check out Google Trends and see what the community’s really up to!

In Florida, it turns out that the search term ‘orgy’ is as American as ‘apple pie’.

Except that ‘apple pie’ hits a spike around Thanksgiving, but ‘orgy’ is popular year-round.

But Perth? Looks like we’re just into surfing. New South Wales is a different story. It’s just about Orgy Season over there. Meanwhile, in Victoria, the worrying fisting trend continues unabated.

Older posts Newer posts

© 2024 Good Reason

Theme by Anders NorenUp ↑