03 March 2011

OK!

I had things come up today that took away my time i was going to use to write part 2, so to hold the fort until tomorrow when i (probably) will finish up, here is a short article on the word OK that is intriguing. Enjoy!

bbc.co.uk/ok_article

25 February 2011

Computers and Language: Part 1

Last week, peppered amidst the news of uprising in the Middle East and Northern Africa are reports of a more more sinister nature. Well, according to the conspiracy theorists among us. For the rest of us, the computer Watson and his romp on Jeopardy was simple, light fun and a small precursor to what the future may hold for us.

This video is of the first half of the first episode airing the matchup of Watson against the two best contestants humanity could offer, Ken Jennings and Brad Rutter. Feel free to watch the whole nine minute ordeal, but the section that interests me is at the beginning when IBM has a small documercial (documentary commercial, a word i made up for this sentence) about Watson and the team who designed him.

With Watson and IBM so prominent in the news, articles are cropping up left and right concerning machine intelligence and the day, looming ever closer, we can refer to a machine as a thinking entity.

So far, we are safe from a machine uprising. In the first of two games, Watson went into Final Jeopardy with a commanding lead over the two human opponents. Under the category of ‘U.S. Cities,’ the clue given somehow managed to trip up Watson and leave even non-Jeopardy players scratching their heads.

‘Its largest airport is named for a World War II hero; its second largest for a World War II battle.’

The correct answer: Chicago.

Watson’s answer: Toronto.

Obviously, and most North Americans would know this, Toronto is not a U.S. city, residing as it does in the grand country known as Canada.

Jennings and Rutter both answered correctly, so why did Watson get the answer so very wrong? Steve Hamm at IBM, through their Smarter Planet blog through their had an answer for us as to why Watson was so very wrong:

First, the category names on Jeopardy! are tricky. The answers often do not exactly fit the category. Watson, in his training phase,  learned that categories only weakly suggest the kind of answer that is expected, and, therefore, the machine downgrades their significance.  The way the language was parsed provided an advantage for the humans and a disadvantage for Watson, as well. “What US city” wasn’t in the question. If it had been, Watson would have given US cities much more weight as it searched for the answer. Adding to the confusion for Watson, there are cities named Toronto in the United States and the Toronto in Canada has an American League baseball team. It probably picked up those facts from the written material it has digested. Also, the machine didn’t find much evidence to connect either city’s airport to World War II. (Chicago was a very close second on Watson’s list of possible answers.) So this is just one of those situations that’s a snap for a reasonably knowledgeable human but a true brain teaser for the machine.


The problem Watson ran into what partly due to its programing and training--not putting as much weight on the category as was necessary--and partly due to the wording of the question.

Jeopardy, as difficult as it can be due to the puns and wordplay often involved in the clues, is a static medium. There is always a category, always a clue, and always an answer question that answers the clue and fits within the category. It’s not the fluid, dynamic web of language we call a conversation.

As Dr. Katharine Frase states in the documercial, normal humans communicate more in a style she calls ‘open-question answering.’ There is a small bit of chaos in our interactions with one another.

This idea of chaotic conversation leads to more roads, and another blog for next week.

For the record, the final totals after the two games left Watson and IBM with $77,147, Jennings with $24,000 and Rutter at the bottom with $21,600. IBM says it will donate the money to charity.

10 February 2011

A Rape by Any Other Name

Last week, the Republicans in the House of Representatives removed a provision from at Act on the floor that would have been detrimental to women's rights and sent the wrong message to the public. Not to mention the damage it would do to language.

The 'No Taxpayer Funding for an Abortion Act' originated with Chris Smith, a Republican Representative from New Jersey. The purpose is pretty obvious—prevent the use of government money earmarked for health care and medicare being used to pay for abortions. The obvious goal behind this is to chip away at the current stance the government takes on abortion, with the ultimate goal to make it illegal. The stated impetus is to save government money in a time when our deficit is measured in numbers generally reserved for grade school children attempting hyperbole.

I won't get into the complicated and muddied topic of abortion; that's not in the scope of this blog, nor is it a topic i have considerable knowledge it. Suffice it to say, i know it's not always as black and white as each side paints it, so let's leave it there.

Instead, i want to talk about one word used in the No Taxpayer Funding for an Abortion Act, from this point referred to as House Resolution 3, or HR3. So much more concise. In the resolution, one passage stands out above the rest. It states the Act 'shall not apply to an abortion' if, for instance, 'the pregnancy occurred because the pregnant female was the subject of an act of forcible rape or, if a minor, an act of incest.' The document has since been changed to remove the word 'forcible,' but only after websites such as Mother Jones and MoveOn.org brought attention to the matter.

If 'forcible' were included in the bill, it would leave certain women up a creek without an option, for example, a woman raped while drugged, or a mentally challenged woman taken advantage of, or even certain instances of date rape.

I read a passage in President Obama's book The Audacity of Hope that rang true for this circumstance.
'Much of the time, the law is settled and plain. But life turns up new problems, and lawyers, officials, and citizens debate the meaning of terms that seemed clear years or even months before. For in the end laws are just words on a page—words that are sometimes malleable, opaque, as dependent on context and trust as they are in a story or poem or promise to someone, words whose meanings are subject to erosion, sometimes collapsing in the blink of an eye.'


What Obama refers to is the trouble we often have with interpretation of laws, often even the Constitution. We like to think of our laws, and especially our Constitution, as being immutable and firm. Firm they may be, but never immutable. The three branches of government are constantly re-examining documents and laws to further refine them.

The problem with HR3 is how it attempted to shortcut the process by refining the definition of 'rape.' Whomever was inspired to add the word 'forcible' probably thought they were being smart, removing the case for statutory rape, but being compassionate and allowing for abortions for those women who were physically beaten into submission.

Whether HR3 passes and becomes law is still to be seen, but if it does, it will allow for the original, broader and more compassionate, coverage.

03 February 2011

The Pyramids Are Revolting

In response to the recent unrest in Egypt, the government in China has been closely monitoring the internet, specifically preventing netizens from searching for the term 'Egypt' on social networking sites, as well as keeping the news coverage in official channels to the bare minimum.

Censorship in China is no new practice, even when it comes the internet. Since 2003, they have operated what is known as the Great Firewall of China to be able to prevent their citizens from gaining access to certain sites, IP addresses, and keyword searches.

When it comes to technology, especially when using it to prevent people from doing something, there are always work-arounds. For the Great Firewall, proxy servers outside of China, virtual private networks and various free programs allow, to varying extents, access to information and websites not allowed by the Great Firewall.

As for the keyword searches, there is an even easier fix: different keyword. In order to better monitor their citizens, China has their own version of Twitter called Weibo. It is one of the sites which prevents users from searching for the 'Egypt.' As any high school student in the United States can tell you, when one word is disallowed, another more innocuous word takes its place. Where Chinese citizens cannot search for 'Egypt,' they might very well be looking around for posts and articles about 'pyramids,' 'Nile,' 'Cairo,' or countless other new keywords.

Short of killing the entire country's internet, dissident information will always leak to the people looking for it. Even without internet, people are able to coordinate well enough to disseminate information, and this Wired article tells you how.

That's the beauty of language for me; like Malcolm tells us in Jurassic Park, '[language] finds a way.' As literacy grows, so does knowledge. As knowledge grows, so do the possibilities. Language and literacy are doors that open to wider worlds, not smaller. I guess that's what makes it so scary to the Chinese government.

27 January 2011

An Overnumerousness of Letters

Last week, NPR's Robert Krulwich threw down a gauntlet. He didn't know he had, but he did. He wrote an article concerning the longest word in the English language. He lists a few candidates, then goes down the list, taking them down one by one.

First there was honorificabilitudinitatibus. This 27-letter monstrosity was coined by Shakespeare for Love's Labour's Lost and means 'loaded with honors.' Krulwich disqualifies it because it was likely crafted for the sole purpose of being lengthy.

Then there is the long favorite among schoolmates, mostly because it's both fun to say, intelligent sounding and semi-comprehendible. Antidisestablishmentarianism is also disqualified as being only 'attention-grabbing hummock.' By the way, this collection of 28 letters describes the political position that opposed the disestablishment of the Church of England in the 19th century.

The word Krulwich eventually settles on is one we all know from our classic Disney films, supercalifragilisticexpialidocious. Invented without meaning and sung by Julie Andrews and Dick Van Dyke in Mary Poppins, this word, at 34 letters long, wins the prize for longest non-technical word, at least according to our friend at NPR.

(There are a slew of other words, mostly within the scientific community, that are far longer and even less interesting, but since they are technical, and mostly consist of stacking prefixes on top of one another to describe molecular bonds, they don't qualify, especially since they are very rarely spelled as words. The longest molecule wouldn't qualify, having published in the much-appreciated shorthand, but never written out as a word. The one that slips through is 1,185 letters long and is a tobacco protein. Yet, being a technical word, it doesn't qualify, rightfully so.)

William Lee Adams at Time NewsFeed picked up Krulwich's gauntlet and wrote his own article about long words. WIthout the technical muscle of Krulwich, he does little more than stoke the flames. Where the NPR article dismissed one word, the NewsFeed article quotes author Sam Kean's opinion of the matter with as near a final word on the matter as it gets. Kean's view is that the slang word pneumonoultramicroscopicsilicovolcanokoniosis, at 45 letters long, is worthy of the title. It refers to a disease you acquire when breathing in silicon dioxide.

The NewsFeed article then lists a few slightly-off topic locals with longer-than necessary names. One is in Wales, another in New Zealand, and the longest is the ceremonial name for Bangkok. Since they are all in Welsh, Maori, and Thai, respectively, none of them qualify as the longest word in English, which is what we actually care about.

Jason Kottke briefly steps into the mix (he's always brief, so it's allowed) with his choice in the matter: twoallbeefpattiesspecialsaucelettuce-cheesepicklesonionsonasesameseedbun. It's a definitive non-word, but still fun to say: two-all-beef-patties-special-sauce-lettuce-cheese-pickles-onions-on-a-sesame-seed-bun. Kottke also links to the Wikipedia article about the longest words, which lists one none of the previous articles did.

At 35 letters, it beats out Mary Poppins by one letter, and unlike silicon dioxide it's not technical. Hippopotomonstrosesquipedaliophobia means the fear of long words. You could argue that it's redundant, since sesquipedaliophobia means the same thing, AND is used professionally. However, what we call a jaw, doctors refer to it as a mandible; both are still correct.

My vote is cast. I say hippopotomonstrosesquipedaliophobia is the longest word in the English language. Maybe it's the humor in it all. Much like the humor in this clip from The IT Crowd.





(For the record, Negative One's word is real; Moss' word is not. I fact-checked on the Oxford English Dictionary online, which has a promotional trial through February 5. Use 'trynewoed' as both username and password for access.)

20 January 2011

Vocabulary

I'm the first to admit my vocabulary has not grown in the last few years. I have not worked on growing it. As someone who goes on about how much he enjoys words, how important words are to our lives, i stink at learning new words to enhance my worldview. Yes, enhance my worldview. Learning a new language expands your cultural worldview. On a smaller scale, learning new words in your native language expands ways to see your own culture.

For a bit of fun, here are two videos of Conan.

Conan's Campaign to Bring Back Thrice




Thrice Returns Once More


Conan has the right idea. It seems every week there's a new word crafted that lasts about as long as a disposable cup (and i'm not talking about the styrofoam kind). For example, 'hevage' is a word created to describe male cleavage; 'facepalm' is the act of striking your face with your palm due to another stupidity; and a 'dork knob' is a short ponytail. Do we need these words? Not really.

So how does one go about learning new words? Word-a-Day Calendars?

If you'd like, yes, but a more standard answer is reading slightly above your level. Another answer is adopting a word. SaveTheWord.org lists thousands of little-used English words one can 'adopt' by pledging to use said word in everyday conversation as often as possible. I am adopting labascate, which means to begin to fall or slip. With how i walk, this will get used, i'm certain.

If you like a bit of fun to learning, try out FreeRice.com, a vocab game that donates 10 grains of rice for every correct answer. It starts out easy, then gets harder the more you succeed, until it hits upon your reading level, which is where you learn many fabulous words, like quadrennium, hydrophyte, and numismatics. I made it to level 40 without missing a word. That's right, i'm throwing down the gauntlet, to see who picks it up.

I hope i only labascate, as i invariably will, and don't succumb to the whims of gravity and introduce my face to the pavement after such an invitation for word sport. (See what a little bit of learning does?)

14 January 2011

Metaphors, Superlatives and the Political Discourse

The attack on Representative Gabrielle Giffords last week was an eye-opener and a forceful shock to the entire country. One of the debates that has begun because of the attack is one i wished had started without such a high cost. As a country, we have now begun to focus on the harsh rhetoric of partisan politics. Much of the blame has been unfairly pinned on Sarah Palin. I’ve made it a known how much of a fan of Palin i truly am (not). Yet even if i don’t agree with her politics, or her use of language, i don’t think it fair to paint her as the source of the attack, however indirect.

Yes, Palin marked a map with crosshairs over the representative seats she and other Republicans wanted to take back in the midterm elections. She also used the phrase ‘Don’t retreat . . . reload’ to describe her strategy after a loss. Couple those two together, and after Republicans lost to Giffords in the midterms in a fierce Republican district, it could be interpreted as a call to arms: if the recipient of the call had no concept of illustrative language.

We use it all the time, mostly in the form of similes, metaphors and superlatives.

A simile uses ‘like’ or ‘as’ to compare two things. (The cat is acting like it doesn’t love us.)

A metaphor uses non-literal language to describe something. (You scratch my back, i’ll scratch yours.)

Superlatives are the over-the-top language we use all too often in every day life. (That splinter was the worst pain i’ve ever been in.)

When Palin talks of reloading, and puts crosshairs on a map of United States Representatives, she is not being literal, she is using both metaphorically. Not only that, but she’s not the only political figure using language in this way. The political dialogue for as long as i can remember has been characterized by superlatives and metaphors, especially concerning war imagery. Granted, i’m only 27 and didn’t pay attention for many of those years.

Not to let Pain off the hook. (See, another metaphor; they fill our language like the air we breathe.) During a video response she filmed in response to the accusations thrown at her, she used the phrase ‘blood libel:’

Acts of monstrous criminality stand on their own. Especially within hours of a tragedy unfolding, journalists and pundits should not manufacture a blood libel that serves only to incite the very hatred and violence that they purport to condemn. That is reprehensible.

Being nearly eight minutes long and with the reflection of the monitor in her glasses, we know the speech was written in advance and not an off-the-cuff response, which makes her use of the term ‘blood libel’ even more astonishing. Historically, it is a reprehensible and false accusation against Jews of killing children and using their blood in religious rites. To use the term this way, especially when Gifford is herself Jewish, is not only careless, but irresponsible.

It presents a further illustration of why the language surrounding and used in our national political dialogue needs to be carefully considered and intentionally weighed for merit and value. You can try blaming the shrinking sound bite, but at least some are saying they are a symptom and not the cause of our increasingly degraded political conversation.

What we can do is applaud the politicians who are doing their best to change not only the language they use, but the environment surrounding the political discourse. A few politicians are calling for mixed seating during the State of the Union address later this month. Historically, the House and Senate have been seated according to the party line, a visible split in how our government runs. This intermingling shows promise to begin a transformation in dialogue we so desperately need.