Zipf’s Law describes one aspect of the statistical distribution in words in language: if you rank words by their frequency in a sufficiently large collection of texts and then plot the frequency against the rank, you get a logarithmic curve (or, if you graph on a log scale, you get a straight line).  In other words, there is a small number of words that occur quite often, and then a very large number of words that occur at the statistical equivalent of zero–but, they do occur.  What this means for the second-language learner is that every single day you will come across words that you don’t know.

I’ve been studying French for the past couple years, and this blog is primarily about new words that I came across in the course of my day–mostly when I’m in France, but also just listening to the radio in the US.  I also occasionally write about Spanish, since I spend one week a year in Guatemala, and I write about medical vocabulary a lot, since I go there with a group of surgeons.  When I started getting quite a few international readers, I began adding links to the definitions of English words, especially obscure and slang ones.  If you’re only interested in French or Spanish, use the tags to find posts on one or the other.  If you’re only interested in vocabulary or grammar, look for the vocabulary tag or the morphosyntax tag–any post that doesn’t have one of those tags is probably all cultural notes.  I’m a computational linguist by training, and every once in a while I write about linguistics and/or about computer programs that process language in some way or another.

Most (although not all) of the definitions that I give are from the WordReference.com web site.  I often put a link to the WordReference page–you can recognize linked words by the fact that they are underlined.

If you’re interested in reading more about Zipf’s Law, click here for the Wikipedia page.  For some popular posts from this blog, check out the Best of Zipf’s Law page.

45 thoughts on “About”

  1. how very peculiar! I did “Math Elem” for my bac years ago (still exist?) but never thought of mixing maths and language. This might fit in with all the Big Data stuff we have today, but I won’t go into that, I’ll just follow this blog for a bit and see what’s interesting 🙂


  2. Pingback: Zipf's Law
  3. I saw on your website an example of a paper that had been “evaluated” by someone: http://www.clipular.com/c/5009767192068096.png?k=IQA_ittEWl6rRwuC3f3J5n2b72Y As a French teacher and a native-speaker, I was astonished to see that the teacher would be correcting uses of words that are absolutely correct: gerunds are used in French and there is nothing wrong with them. Many famous authors used lists like these and there are examples of great literature.


Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s

Curative Power of Medical Data

JCDL 2020 Workshop on Biomedical Natural Language Processing


Criminal Curiosities


Biomedical natural language processing

Mostly Mammoths

but other things that fascinate me, too


Adventures in natural history collections

Our French Oasis


ACL 2017

PC Chairs Blog

Abby Mullen

A site about history and life

EFL Notes

Random commentary on teaching English as a foreign language

Natural Language Processing

Université Paris-Centrale, Spring 2017

Speak Out in Spanish!

living and loving language




Exploring and venting about quantitative issues

%d bloggers like this: