Computers and Prejudice

April 6th, 2010
ism, nlp
Consider the following fake snippet of news reporting:

Sarah Jones reporting for News Station X. The CEO of Big Powerful Company announced today that BPC was on track to meet its earnings targets for the quarter. She went on to attribute the continuing success of BPC to the quality of its employees.

Imagine I were to tell you that I interpreted the "she" as being a reference back to "sarah jones" instead of "the ceo of big powerful company". What if I say that this is because I think of "ceo" as a male-gendered word, so I interpret the "she" as being more likely to refer to "sarah jones". You'd say I was being sexist, no? The assumption that a ceo is male is a sexist assumption. I'd agree with you.

Now imagine I am working on a computer program that is supposed to learn how to connect pronouns (and other words) back to their antecedents. (I am, in fact, working on such a program) Imagine it learns, from lots and lots of data, that ceos are much more likely to be male than female. So it makes the mistake I described above, identifying "she" as the reporter instead of the ceo. Is the program being sexist? Were the programmers sexist in desigining the program with the capacity to learn the gender of words? Is there sexism here, aside from that which results in there being more male ceos than female?

Comment via: facebook

Recent posts on blogs I like:

Facts I Learned From A City On Mars

Space bastardry!

via Thing of Things April 14, 2025

Which Came First, the Chicken or the Egg?

When I thought about this question it was really hard to figure out because the way it's phrased it's essentially either a chicken just pops into existence, or an egg just pops into existence, without any parent animals involved. I thought about t…

via Lily Wise's Blog Posts April 13, 2025

Advice for time management as a manager

have accurate expectations of yourself • prioritize ruthlessly • unemploy your future self • a five-step “help, I’m overwhelmed” checklist • carve out focused time

via benkuhn.net April 1, 2025

more     (via openring)