Yet again, Wikipedia is about to break new ground. The Web site that has become one of the biggest open repositories of knowledge is due — within the next week or so — to hit the mark of 3 million articles in English.
It’s all a very long way from January 2001, when Wikipedia launched. Its first million articles took five years to put together, but the second was achieved just 12 months later. It was not just the number of articles that grew, but also the number of people involved in creating them. During Wikipedia’s first burst of activity between 2004 and 2007, the number of active users on the site rocketed from just a few thousand to more than 300,000.
However, statistics released by the site’s analytics team suggest Wikipedia’s explosive growth is all but finished. The quickening pace that helped the site reach the 2 million article milestone suddenly evaporated: adding the next million has taken nearly two years. While the encyclopedia is still growing, the number of articles being added has reduced from an average of 2,200 a day in July 2007 to around 1,300 a day.
Elsewhere, the number of active Wikipedians (those contributing to the site in some way) now comes in at just under 500,000. That is a 61 percent increase in the past two years; hardly shabby, but nowhere near the increases seen in the past. At the same time, however, the base of highly active editors (who contribute new words to the project and marshall the billions of pieces of information the site contains) has remained more or less static.
It looks as though Wikipedia is stagnating. Why?
One of those who has spent his time studying what happens on Wikipedia is Ed Chi, a scientist at the Palo Alto Research Center (PARC) in California. His team, the Augmented Social Cognition group, wanted to understand what was happening on the site in order to build better collaborative software.
“For a long time, the understood model for all kinds of large knowledge systems on the Web was that they grow exponentially,” he said. “The accepted explanation was that the rich get richer — things that receive a lot of attention end up getting a lot more attention.”
Wikipedia fitted that model perfectly in its early days. However, when Chi and his colleagues looked at recent data, they realized this approach did not fit any more. But with a site as complex and sprawling as Wikipedia, simply crunching the numbers proved a major task in itself.
First they spent a significant amount of time downloading a carbon copy of Wikipedia: every article, every edit and every piece of information ever to cross the site’s servers. Even when compressed, the files stretched to an enormous 8 terabytes — the equivalent of more than 1,200 DVDs stuffed with information. Decompressing in preparation for analysis took almost a week, and when the group fed the data into their 60-machine computing cluster, they got some surprising results.
Chi’s team discovered that the way the site operated had changed significantly from the early days, when it ran an open-door policy that allowed in anyone with the time and energy to dedicate to the project. Today, they discovered, a stable group of high-level editors has become increasingly responsible for controlling the encyclopedia, while casual contributors and editors are falling away. Wikipedia — often touted as the bastion of open knowledge online — has become, in Chi’s words, “a more exclusive place.”
One of the measures the PARC team looked at was how often a user’s edit succeeds in sticking.
“We found that if you were an elite editor, the chance of your edit being reverted was something in the order of 1 percent — and that’s been very consistent over time from around 2003 or 2004,” he said.
Meanwhile, for those who did not invest vast amounts of time in editing, the experience was very different.
“For editors that make between two and nine edits a month, the percentage of their edits being reverted had gone from 5 percent in 2004 all the way up to about 15 percent by October 2008. And the ‘onesies’ — people who only make one edit a month — their edits are now being reverted at a 25 percent rate,” Chi said.
In other words, a change by a casual editor is more likely than ever to be overturned, while changes by the elite are rarely questioned.
“To power users it feels like Wikipedia operates in the way it always has — but for the newcomers or the occasional users, they feel like the resistance in the community has definitely changed,” he said.
While Chi said this does not necessarily imply causation, he suggests it is concrete evidence to back up what many people have been saying: that it is increasingly difficult to enjoy contributing to Wikipedia unless you are part of the site’s inner core of editors.
One person who typifies that feeling is Aaron Swartz, a 22-year-old programmer who lives in Cambridge, Massachusetts. Something of a wonder kid in the software development world, Swartz used to spend a lot of time working on Wikipedia — in 2006 he even stood for election to the Wikimedia Foundation, the organization behind the site. His bid failed. These days, he rarely checks in.
“I used to be one of the top editors ... now I contribute things here and there where I see something wrong.”
The reason, he explains, is that the site feels more insular and exclusive than in the past.
“In general, the biggest problem I have with the editors is their attitude,” he said. “They say: ‘We’re not going to explain how we make decisions, we basically talk amongst ourselves.’”
“There’s no place on Wikipedia that says: ‘Want to become a Wikipedia editor? Here’s how you do it.’ Instead, you basically have to really become part of that community and pick it up through osmosis and have the tradition passed down to you.”
Swartz’s experience correlates with the figures unearthed by PARC, even if his attitude is not shared by everyone.
Given the history of the online world — where escalating growth can continue for years — it seems unlikely that this gradual slowdown was inevitable. Instead, it could be the end result of a battle between two competing factions of Wikipedia editors.
On one side stand the deletionists, whose motto is “Wikipedia is not a junkyard”; on the other, the inclusionists, who argue that “Wikipedia is not paper.”
Deletionists argue for a tightly controlled and well-written encyclopedia that provides valuable information on topics of widespread interest. Why should editors waste time on articles about fly-by-night celebrities or willfully obscure topics? Inclusionists, on the other hand, believe that the more articles the site has, the better: if they are poorly referenced or badly written, they can be improved — and any article is better than nothing. After all, they say, there is no limit to the size of the site, and no limit to the information that people may want.
The two groups had been vying for control from early on in the site’s life, but the numbers suggest that the deletionists may have won. The increasing difficulty of making a successful edit; the exclusion of casual users; slower growth — all are hallmarks of the deletionist approach.
Swartz, an avowed inclusionist, says the deletionists have won — but says he understands their motivation.
“When Wikipedia is in the news, it’s always because someone found this inaccuracy, or somebody’s suing Wikipedia ... It’s always about how Wikipedia screwed up. So of course what they’re going to be worried about is not how to make Wikipedia grow and have more content, it’s about how we keep Wikipedia out of trouble and how we stop people from messing it up.”
Still, there remain unanswered questions. Could its growth ever halt completely? How big will the site be when the editors decide that the sum of human knowledge is catalogued? Could a new site take Wikipedia’s place by toeing an inclusionist line?
PARC’s research doesn’t give any answers, but Chi has identified one model that Wikipedia’s growth pattern matches.
“In my experience, the only thing we’ve seen these growth patterns [in] before is in population growth studies — where there’s some sort of resource constraint that results in this model.”
The site, he suggests, is becoming like a community where resources have started to run out.
“As you run out of food, people start competing for that food, and that results in a slowdown in population growth and means that the stronger, more well-adapted part of the population starts to have more power,” he said.
The White House’s decision to take a 9.9 percent stake in Intel Corp is looking like very shrewd business indeed. Since the government bought in at US$20.47 a share last August, the US chipmaker’s surging stock price has delivered the US a US$43 billion return. One of the reasons the investment has so far proved so sound is that the White House has made sure of it. According to The Wall Street Journal, Howard personally pushed deals on Intel’s behalf with some of the most lucrative clients imaginable. They include Nvidia Corp, the company at the heart of the AI
A single photograph can cut through a lot of noise, but it can also be used to misrepresent the truth. At the very least, it can concentrate the mind on something that requires further investigation. On Monday last week, Ma Ying-jeou Foundation CEO Tai Hsia-ling (戴遐齡) and former National Security Council secretary-general King Pu-tsung (金溥聰) held a news conference in which they showed a photograph of former foundation CEO Hsiao Hsu-tsen (蕭旭岑), now Chinese Nationalist Party (KMT) deputy chairman. In the image Hsiao is seated next to Xiamen Taiwan Businessmen Association chairman Han Ying-huan (韓螢煥). The two men were holding
I first met Professor Ray Jiing (井迎瑞) as a film and documentary student at Shih Hsin University’s (SHU) Department of Radio Television and Film in 1988. The following year, he went on to become the director of the Chinese Taipei Film Archive — forerunner of the Taiwan Film and Audiovisual Institute (TFAI). Over his eight-year tenure, Jiing rescued and restored over 200 classic Taiwanese films. In 1997, he established the Graduate Institute of Studies in Documentary and Film Archiving at Tainan National University of the Arts (TNNUA), and I joined the program in his third cohort of students. Beyond a
A recent report concerning a student who is suing his teacher posed the question in its headline: Does failing a student in two subjects constitute bullying? The college student in Chiayi County apparently sought NT$2 million (US$63,603) in state compensation, but a court dismissed the case. The first reaction of many might have been to ask: What has happened to students nowadays? Some say that teachers have lost their authority, while others say students are overindulged. Some even start reminiscing over the days when “whatever the teacher says goes.” However, the real issue might be overlooked if emotional reactions like that are the