The line between science and journalism is getting blurry….again


Human #1: “Hello, nice weather today, isn’t it?”

Human #2: “Ummm…actually not. It’s a gray, cold, windy, rainy kind of day!”

Many a joke depends on confusion about the meaning of language, as in the example above. But understanding the sources of such confusion is important in realms other than stand-up comedy, including in the attempts to convey facts about the world to one’s target audience.

In the example above, Human #1 is using phatic language, sometimes referred to as ‘small talk’ and usually exemplified, at least in the British Isles, by talk about the highly unpredictable weather. (image: by striatic on Flickr)

Phatic language

Phatic discourse is just one of several functions of language. Its role is not to impart any factual information, but to establish a relationship between the people. It conveys things like emotional state, relative social status, alliance, intentions and limits to further conversation (i.e., where the speaker “draws the line”).

If a stranger rides into a small town, a carefully chosen yet meaningless phrase establishes a state of mind that goes something like this: “I come in peace, mean no harm, I hope you accept me in the same way”. The response of the local conveys how the town looks at strangers riding in, for example: “You are welcome…for a little while – we’ll feed you and put you up for the night, but then we hope you leave”. (image: Clint Eastwood in ‘Fistful of Dollars’ from Squidoo)

An important component of phatic discourse is non-verbal communication, as the tone, volume and pitch of the voice, facial expression and body posture modify the language itself and confirm the emotional and intentional state of the speaker.

It does not seem that linguistics has an official term for the opposite – the language that conveys only pure facts – but the term usually seen in such discussions (including the domain of politics and campaigning) is “Conceptual language” so this is what I will use here. Conceptual language is what Human #2 in the joke above was assuming and using – just the facts, ma’am.

Rise of the earliest science and journalism

For the sake of this article, I will use two simplified definitions of science and journalism.

Journalism is communication of ‘what’s new’. A journalist is anyone who can say “I’m there, you’re not, let me tell you about it.”

Science is communication of ‘how the world works’. A scientist is anyone who can say “I understand something about the world, you don’t, let me explain it to you”.

Neither definition necessitates that what they say is True, just what they know to the best of their ability and understanding.

Note that I wrote “science is communication”. Yes, science is the process of discovery of facts about the way the world works, but the communication of that discovery is the essential last step of the scientific process, and the discoverer is likely to be the person who understands the discovery the best and is thus likely to be the person with the greatest expertise and authority (and hopefully ability) to do the explaining.

For the greatest part of human history, none of those distinctions made any sense. Most of communication contained information about what is new, some information about the way the world works, and a phatic component. Knowing how the world works, knowing what is happening in that world right now, and knowing if you should trust the messenger, were all important for survival.

For the most part, the information was local, and the messengers were local. A sentry runs back into the village alerting that a neighboring tribe, painted with war-paints, is approaching. Is that person a member of your tribe, or a stranger, or the well-known Boy Who Cried Wolf? What do you know about the meaning of war-paint? What do you know about the neighboring tribe? Does all this information fit with your understanding of the world? Is information coming from this person to be taken seriously? How are village elders responding to the news? Is this piece of news something that can aid in your personal survival?

For the longest time, information was exchanged between people who knew each other to some degree – family, neighbors, friends, business-partners. Like in a fishing village, the news about the state of fishing stocks coming from the ships at sea is important information exchanged at the local tavern. But is that fish-catch information ‘journalism’ (what’s new) or ‘science’ (how the world works)? It’s a little bit of both. And you learn which sailors to trust by observing who is trusted by the locals you have already learned to trust. Trust is transitive.

Someone in the “in-group” is trusted more than a stranger – kids learned from parents, the community elders had the authority: the trust was earned through a combination of who you are, how old you are, and how trustworthy you tended to be in the past. New messengers are harder to pin down on all those criteria, so their information is taken with a degree of skepticism. The art of critical thinking (again, not necessarily meaning that you will always pick the Truth) is an ancient one, as it was essential for day-to-day survival. You trust your parents (or priests or teachers) almost uncritically, but you put up your BS filters when hearing a stranger.

Emergence of science and of journalism

The invention of the printing press precipitated the development of both journalism and science. But that took a very long time – almost two centuries (image: 1851, printing press that produced early issues of Scientific American). After Gutenberg printed the Bible, most of what people printed were political pamphlets, church fliers and what for that time and sensibilities went for porn.

London Gazette of 1666 is thought to be the first newspaper in the modern sense of the word. (image: from DavidCo) Until then, newspapers were mostly irregular printings by individuals, combining news, opinion, fiction and entertainment. After this, newspapers gradually became regular (daily, weekly, monthly) collections of writings by numerous people writing in the same issue.

The first English scientific journal was published a year before – the Philosophical Transactions of the Royal Society of London in 1665 (image: Royal Society of London).

Until then, science was communicated by letters – those letters were often read at the meetings of scientists. Those meetings got formalized into scientific societies and the letters read at such meetings started getting printed. The first scientific journals were collections of such letters, which explains why so many journals have the words “Letters”, “Annals” or “Proceedings” in their titles.

Also, before as well as for quite a long time after the inception of the first journals, much of science was communicated via books – a naturalist would spend many years collecting data and ideas before putting it all in long-form, leather-bound form. Those books were then discussed at meetings of other naturalists who would often respond by writing books of their own. Scientists at the time did not think that Darwin’s twenty-year wait to publish The Origin was notable (William Kimler, personal communication) – that was the normal timeline for research and publishing at the time, unusual only to us from a modern perspective of 5-year NIH grants and the ‘publish or perish’ culture.

As previously oral communication gradually moved to print over the centuries, both journalistic and scientific communication occurred in formats – printed with ink on paper – very similar to blogging (that link leads to the post that served as a seed from which this article grew). If born today, many of the old writers, like Montaigne, would be Natural Born Bloggers (‘NBBs’ – term coined by protoblogger Dave Winer). A lot of ship captains’ logs were essentially tweets with geolocation tags.

People who wanted to inform other people printed fliers and pamphlets and books. Personal letters and diaries were meant to be public: they were as widely shared as was possible, they were publicly read, saved, then eventually collected and published in book-form (at least posthumously). Just like blogs, tweets and Facebook updates today….

The 18th century ‘Republic of Letters’ (see the amazing visualization of their correspondence) was a social network of intellectual leaders of Europe who exchanged and publicly read their deep philosophical thoughts, scientific ideas, poetry and prose.

Many people during those centuries wrote their letters in duplicate: one copy to send, one to keep for publishing Collected Letters later in life. Charles Darwin did that, for example (well, if I remember correctly, his wife made copies from his illegible originals into something that recipients could actually read), which is why we have such a complete understanding of his work and thought – it is all well preserved and the availability of such voluminous correspondence gave rise to a small industry of Darwinian historical scholarship.

What is important to note is that, both in journalism and in science, communication could be done by anyone – there was no official seal of approval, or licence, to practice either of the two arts. At the same time, communication in print was limited to those who were literate and who could afford to have a book printed – people who, for the most part, were just the wealthy elites. Entry into that intellectual elite from a lower social class was possible but very difficult and required a lot of hard work and time (see, for example, a biography of Alfred Russel Wallace). Membership in the worlds of arts, science and letters was automatic for those belonging to the small group of literate aristocracy. They had no need to establish formalized gatekeeping as bloodlines, personal sponsorship and money did the gatekeeping job quite well on their own.

As communication has moved from local to global, due to print, trust had to be gained over time – by one’s age, stature in society, track record, and by recommendation – who the people you trust say you should trust. Trust is transitive.

Another thing to note is that each written dispatch contained both ‘what’s new’ and ‘how the world works’ as well as a degree of phatic discourse: “This is what happened. This is what I think it means. And this is who I am so you know why you should trust me.” It is often hard to tell, from today’s perspective, what was scientific communication and what was journalism.

Personal – and thus potentially phatic – communication was the norm in early scientific publishing. For example, see “A Letter from Mr J. Breintal to Peter Collinson, F.R.S. containing an Account of what he felt after being bit by a Rattle-snake” in Philosophical Transactions, 1747 – a great account of it can be found at Neurotic Physiology. It is a story of a personal interaction with a rattlesnake and the discovery leading from it. It contained “I was there, you were not, let me tell you what happened” and “I understand something, you don’t, let me explain that to you” and “Let me tell you who I am so you can know you can trust me”.

Apparently, quite a lot of scientific literature of old involved exciting narratives of people getting bitten by snakes – see this one from 1852 as well.

The anomalous 20th century – effects of technology

The gradual changes in society – invention of printing, rise of science, rise of capitalism, industrial revolution, mass migration from rural to urban areas, improvements in transportation and communication technologies, to name just a few – led to a very different world in the 20th century.

Technology often leads societal changes. If you were ever on a horse, you understand why armies that used stirrups defeated the armies that rode horses without this nifty invention.

Earlier, news spread much more slowly (see image: Maps of rates of travel in the 19th century). By 1860 the telegraph had reached St. Louis. During its short run the Pony Express could go the rest of the way to San Francisco in 10 days. After that, the telegraph followed the rails. The first transcontinental rail line was completed in 1869. Except for semaphores (1794), information before the telegraph (1843) could only travel as fast as a rider or boat (thanks to John McKay for this brief primer on the history of the speed of communication in North America; I am assuming that Europe was slightly ahead and the rest of the world somewhat behind).

The 20th century saw invention or improvement of numerous technologies in transportation – cars, fast trains, airplanes, helicopters, space shuttles – and in communication – telephone, radio, and television. Information could now travel almost instantly.

But those new technologies came with a price – literally. While everyone could write letters and send them by stagecoach, very few people could afford to buy, run and serve printing presses, radio stations and television studios. These things needed capital, and increasingly became owned by rich people and corporations.

Each inch of print or minute of broadcast costs serious money. Thus, people were employed to become official filters of information, the gatekeepers – the editors who decided who will get access to that expensive real estate. As the editors liked some people’s work better than others, those people got employed to work in the nascent newsrooms. Journalism became professionalized. Later, universities started journalism programs and codified instruction for new journalists, professionalizing it even more.

Instead of people informing each other, now the few professionals informed everyone else. And the technology did not allow for everyone else to talk back in the same medium.

The broadcast media, a few large corporations employing professional writers informing millions – with no ability for the receivers of information to fact-check, talk back, ask questions, be a part of the conversation – is an exception in history, something that lasted for just a few decades of the 20th century.

The anomalous 20th century – industrialization

The Industrial Revolution brought about massive migration of people into big cities. The new type of work required a new type of workforce, one that was literate and more educated. This led to the invention of public schools and the foundation of public universities.

In the area of science, many more people became educated enough (and science was not yet complex and expensive) to start their own surveys, experiments and tinkering. The explosion of research led to an explosion of new journals. Those too became expensive to produce and started requiring professional filters – editors. Thus scientific publishing also became professionalized. Not every personal anecdote could make it past the editors any more. Not everyone could call themselves a scientist either – a formal path emerged, ending with a PhD at a university, that ensured that science was done and published by qualified persons only.

By the 1960s, peer review, which some journals had tried experimentally a little earlier, was adopted en masse by scientific journals. Yes, it is that recent! See for example this letter to Physical Review in 1936:


Dear Sir,

We (Mr. Rosen and I) had sent you our manuscript for publication and had not authorized you to show it to specialists before it is printed. I see no reason to address the — in any case erroneous — comments of your anonymous expert. On the basis of this incident I prefer to publish the paper elsewhere.


Albert Einstein

Or this one:


John Maddox, former editor of Nature: The Watson and Crick paper was not peer-reviewed by Nature… the paper could not have been refereed: its correctness is self-evident. No referee working in the field … could have kept his mouth shut once he saw the structure…

Migration from small towns into big cities also meant that most people one would meet during the day were strangers. Meeting a stranger was not something extraordinary any more, so the emergence and enforcement of proper prescribed conduct in cities replaced the need for one-to-one encounters and sizing up strangers using phatic language. This is why even today phatic language is much more important and prevalent in rural areas, where it aids personal survival, than in urban centers, where more general rules of behavior among strangers emerged (which may partially explain why phatic language is generally associated with conservative ideology and conceptual language with political liberalism, aka the “reality-based community”).

People moving from small hometowns into big cities also led to breaking up of families and communities of trust. One needed to come up with new methods for figuring out who to trust. One obvious place to go was local media. They were stand-ins for village elders, parents, teachers and priests.

If there were many newspapers in town, one would try them all for a while and settle on one that best fit one’s prior worldview. Or one would just continue reading the paper one’s parents read.

But other people read other newspapers and brought their own worldviews into the conversation. This continuous presence of a plurality of views kept everyone’s BS filters in high gear – it was necessary to constantly question and filter all the incoming information in order to choose what to believe and what to dismiss.

The unease with the exposure to so many strangers with strange ideas also changed our notions of privacy. Suddenly we craved it. Our letters are now meant for one recipient only, with the understanding that they will not be shared. Personal diaries now have locks. After a century of such craving for privacy, we are again returning to more historically traditional notions, by much more freely sharing our lives with strangers online.

The anomalous 20th century – cleansing of conceptual language in science and journalism

Until the 20th century we did not see the consolidation of media into large conglomerates, and of course, there were no mass radio or TV until mid-20th century. Not until later in the century did we see the monopolization of local media markets by a single newspaper (competitors going belly-up) which, then, had to serve everyone, so it had to invent the fake “objective” HeSaidSheSaid timid style of reporting in order not to lose customers of various ideological stripes and thus lose advertising revenue.

Professionalising of journalism, coupled with the growth of media giants serving very broad audiences, led to institutionalization of a type of writing that was very much limited to “what’s new”.

The “let me explain” component of journalism fell out of favor as there was always a faction of the audience that had a problem with the empirical facts – a faction that the company’s finances could not afford to lose. The personal – including the phatic – was carefully eliminated as it was perceived as unobjective and inviting the criticism of bias. The way for a reporter to inject their own opinion into the article was to find a person who thinks the same in order to get the target quote. This was a defensive (perhaps cowardly) move that became the norm and, once the audience caught on, led to the loss of trust in traditional media.

Reduction of local media to a single newspaper, a couple of local radio stations and a handful of broadcast TV channels (that said essentially the same thing) left little choice for the audience. With only one source in town, there was no opportunity to filter among a variety of news sources. Thus, many people started unquestioningly accepting what 20th-century style broadcast media served them.

Just because articles were under the banners of big companies did not make them any more trustworthy by definition, but with no alternative it is still better to be poorly informed than not informed at all. Thus, in the 20th century we gradually lost the ability to read everything critically, awed by the big names like NYT and BBC and CBS and CNN. Those became the new parents, teachers, tribal elders and priests, the authority figures whose words are taken unquestioningly.

In science, explosion in funding not matched by explosion of job positions, led to overproduction of PhDs and a rise of hyper-competitive culture in academia. Writing books became unproductive. The only way to succeed is to keep getting grants and the only way to do that is to publish very frequently. Everything else had to fall by the wayside.

False measures of journal quality – like the infamous Impact Factor – were used to determine who gets a job and tenure and who falls out of the pipeline. The progress of science led inevitably to specialization and to the development of specialized jargon. Proliferation of expensive journals ensured that nobody but people in highest-level research institutions had access to the literature, so scientists started writing only for each other.

Scientific papers became dense, but also narrowed themselves to only “this is how the world works”. The “this is new” became left out as the audience already knew this, and it became obvious that a paper would not be published if it did not produce something new, almost by definition.

And the personal was so carefully excised for the purpose of seeming unbiased by human beings that it sometimes seems like the laboratory equipment did all the experiments of its own volition.

So, at the close of the 20th century, we had a situation in which journalism and science, for the first time in history, completely separated from each other. Journalism covered what’s new without providing the explanation and context for new readers just joining the topic. Science covered only explanation and only to one’s peers.

In order to bridge that gap, a whole new profession needed to arise. As scientists understood the last step of the scientific method – communication – to mean only ‘communication to colleagues’, and as regular press was too scared to put truth-values on any statements of fact, the solution was the invention of the science journalist – someone who can read what scientists write and explain that to the lay audience. With mixed success. Science is hard. It takes years to learn enough to be able to report it well. Only a few science journalists gathered that much expertise over the years of writing (and making mistakes on the way).

So, many science journalists fell back on reporting science as news, leaving the explanation out. Their editors helped in that by severely restricting the space – and good science coverage requires ample space.

A good science story should explain what is known by now (science), what the new study brings that is new (news) and why that matters to you (phatic discourse). The lack of space usually led to omission of context (science) and shortening of what is new (news), thus leaving only the emotional story intact. Thus, the audience did not learn much, certainly not enough to be able to evaluate next day’s and next week’s news.

This format also led to the choice of stories. It is easy to report in this way if the news is relevant to the audience anyway, e.g., concerning health (the “relevant” stories). It is also easy to report on misconduct of scientists (the “fishy” stories) – which is not strictly science reporting. But it was hard to report on science that is interesting for its own sake (the “cool” stories).

What did the audience get out of this? Scientists are always up to some mischief. And every week they change the story as to what is good or bad for my health. And it is not very fun, entertaining and exciting. No surprise that science as endeavour slowly started losing trust with the (American) population, and that it was easy for groups with financial, political or religious interests to push anti-science rhetoric on topics from hazards of smoking to stem-cell research to evolution to climate change.

At the end of the 20th century, thus, we had a situation in which journalism and science were completely separate endeavors, and the bridge between them – science journalism – was unfortunately operating under the rules of journalism and not science, messing up the popular trust in both.

Back to the Future

It is 2010. The Internet has been around for 30 years, the World Wide Web for 20. It took some time for the tools to develop and spread, but we are obviously undergoing a revolution in communication. I use the word “revolution” because it is so almost by definition – when the means of production change hands, this is a revolution.

The means of production, in this case the technology for easy, cheap and fast dissemination of information, are now potentially in the hands of everyone. When the people formerly known as the audience employ the press tools they have in their possession to inform one another, we call that ‘citizen journalism.’ And some of those citizens possess much greater expertise on the topics they cover than the journalists that cover that same beat. This applies to science as well.

In other words, after the deviation that was the 20th century, we are going back to the way we have evolved as a species to communicate – one-to-one and few-to-few instead of one-to-many. Apart from technology (software instead of talking/handwriting/printing), speed (microseconds instead of days and weeks by stagecoach, railroad or Pony Express, see image above) and the number of people reached (potentially – but rarely – millions simultaneously instead of one person or small group at a time), blogging, social networking and other forms of online writing are nothing new – this is how people have always communicated. Like Montaigne. And the Republic of Letters in the 18th century. And Charles Darwin in the 19th century.

All we are doing now is returning to a more natural, straightforward and honest way of sharing information, just using much more efficient ways of doing it. (Images from Cody Brown)

And not even that – where technology is scarce, analog blogging is alive and well (image: Analog blogger, from AfriGadget).

What about trustworthiness of all that online stuff? Some is and some isn’t to be trusted. It’s up to you to figure out your own filters and criteria, and to look for additional sources, just like our grandparents did when they had a choice of dozens of newspapers published in each of their little towns.

With the gradual return of a more natural system of communication, we got to see additional opinions, the regular fact-checks on the media by experts on the topic, and realized that the mainstream media is not to be trusted.

With the return of a more natural system of communication, we will all have to re-learn how to read critically, find second opinions, evaluate sources. Nothing new is there either – that is what people have been doing for millennia – the 20th century is the exception. We will figure out who to trust by trusting the judgment of people we already trust. Trust is transitive.

Return of the phatic language

What does this all mean for the future of journalism, including science journalism?

The growing number of Web-savvy citizens have developed new methods of establishing the trustworthiness of sources. It is actually the old, pre-20th century method – relying on individuals, not institutions. Instead of treating WaPo, Fox, MSNBC and NPR as the proxies for the father, teacher, preacher and the medicine man, we now once again evaluate individuals.

As nobody enters a news site via the front page and looks around, but we all get to individual articles via links and searches, we are relying on bylines under the titles, not on the logos up on top. Just like we were not born trusting NYTimes but learned to trust it because our parents and neighbors did (and then perhaps we read it for some time), we are also not born knowing which individuals to trust. We use the same method – we start with recommendations from people we already trust, then make our own decisions over time.

If you don’t link to your sources, including to scientific papers, you lose trust. If you quote out of context without providing that context, you lose trust. If you hide who you are and where you are coming from – that is cagey and breeds mistrust. Transparency is the new objectivity.

And transparency is necessarily personal, thus often phatic. It shows who you are as a person, your background, your intentions, your mood, your alliances, your social status.

There are many reasons sciencebloggers are more trusted than journalists covering science.

First, they have the scientific expertise that journalists lack – they really know what they are talking about on the topic of their expertise and the audience understands this.

Second, they link out to more, more diverse and more reliable sources.

Third, being digital natives, they are not familiar with the concept of word-limits. They start writing, they explain it as it needs to be explained and when they are done explaining they end the post. Whatever length it takes to give the subject its due.

Finally, not being trained by j-schools, they never learned not to let their personality shine through their writing. So they gain trust by connecting to their readers – the phatic component of communication.

Much of our communication, both offline and online, is phatic. But that is necessary for building trust. Once the trust is there, the conceptual communication can work. If I follow people I trust on Twitter, I will trust that they trust the sources they link to, so I am likely to click on them. Which is why more and more scientists use Twitter to exchange information (PDF). Trust is transitive.

Scientists, becoming journalists

Good science journalists are rare. Cuts in newsrooms, allocation of too little space for science stories, assigning science stories to non-science journalists – all of these factors have resulted in a loss of quantity and quality of science reporting in the mainstream media.

But being a good science journalist is not impossible. People who take the task seriously can become experts on the topic they cover (and get to a position where they can refuse to cover astronomy if their expertise is evolution) over time. They can become temporary experts if they are given sufficient time to study instead of a task of writing ten stories per day.

With the overproduction of PhDs, many scientists are choosing alternative careers, including many of them becoming science writers and journalists, or Press Information Officers. They thus come into the profession with the expertise already there.

There is not much difference between a research scientist who blogs and thus is an expert on the topic s/he blogs about, and a research scientist who leaves the lab in order to write as a full-time job. They both have scientific expertise and they both love to write or they wouldn’t be doing it.

Blog is software. A medium. One of many. No medium has a higher coefficient of trustworthiness than any other. Despite never going to j-school and writing everything on blogs, I consider myself to be a science writer.

Many science journalists, usually younger though some of the old ones caught on quickly and became good at it (generation is mindset, not age), grok the new media ecosystem in which online collaboration between scientists and journalists is becoming a norm.

At the same time, many active scientists are now using the new tools (the means of production) to do their own communication. As is usually the case with novelty, different people get to it at different rates. Conflicts between 20th-century and 21st-century styles of thinking inevitably occur. The traditional scientists wish to communicate the old way – in journals, letters to the editor, at conferences. This is the way of gatekeeping they are used to.

But there have been a number of prominent cases of such clashes between old and new models of communication, including the infamous Roosevelts on toilets (the study had nothing to do with either US Presidents or toilets, but it is an instructive case – image by Dr.Isis), and several other smaller cases.

The latest one is the Arsenic Bacteria Saga in which the old-timers do not seem to understand what a ‘blog’ means, and are seemingly completely unaware of the important distinction between ‘blogs’ and ‘scienceblogs’, the former being online spaces by just about anyone, the latter being blogs written by people who actually know their science and are vetted or peer-reviewed in some way e.g., at or or by virtue of being hand-picked and invited to join one of the science blogging networks (which are often run by traditional media outlets or scientific publishers or societies) or simply by gaining respect of peers over time.

Case by case, old-time scientists are learning. Note how both in the case of Roosevelts on toilets and the Arsenic bacteria the initially stunned scientists quickly learned and appreciated the new way of communication.

In other words, scientists are slowly starting to get out of the cocoon. Instead of just communicating to their peers behind the closed doors, now they are trying to reach out to the lay audience as well.

As more and more papers are Open Access and can be read by all, they are becoming more readable (as I predicted some years ago). The traditional format of the paper is changing. So scientists are covering the “let me explain” portion better, both in papers and on their own blogs.

They may still be a little clumsy about the “what’s new” part, over-relying on the traditional media to do it for them via press releases and press conferences (see Darwinius and arsenic bacteria for good examples) instead of doing it themselves or taking control of the message (though they do need to rely on MSM to some extent due to the distinction between push and pull strategies as the media brands are still serving for many people as proxies for trustworthy sources).

But most importantly, they are now again adding the phatic aspect to their communication, revealing a lot of their personality on social networks, on blogs, and even some of them venturing into doing it in scientific papers.

By combining all three aspects of good communication, scientists will once again regain the trust of their audience. And what they are starting to do looks more and more like (pre-20th century) journalism.

Journalists, becoming scientists

On the other side of the divide, there is a renewed interest in journalism expanding from just “this is new” to “let me explain how the world works”. There are now efforts to build a future of context, and to design explainers.

If you are not well informed on an issue (perhaps because you are too young to remember when it first began, or the issue just started being relevant to you), following a stream of ‘what is new’ articles will not enlighten you. There is not sufficient information there. There is a lot of tacit knowledge that the writer assumes the readers possess – but many don’t.

There has to be a way for news items to link to some kind of collection of background information – an ‘explainer’. Such an explainer would be a collection of verifiable facts about the topic. A collection of verifiable facts about the way the world works is….scientific information!

With more and more journalists realizing they need to be transparent about where they are coming from, injecting personality into their work in order to build trust, some of that phatic language is starting to seep in, completing the trio of elements of effective communication.

Data Journalism – isn’t this science?

Some of the best journalism of the past – yes, the abominable 20th century – was done when a reporter was given several months to work on a single story requiring sifting through boxes and boxes of documents. The reporter becomes the expert on the topic, starts noticing patterns and writes a story that brings truly new knowledge to the world. That is practically science! Perhaps it is not the hardest of the hard sciences like physics, but as good as well-done social science like cultural anthropology, sociology or ethnography. There is a system and a method very much like the scientific method.

Unfortunately, most reporters are not given such luxury. They have to take shortcuts – interviewing a few sources to quote for the story. The sources are, of course, a very small and very unrepresentative sample of the relevant population – from a rolodex. Call a couple of climate scientists, and a couple of denialists, grab a quote from each and stick them into a formulaic article. That is Bad Science as well as Bad Journalism. And now that the people formerly known as audience, including people with expertise on the topic, have the tools to communicate to the world, they often swiftly point out how poorly such articles represent reality.

But today, most of the information, data and documents are digital, not in boxes. They are likely to be online and can be accessed without travel and without getting special permissions (though one may have to steal them – as Wikileaks operates: a perfect example of the new data journalism). Those reams of data can be analyzed by computers to find patterns, as well as by small armies of journalists (and other experts) for patterns and pieces of information that computer programs miss.

This is what bioinformaticists do (and have already built tools to do it – contact them, steal their tools!).

Data journalism. This is what a number of forward-thinking journalists and media organizations are starting to do.

This is science.

On the other hand, a lot of distributed, crowdsourced scientific research, usually called Citizen Science, is in the business of collecting massive amounts of data for analysis. How does that differ from data journalism? Not much?

Look at this scientific paper – Coding Early Naturalists’ Accounts into Long-Term Fish Community Changes in the Adriatic Sea (1800–2000) – is this science or data journalism? It is both.

The two domains of communicating about what is new and how the world works – journalism and science – have fused again. Both are now starting to get done by teams that involve both professionals and amateurs. Both are now led by personalities who are getting well-known in the public due to their phatic communication in a variety of old and new media.

It is important to be aware of the shortness of our lives and thus of our natural tendency for historical myopia. Just because we were born in the 20th century does not mean that the way things were done then is the way things were ‘always done’, or the best way to do things – the pinnacle of cultural and social development. The 20th century was just a strange and deviant blip in the course of history.

As we are leaving the 20th century behind with all of its unusual historical quirks, we are going back to an older model of communicating facts – but with the new tools we can do it much better than ever, including a much broader swath of society – a more democratic system than ever.

By the way, while it’s still cold, the rain has stopped. And that is Metaphorical language…

This article was commissioned by Science Progress and will also appear on their site in 24 hours.

Seven Questions….with Yours Truly

Last week, my SciBling Jason Goldman interviewed me for his blog. The questions were not so much about blogging, journalism, Open Access and PLoS (except a little bit at the end) but more about science – how I got into it, what are my grad school experiences, what I think about doing research on animals, and such stuff. Jason posted the interview here, on his blog, on Friday, and he also let me repost it here on my blog as well, under the fold:

Continue reading

Open Laboratory – old Prefaces and Introductions

One difference between reading Open Laboratory anthologies and reading the original posts included in them is that the printed versions are slightly edited and polished. Another difference is that the Prefaces and Introductions can be found only in the books. They have never been placed online.
But now that four books are out and we are halfway through collecting entries for the fifth one, when only the 2009 book is still selling, I think it is perfectly OK to place Prefaces and Introductions that I wrote myself online. I wrote Prefaces for the 2006, 2007 and 2008 books, as well as the Introduction for the 2006 one. The introductions for the subsequent editions were written by the year’s guest editor, i.e., Reed Cartwright in 2007, Jennifer Rohn in 2008, and SciCurious in 2009.
So, under the fold are my three Prefaces and one Introduction. See how the world of online science communication (and my understanding of it) has changed over the last few years:

Continue reading

Good article about the history and current state of Open Access

US seeks to make science free for all by Declan Butler:

The push to open up scientific knowledge to all looks set to go into overdrive. Over the past decade, the accessibility offered by the Internet has transformed science publishing. Several efforts have already tried to harness the web’s power to make research papers available for free. Now two parallel efforts from the US government could see almost all federally funded research made available in free, publicly accessible repositories…..

Read the whole thing….

Why it is important for media articles to link to scientific papers

You may be aware that, as of recently, one of my tasks at work is to monitor media coverage of PLoS ONE articles. This is necessary for our own archives and monthly/annual reports, but also so I could highlight some of the best media coverage on the everyONE blog for everyone to see. As PLoS ONE publishes a large number of articles every week, we presume that many of you would appreciate getting your attention drawn to that subset of articles that the media found most interesting.
So, for example, as I missed last week due to my trip to AAAS, I posted a two-week summary of media coverage this Monday. And that took far more time and effort (and some silent cursing) than one would expect. Why?
I don’t think I am a slouch at googling stuff. Some people joke that the entire Internet passes through my brain before it goes to the final audience. After all, I have been monitoring the Web for mentions of ‘PLoS’ and ‘Public Library of Science’ on blogs, Twitter, FriendFeed, Facebook and elsewhere for a few years now. If I don’t catch a mention within minutes of it being posted, you can bet one of my many online friends/followers/subscribers is bound to quickly let me know by e-mail or Direct Messaging somewhere. If someone says something nice about PLoS, I am quick to post a ThankYou note. If someone asks a question, I try to answer or to connect the person with the appropriate member of the PLoS staff. If someone is publicly musing about submitting a manuscript to one of our journals, I am right there to give encouragement. If someone makes a factual error, I gently correct it. It is very, very rare that I need to raise the Immense Online Armies because someone is wrong on the Internet ;-)
So, why is it difficult then to compile a collection of weekly media coverage? Let me walk you through the process….
First, as you probably already know, PLoS makes no distinction between Old and New media. We have bloggers on our press list who apply/sign-up in the same way and abide by the same rules as traditional journalists (and, unlike mainstream media, bloggers NEVER break embargos, not once in the past three years since we started adding bloggers to our press list). For the kind of coverage we prefer to see, we point bloggers to the criteria. In return, bloggers can send trackbacks to our articles, their work is showcased side-by-side with the traditional outlets in our weekly posts, they can be discovered via Google Blogsearch, Postgenomic and links directly from each article, and one blogger per month wins a t-shirt and special recognition.
So, I start with blog posts first. The first thing I do is take a look at Those are the best of the best posts – not merely mentioning our articles, but adding analysis, commentary, critique, context and additional information. How do I find them? I just search the site for the phrase ‘journal.pone‘. That search brings up every single post that mentions a PLoS ONE article because that phrase is a part of every possible form of the URL of the article (including the shortest one, which includes just the DOI). If a post links to our article (and that is the only way to get aggregated on I will find it this way. Needless to say, this process takes just a few minutes per week.
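The search trick described above can be sketched in a few lines of Python. This is only an illustration, not the tool I actually use: the function names are made up for the example, the DOIs in it are placeholders rather than real articles, and the pattern assumes that PLoS ONE DOIs take the form 10.1371/journal.pone. followed by seven digits. Some article URLs carry the DOI in percent-encoded form, so the sketch decodes the text before looking for DOIs.

```python
import re
from urllib.parse import unquote

# Assumed shape of a PLoS ONE DOI: 10.1371/journal.pone. plus seven digits.
PLOS_ONE_DOI = re.compile(r"10\.1371/journal\.pone\.\d{7}")


def mentions_plos_one(text):
    """Cheap check: does the text contain the 'journal.pone' phrase at all?"""
    return "journal.pone" in text


def find_plos_one_dois(text):
    """Return the unique PLoS ONE DOIs found in text, in order of first appearance."""
    # Decode percent-encoded URLs (e.g. %2F for '/') so both URL forms match.
    decoded = unquote(text)
    found = []
    for doi in PLOS_ONE_DOI.findall(decoded):
        if doi not in found:
            found.append(doi)
    return found


if __name__ == "__main__":
    # Hypothetical blog post body with two placeholder links to the same article.
    post = (
        "Great new paper: http://dx.doi.org/10.1371/journal.pone.0001234 "
        "(also at http://www.plosone.org/article/"
        "info%3Adoi%2F10.1371%2Fjournal.pone.0001234)"
    )
    print(mentions_plos_one(post))
    print(find_plos_one_dois(post))
```

The point of the sketch is the same as the point of the paragraph above: a post that links to the paper can be found mechanically, while an article that does not link leaves nothing to search for.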
Knowing that there are some good blogs out there that are not registered at (which is strange and unfathomable why – is a ‘stamp-of-approval’ place for science blogs recognized by the outside world of journals and media, as well as a nice way to get extra recognition and traffic, and even awards), I then repeat the same search – for ‘journal.pone‘ – on Google Blogsearch. This may bring up a few more posts that I did not catch yet. Occasionally, some of these are good. Another couple of minutes. Blogs are now done. Move on to traditional media….
And this is where the Hell starts. Try searching Google News for ‘journal.pone‘…?! All I get are a couple of prominent blogs that I have already counted, e.g., those blogs that are listed by Google News ( blogs, Ars Technica, Wired blogs, etc.). Where are the others?
The problem is, nobody in the mainstream media links to papers.
So I have to search for PLoS and for Public Library Of Science and then figure out which ones are covering specifically PLoS ONE articles (sometimes they don’t specify, sometimes they name the wrong journal – last week an article on PLoS Current-Influenza was reported to be in PLoS ONE by a number of outlets copying the error from each other). Then I have to search for keywords for individual articles I suspect may have received some coverage. Last week, for example, I searched for “swallows+antioxidants” and “St. Birgitta”, among many others. This lasts for hours! And at the end I am still not 100% sure I caught everything. How frustrating!
Not only is there a big difference in time and effort spent between finding blog posts and finding media articles, but there is an even bigger disparity when one considers what results come out of these searches. I have been doing this for a month now. I expected that there would be poor blog posts and poor media articles, that there would be good blog posts and good media articles, and that there would occasionally be some excellent blog posts and excellent media articles. So far, that is true…. except I have yet to discover an excellent media article. As a rule, the very best coverage of every paper in the past month was done by a blogger or two or three. Then there are some other, good pieces of coverage in both the New and Old media, and then there are some really bad pieces in both realms as well (not all blog posts I count here are really bad – they may just be too detailed, technical and dry for a lay audience because the blogger is intentionally targeting scientific peers as audience, which is a fair thing to acknowledge).
So, every week, it takes me a few minutes to find the very best coverage (which is on blogs, usually those aggregated on And then I spend hours looking for remnants, in the traditional media, which turn out to be so-so, some OK, some not so good, some horrible. If I wasn’t paid to do this, I would not do it – it cannot be good for my long-term mental health.
The resistance to posting links is an atavism, a remnant of an old age before the Web. I know (because I asked many times) that many good science journalists keep trying to add links, but the editors say No. The traditional media has still not caught on to the Ethic of the Link, which is an essential aspect of the ethics of online communication.
I can think, off the top of my head, of three good reasons why everyone who publishes online should include a link to the scientific paper described in the article (just post the DOI link that comes with the press release if you are on the press list – if it does not resolve immediately, it is not your fault, you can always blame the journals for being slow on it – though this should never happen with PLoS articles):
Reason One: I will not go crazy every week. I am assuming that every scientific publisher has people on the staff whose task is to monitor media coverage and each one of these people is cussing and cursing YOU, the Media, every day. Try to make friends with people who provide you with source material on a regular basis.
Reason Two: Media coverage is one of the many elements of article-level metrics. Furthermore, links from the media affect the number of views and downloads of the article, and those are also elements of article-level metrics. The number of views/downloads then, in the future, affects the number of citations the work gets, which is also an element of article-level metrics. Thus omitting the link skews the ability of readers and observers to evaluate the papers properly.
The current ecosystem of science communication has a scientific paper at its core, additions to the paper (e.g., notes, comments and ratings, as well as Supplemental materials, videos posted on, etc) as a shell, and incoming and outgoing links – trackbacks, cited papers, citing papers, links to other papers in the same Collection, links to other papers with the same keywords, and yes, incoming links from the media – as connections building a network: the entire inter-connected ecosystem of scientific knowledge.
By not linking to scientific papers, traditional media is keeping itself outside of the entire ecosystem of empirical knowledge. By doing this, the traditional media is fast making itself irrelevant.
Reason Three: if an article in the media discusses a scientific study, that scientific paper is the source material for the article. If the link is missing, this is an automatic red flag for the readers. What is the journalist hiding? Why is the article making it difficult for readers to fact-check the journalist? Something does not smell good if the link is not provided (or worse, it is impossible to figure out even who the authors are and in which journal they published – yes, that is more common than you think).
The instant and automatic response of the readers is mistrust. Every time you fail to link to the paper, you further erode whatever trust and reputation you still may have with the audience. You soon cease to be a legitimate source of information. Sure, most readers will not go hunting for the paper to read it in order to fact-check you. But two or three will, and they will let everyone else know if your article is trustworthy or not, either in the comments under the article on your own site, or on their blogs which will be quickly picked up by Google (remember: Google loves blogs).
So please, media types, hurry up and catch up with the world. The 21st century is already a decade in – you really need to do some very fast learning. Right now. Or you’ll go extinct in a nanosecond. And despite my reputation, I never said that I’d consider that result to be a Good Thing. We are in this together, you just need to do your part. To begin with, start linking.

AAAS 2010 meeting

In San Diego this week. Check it out. I’ll be there – see my session. If you will be there, let me know. Let’s have coffee or lunch, etc. My session is on 21st in the morning, and there is a lot of social stuff I agreed to on the 19th in the afternoon and evening, and of course I want to see a lot of other sessions, but I am generally flexible. Just ping me over e-mail or Twitter or phone (if you have my number) or post a comment here.

Aves 3D

Aves 3D is a ‘three dimensional database of avian skeletal morphology’ and it is awesome!
This is an NSF-funded project led by Leon Claessens, Scott Edwards and Abby Drake. What they are doing is making surface scans of various bones of different bird species and placing the 3D scans on the website for everyone to see and use. With simple use of the mouse or arrow buttons, one can move, zoom and rotate each image any way one wants.
The collection is growing steadily and already contains some very interesting bones from a number of species, both extinct and extant. You can see examples of bones of the dodo or the Diatryma gigantea (aka Gaston’s Bird), as well as many skulls and sternums and various limb bones of currently existing species.
The database is searchable by Cladogram, Scientific Name, Common Name, Skeletal Element, Geological Era, Geographical Location or Specimen Number.
Most of the actual scanning is done by undergraduate students and the database is already being used for several scientific projects. You can get involved and help build the database, you can use the scans for teaching and research, or you can just go and have fun rotating the cool-looking bird bones.