Data Cleaning Best Practices

Start by getting into a data mindset and understanding how to bulletproof your data. Ask these questions: Who collected the data? Why did they collect it? How did they collect it? When did they start (and stop) collecting it? What happened to the data between the day it was born and the day you are harvesting it? (Source: https://github.com/sarahcnyt/stabile/blob/master/docs/bulletproof.md) Get a data dictionary (or make […]


Free Training: Intro to Data Journalism with the Center for Investigative Reporting

Reveal/The Center for Investigative Reporting will be coming to North Carolina June 18-19 to train journalists and students in the art of data reporting. Emmanuel Martinez, data reporter for Reveal/CIR, will lead the training. Martinez is the lead data journalist in Reveal’s ongoing coverage of modern-day redlining, which includes complex statistical analyses touching nearly every mortgage application in the country. Martinez will work with journalists on […]


The state of data in newsrooms: findings and recommendations

The following is a condensed version of a report by Melody Kramer that was written for the Carolina Data Desk, which is part of the UNC School of Media and Journalism’s Reese News Lab, and funded by the Center for Innovation & Sustainability in Local Media. Names and organizations have been made anonymous for privacy reasons. Kramer interviewed journalists and civic hackers both inside and […]


How we helped a journalist fact-check with data

In this post, we’ll walk through a real-life example of fact-checking using data analysis techniques.


Tutorial: Intro to Web Scraping with R

Let’s take a look at how you can use R to scrape data from text in a standard form across many pages. We’ll start by going over some of the key functions that will help us, and then we’ll walk through an example.


Legality and Ethics of Web Scraping

Web scraping can be a powerful tool for journalists, allowing them to quickly gather large amounts of data. It can also be confusing and intimidating at times. In this post, we’ll look at some things to consider when deciding whether web scraping is a good solution for your data needs.


On Our Desk — March 31

On Our Desk is a round-up of national or North Carolina news outlets using data to tell stories in innovative ways. Here’s what inspired us this week.


Free Tools for Freelancers

Gathering, cleaning, and analyzing data can take some time, but it doesn’t need to be costly. Here are some of our favorite free data tools.


On Our Desk — March 3, 2017

On Our Desk is a round-up of national or North Carolina news outlets using data to tell stories in innovative ways. Here’s what inspired us this week.


Replicating Voter Turnout Analysis at County Level

In a recent release, Democracy North Carolina analyzed the state’s voter turnout among different demographic groups in last November’s election. In this post, we’ll see how you can replicate this analysis for your county.
