Dirty Data in the Newsroom: Comparing Data Preparation in Journalism and Data Science. CHI 2023.

Tamara Munzner

Stephen Kasica, Charles Berret, Tamara Munzner.
Dirty Data in the Newsroom: Comparing Data Preparation in Journalism and Data Science. Proc. CHI 2023, Honorable Mention.
Paper page: www.cs.ubc.ca/group/infovis/pu...
Supplemental materials: osf.io/nbtvm/
Abstract:
The work involved in gathering, wrangling, cleaning, and otherwise preparing data for analysis is often the most time consuming and tedious aspect of data work. Although many studies describe data preparation within the context of data science workflows, there has been little research on data preparation in data journalism. We address this gap with a hybrid form of thematic analysis that combines deductive codes derived from existing accounts of data science workflows and inductive codes arising from an interview study with 36 professional data journalists. We extend a previous model of data science work to incorporate detailed activities of data preparation. We synthesize 60 dirty data issues from 16 taxonomies on dirty data and our interview data, and we provide a novel taxonomy to characterize these dirty data issues as discrepancies between mental models. We also identify four challenges faced by journalists: diachronic, regional, fragmented, and disparate data sources.
programs.sigchi.org/chi/2023/...

Comments: 2
@NewInkFoHalo, 1 year ago
As a scientist with a lot of passion for but little training in data science this was a fascinating presentation that helped put into words many concepts that had only existed loosely in my head before now. Also, great methodology and inspired questions! Will definitely check the paper out.
@TheMoni7548, 1 year ago
You have a dataset issue. Every AI company promises the world, but only a few have the yottaflops and the datasets necessary. I think it's good homework. I think believing that its inherent power brings activism is stupid AI safety design.