In search of global corporate data
featuring OpenCorporates

Chris Taggart, OpenCorporates, NICAR, Feb 2012
Corporate data for journalists
is a solved problem. Right?

 Hoovers, Lexis-Nexis
 Jigsaw/Salesforce
 Kompass, Mint/Orbis, etc
 Linked In
 Google/Yahoo Finance etc
 Annual reports
That’s fine for big
corporations
But...

 Typically only good coverage of largest companies
 Most are just aggregators of same standard sources,
 and rarely connect to or show the original data
 Information gets poorer once outside the US/UK
 Doesn’t cover smaller companies well
 Rarely gives access to data
 Very proprietary... and no provenance
Why is this important?
Because companies no longer look like this
nor like this
http://www.flickr.com/photos/ahxcjb/518357242
it’s far more like this
or even like this
So, a bit of a mess, but
investigation is still possible


  FOR KNOWN STORIES
So, you’re reliant upon




http://fr.fotopedia.com/items/flickr-3346906435
or




http://www.flickr.com/photos/corywendorf/3620929918/sizes/z/in/photostream/
But this is about data
 journalism, right?
And there’s a lot of data out
there
Most isn’t linked to the legal
entity, making it difficult to use
But it does include a wealth
of other information...
If only we could tie it all
        together...
And legal entity matters

 It’s the thing that ends up in court
 It’s what provides firewalls between associated
 people, companies, organisations – information,
 regulation, tax
 It allows a corporate entity to take advantage of
 different rules in different jurisdictions – regulatory
 arbitrage
If you don’t think this affects your life, you’ve slept through the past few years
http://www.flickr.com/photos/aaronjacobs/64368770
So... OpenCorporates
A simple (but huge) goal: an
entry for every corporate
legal entity in the world
Based on the company number and jurisdiction
(no monopoly id)
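
To make the idea of an identifier built only from jurisdiction and company number concrete, here is a minimal Python sketch. The class name `CompanyId`, the lowercase jurisdiction codes ("gb", "us_de"), the normalisation rules and the company numbers are all illustrative assumptions, not a description of how OpenCorporates actually stores its data.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CompanyId:
    """Hypothetical composite key: register jurisdiction + register-issued number."""
    jurisdiction: str    # assumed lowercase code such as "gb" or "us_de"
    company_number: str  # kept as a string so leading zeros survive

    @classmethod
    def parse(cls, jurisdiction: str, company_number: str) -> "CompanyId":
        # Assumed normalisation: trim whitespace and unify letter case only.
        return cls(jurisdiction.strip().lower(), company_number.strip().upper())

    def key(self) -> str:
        return f"{self.jurisdiction}/{self.company_number}"


# Made-up numbers: the same number in two jurisdictions is two different entities.
a = CompanyId.parse(" GB ", "00000001")
b = CompanyId.parse("us_de", "00000001")
registry = {a: "Example Holdings Ltd", b: "Example Holdings Inc"}
```

Because the key reuses the number each register already issues, nobody needs a licence from a third party to refer to a company.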
[Digression]   The DUNS number
  Genius idea. Developed by Dun &
  Bradstreet in 1962
  Create a monopoly ID system
  Get governments around the world
  to use it instead of the company
  IDs they created themselves...
  Persuade them to integrate deeply
  into their systems, & thus do the
  selling for you
  Assert your IP so that they can’t
  use it freely (as in free speech)
We’ve got data too
All openly licensed
4 core uses for journalists
The simple search

Not to be underestimated
Massively reduces friction
(how long will it take you
to find and search
multiple jurisdictions?)
Allows ‘what if’ questions
Potentially generates
stories in its own right
Source for additional info

 Addresses, filings,
 status, websites...
 Intl trademarks, UK
 govt spending,
 official notices,
 health & safety...
 Other IDs: SEC,
 CAGE, charity....
 Coming soon:
 lobbying registers
Reconciliation
(matching names to legal entities)

Cleans up messy company names (& previous names) to legal entity, and from there to other data
We provide a Google Refine reconciliation service (specific to jurisdiction)
Used by Open Spending, and we’re discussing with govts how to clean up data at source
And can even be used to find out useful information on its own
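
The reconciliation idea above (messy name in, legal entity out, previous names included) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the suffix table, the function names `normalise`, `build_index` and `reconcile`, and the sample entities are all made up here, and a real reconciliation service would use scoring and fuzzy matching rather than exact matches on normalised names.

```python
import re

# Assumed, deliberately short list of legal-form words to abbreviate.
SUFFIXES = {"limited": "ltd", "incorporated": "inc", "corporation": "corp", "company": "co"}


def normalise(name: str) -> str:
    """Lowercase, drop punctuation, and abbreviate common legal-form words."""
    words = re.sub(r"[^a-z0-9 ]", " ", name.lower().replace("&", " and ")).split()
    return " ".join(SUFFIXES.get(w, w) for w in words)


def build_index(entities):
    """Map every normalised current or previous name to its legal entity key."""
    index = {}
    for entity in entities:
        for name in [entity["name"], *entity.get("previous_names", [])]:
            index.setdefault(normalise(name), entity["key"])
    return index


def reconcile(messy_name, index):
    """Return the matching entity key, or None if there is no exact normalised match."""
    return index.get(normalise(messy_name))


# Made-up entities for illustration only.
entities = [
    {"key": "gb/00000001", "name": "Example Widgets Limited",
     "previous_names": ["Smith & Jones Trading Ltd"]},
    {"key": "gb/00000002", "name": "Example Holdings PLC"},
]
index = build_index(entities)
```

Once a spending record or a press release name resolves to a key like this, it can be joined to any other dataset that carries the same key.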
The database/platform

API: allows all information to be retrieved as data, even searches
User-contributed data: users can now add websites, telephone numbers, addresses
Corporate Groupings – a user-curated way of grouping companies together, mapped to the Wikipedia article about them
Coming soon: giving users the option to match data to companies
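
As a rough illustration of "even searches" being available as data through the API, the sketch below only builds a search URL and makes no network call. The base URL, the `/companies/search` path and the parameter names `q` and `jurisdiction_code` are assumptions to be checked against the current API documentation; the helper name `company_search_url` is hypothetical.

```python
from urllib.parse import urlencode

# Assumption: base URL and path should be verified against the current API docs.
API_BASE = "https://api.opencorporates.com"


def company_search_url(query, jurisdiction_code=None):
    """Build a search URL; the parameter names used here are assumed."""
    params = {"q": query}
    if jurisdiction_code:
        params["jurisdiction_code"] = jurisdiction_code
    return f"{API_BASE}/companies/search?{urlencode(params)}"


url = company_search_url("example widgets", jurisdiction_code="gb")
```

Fetching such a URL with any HTTP client would then return the same results as the website search, in a machine-readable form.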
One last thing...

We’ve just started importing and indexing company officers
New feature: officers

You can now search by officer name
Early stage: we’re still fetching the info (and can only get it for jurisdictions that publish it), but even that’s useful
Still...
 Though it’s by far the biggest and best open database
 of companies in the world, there’s a lot more to do
 Lots of data we haven’t matched. Quite a few US
 jurisdictions we haven’t added, and some where the
 information is fairly laggy
 We’re starting to get official recognition (EU, G20, etc),
 but some company registers see it as a threat to their
 ‘business model’
 Provenance is given for everything, so it’s easy to identify
 the source of ‘errors’
Information is the currency
of democracy
                    Thomas Jefferson
DATA is the currency
of democracy
                  Thomas Jefferson

Data for Business Journalism, NICAR 2012

  • 1. In search of global corporate data featuring OpenCorporates Chris Taggart, OpenCorporates, NICAR, Feb 2012
  • 2-13. Corporate data for journalists is a solved problem. Right? Hoovers, Lexis-Nexis; Jigsaw/Salesforce; Kompass, Mint/Orbis, etc; LinkedIn; Google/Yahoo Finance etc; annual reports.
  • 14. That’s fine for big corporations
  • 16-21. But... Typically only good coverage of largest companies. Most are just aggregators of the same standard sources, and rarely connect to or show the original data. Information gets poorer once outside the US/UK. Doesn’t cover smaller companies well. Rarely gives access to data. Very proprietary... and no provenance.
  • 22. Why is this important?
  • 24. nor like this http://www.flickr.com/photos/ahxcjb/518357242
  • 25. it’s far more like this
  • 26-27. or even like this
  • 28-29. So, a bit of a mess, but investigation is still possible... FOR KNOWN STORIES
  • 30-31. So, you’re reliant upon http://fr.fotopedia.com/items/flickr-3346906435
  • 33. But this is about data journalism, right?
  • 34-35. And there’s a lot of data out there
  • 36-37. Most isn’t linked to the legal entity, making it difficult to use
  • 38. But it does include a wealth of other information...
  • 39. If only we could tie it all together...
  • 40-43. And legal entity matters. It’s the thing that ends up in court. It provides firewalls for associated people, companies, organisations (information, regulation, tax). It allows a corporate entity to take advantage of different rules in different jurisdictions: regulatory arbitrage.
  • 44. If you don’t think this affects your life, you’ve slept through the past few years http://www.flickr.com/photos/aaronjacobs/64368770
  • 46-48. A simple (but huge) goal: an entry for every corporate legal entity in the world. Based on the company number and jurisdiction (no monopoly ID).
  • 49-56. [Digression] The DUNS number. Genius idea. Developed by Dun & Bradstreet in 1962. Create a monopoly ID system. Get governments around the world to use it instead of the company IDs they created themselves... Persuade them to integrate it deeply into their systems, & thus do the selling for you. Assert your IP so that they can’t use it freely (as in free speech).
  • 61. 4 core uses for journalists
  • 63-69. The simple search. Not to be underestimated. Massively reduces friction (how long will it take you to find and search multiple jurisdictions?). Allows “what if” questions. Potentially generates stories in its own right.
  • 71-76. Source for additional info. Addresses, filings, status, websites... Intl trademarks, UK govt spending, official notices, health & safety... Other IDs: SEC, CAGE, charity... Coming soon: lobbying registers.
  • 77. Reconciliation (matching names to legal entities): Cleans up messy company names (& previous names) to legal entity, and from there to other data
  • 78. Reconciliation (matching names to legal entities): We provide Google Refine reconciliation service (specific to jurisdiction)
  • 79. Reconciliation (matching names to legal entities): Used by Open Spending & discussing with govts to clean up data at source
  • 80. Reconciliation (matching names to legal entities): And can even be used to find out useful information on its own
  • 81. The database/platform. API: allows all information to be retrieved as data, even searches
  • 82. The database/platform. User-contributed data: Users can now add websites, telephone numbers, addresses
  • 83. The database/platform. Corporate Groupings: a user-curated way of grouping companies together, mapped to the Wikipedia article about them
  • 84. The database/platform. Coming soon: giving users the option to match data to companies
  • 85. One last thing... We’ve just started importing and indexing company officers
  • 86. New feature: officers. You can now search by officer name.
  • 87-91. New feature: officers. Early stage: we’re still fetching the info (and can only get it for jurisdictions that publish it), but even that’s useful. Callouts on the later builds: “similarly named”, “other resources”.
  • 92. Still... Though it’s by far the biggest and best open database of companies in the world, there’s a lot more to do. Lots of data we haven’t matched. Quite a few US jurisdictions we haven’t added, and some where the information is fairly laggy. We’re starting to get official recognition (EU, G20, etc), but some company registers see it as a threat to their ‘business model’. Provenance is given for everything, so it is easy to identify the source of ‘errors’.
  • 93. Information is the currency of democracy Thomas Jefferson
  • 94. [Information] DATA is the currency of democracy Thomas Jefferson
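
Slides 46-48 describe keying each entry on the company number plus the jurisdiction, rather than on a proprietary ID. A minimal sketch of that idea follows. The `CompanyKey` type, the lowercase jurisdiction codes (`gb`, `us_de`), the path layout, and the sample number are all assumptions made for illustration; none of them come from the slides.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CompanyKey:
    """Hypothetical identifier: the register's own number, scoped by jurisdiction."""
    jurisdiction: str    # assumed code format, e.g. "gb" or "us_de"
    company_number: str  # kept as a string so leading zeros survive

    def __post_init__(self):
        # Normalise so that " GB " and "gb" refer to the same register.
        object.__setattr__(self, "jurisdiction", self.jurisdiction.strip().lower())
        object.__setattr__(self, "company_number", self.company_number.strip().upper())

    def path(self) -> str:
        # Assumed path layout, for illustration only.
        return f"/companies/{self.jurisdiction}/{self.company_number}"


# Made-up sample values: the same number in two registers is two different entities.
a = CompanyKey("GB", "01234567")
b = CompanyKey("gb", " 01234567 ")
c = CompanyKey("us_de", "01234567")
```

Because the key is just the pair (jurisdiction, number), anyone holding the register's own data can construct it without asking a third party for an identifier, which is the point of "no monopoly ID".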

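
Slides 77-80 describe reconciliation: cleaning up messy company names (and previous names) to a legal entity within one jurisdiction. The sketch below shows one simple way this could work, using exact matching on normalised names. It is not the matching logic of OpenCorporates or of the Google Refine reconciliation service; the suffix list, the function names, and the sample company are hypothetical.

```python
import re

# Assumed, deliberately short list of legal-form suffixes to ignore when comparing.
SUFFIXES = {"ltd", "limited", "plc", "inc", "incorporated", "llc", "corp", "corporation"}


def normalise(name: str) -> str:
    """Lowercase, drop punctuation, and strip trailing legal-form words."""
    words = re.sub(r"[^a-z0-9 ]+", " ", name.lower()).split()
    while words and words[-1] in SUFFIXES:
        words.pop()
    return " ".join(words)


def build_index(entities):
    """Map each normalised current or previous name to (jurisdiction, number) pairs."""
    index = {}
    for entity in entities:
        for name in [entity["name"], *entity.get("previous_names", [])]:
            key = (entity["jurisdiction"], entity["company_number"])
            index.setdefault(normalise(name), []).append(key)
    return index


def reconcile(messy_name, index, jurisdiction):
    """Return candidate legal entities for a messy name within one jurisdiction."""
    return [key for key in index.get(normalise(messy_name), []) if key[0] == jurisdiction]


# Made-up register entry, for illustration only.
INDEX = build_index([
    {
        "jurisdiction": "gb",
        "company_number": "01234567",
        "name": "Example Widgets Limited",
        "previous_names": ["Acme Widgets Ltd."],
    },
])

current = reconcile("EXAMPLE WIDGETS LTD", INDEX, "gb")
previous = reconcile("acme widgets, ltd.", INDEX, "gb")
elsewhere = reconcile("Example Widgets", INDEX, "us_de")
```

Restricting the lookup to one jurisdiction mirrors the "specific to jurisdiction" note on slide 78: the same trading name can belong to unrelated legal entities in different registers. A real service would need fuzzier matching and scoring than this exact-match lookup.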