ENGAGE Project Information
Acronym: ENGAGE
Title: An Infrastructure for Open, Linked Governmental Data Provision towards Research Communities and Citizens
Website: http://www.engage-project.eu
Platform: http://www.engagedata.eu
Research Infrastructures
Contract no: RI-283700
Project type: CP-CSA
Start date: 01/06/2011
Duration: 36 months
Partners: 9
Framework Programme 7 (2007-2013)
Project participants
NTUA (Coordinator), GR
TU-DELFT, NL
MIC-GR, GR
IBM-ISRAEL, IL
INTRASOFT, LU
STFC, UK
FhG-FOKUS, DE
AEGEAN, GR
EUROCRIS, NL
Ministries / local public agencies websites
Publicdata.eu National Statistical Offices
Unstructured / “Semi-structured” Public data sources
ENGAGE traverses distributed and diverse public sector information (PSI) resources
ENGAGE provides a single point of access to PSI sources as well as relevant tools in order to cover the needs of researchers and citizens
- Integration of original PSI data and derived / curated datasets created, maintained and extended by users (researchers, citizens, journalists, computer specialists) in a collaborative environment.
- A research / data curation community platform with a focus on the SSH domain.
- The vision of the ENGAGE infrastructure is to extract, highlight and enhance the RE-USE value of PSI data.
HOW: Moving slowly from low-structured, isolated, difficult-to-find PSI data to highly structured, easy-to-link, easy-to-process datasets => Crowdsourcing.
Unstructured / Semi-structured / Structured Public data sources
JSON
Discovery and Context Metadata
Crowdsourcing: moving from low-structured, low-value datasets to highly structured and / or derived datasets
ENGAGE
Low Re-Use Value / Quality structure / metadata
High Re-Use Value / Quality structure / metadata
Conversion, Data Enrichment, Metadata Enrichment, Cleansing, "Snapshots"
ENGAGE 2.0
- On top of ENGAGE basic functions (catalog, search, visualizations, API)
Researchers / Citizens / Journalists:
- Extend other datasets (official, or already extended / derived datasets)
  - Conversions (e.g. HTML / PDF to XLS, PDF to RDF)
  - Data Cleansing (e.g. duplicate records, empty rows, errors)
  - Metadata Enrichment (missing metadata, Linked Data Enablers!)
  - Data Enrichment (enrich datasets with more information)
  - Snapshots of real-time data (e.g. Diavgeia_decisions_10_2012_to_12_2012.xls)
  - Mash-ups / Interlinking (e.g. combine election results with UV radiation levels!)
- View the version tree of official and derived datasets (a clean solution that makes contributions / versions easy to understand and manage)
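The data cleansing step named above (duplicate records, empty rows) can be illustrated with a short sketch. This is only an assumption about what such a step might look like, not the ENGAGE implementation; the function name `cleanse_rows` and the sample data are hypothetical and invented for illustration.

```python
import csv
import io


def cleanse_rows(rows):
    """Drop empty rows and exact duplicate records, keeping first occurrences."""
    seen = set()
    cleaned = []
    for row in rows:
        # Strip surrounding whitespace so " Attica " and "Attica" compare equal.
        normalised = tuple(cell.strip() for cell in row)
        # A row is empty if it has no cells or only blank cells.
        if not any(normalised):
            continue
        # Skip records that have already been seen.
        if normalised in seen:
            continue
        seen.add(normalised)
        cleaned.append(list(normalised))
    return cleaned


# Hypothetical sample data, not taken from any real dataset.
SAMPLE_CSV = """region,year,votes
Attica,2012,100
Attica,2012,100

Crete,2012,40
,,
"""

rows = list(csv.reader(io.StringIO(SAMPLE_CSV)))
cleaned = cleanse_rows(rows)
for row in cleaned:
    print(row)
```

Error correction, the third cleansing case on the slide, depends on the dataset and is left out of this sketch.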
ENGAGE 2.0
Researchers / Citizens / Journalists:
- Data Requests
  - Looking for a dataset (e.g. I can't find it elsewhere. Does it exist?)
  - Looking for a curation / conversion / enrichment (e.g. I am looking for the election results in Greece in XLS.)
  - Looking for data verification (e.g. Do you think this dataset is valid?)
  - Freedom of Information Requests
- Integration of tools
  - Google Refine
  - ScraperWiki
  - Visualizations
ENGAGE 2.0
Data Providers:
- Maintainers of Official Datasets
- Work as a group
- Bring the community that works on their data closer to them / direct communication
- See and take advantage of the work of the ENGAGE Data Curation Community (e.g. cleansing, better formats)
- Easily see / gather all the applications that are based on their official datasets.
- See the impact of their datasets.
- Understand which datasets have RE-USE value for users.
- Community help in the process of digitizing and opening current or older public data (history dimension)
Search for a dataset...
...use your own language
Check dataset information...
...and download it
Faceted search available...
...with several filters
Extend the datasets...
...in several ways...
...and keep the provenance information
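One way to picture how extended datasets could keep their provenance is a simple tree in which every derived dataset points back to the dataset it was created from. The sketch below is an illustration under that assumption only; the class `DatasetVersion`, its fields and the file names are hypothetical and do not describe the actual ENGAGE data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(eq=False)
class DatasetVersion:
    """A node in a hypothetical version tree of official and derived datasets."""
    name: str
    operation: str  # e.g. "official", "conversion", "cleansing"
    parent: Optional["DatasetVersion"] = field(default=None, repr=False)
    children: List["DatasetVersion"] = field(default_factory=list)

    def extend(self, name, operation):
        """Register a derived dataset and link it back to this one."""
        child = DatasetVersion(name=name, operation=operation, parent=self)
        self.children.append(child)
        return child

    def provenance(self):
        """Return the chain of dataset names from the official root to this node."""
        chain = []
        node = self
        while node is not None:
            chain.append(node.name)
            node = node.parent
        return list(reversed(chain))


def render_tree(node, depth=0):
    """Return the tree as indented text lines, one dataset per line."""
    lines = ["  " * depth + f"{node.name} [{node.operation}]"]
    for child in node.children:
        lines.extend(render_tree(child, depth + 1))
    return lines


# Hypothetical file names, for illustration only.
official = DatasetVersion("elections.pdf", "official")
converted = official.extend("elections.xls", "conversion")
cleansed = converted.extend("elections_clean.xls", "cleansing")

print("\n".join(render_tree(official)))
print(cleansed.provenance())
```

Storing only the parent link is enough to recover the full lineage of any derived dataset, which is what a version tree view needs.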
www.engagedata.eu
Open Refine
Describe the metadata...
Join the community...
...and create your groups
Rate the datasets...
...and share your thoughts
Find out about Open Data sites...
...per country
...or other criteria
Learn more about...
...ENGAGE, the ENGAGE API and Data Curation Methods
Value Proposition through individual tools
- Search in diverse and dispersed data sources in the EU supported by ENGAGE
- Transform your datasets while keeping the valuable information, using the ENGAGE external tools (Open Refine, ScraperWiki etc.)
- See your results through visualisation tools
- Structure your data according to your needs: control all the levels of your dataset (data, metadata, format)
- Refine existing datasets by metadata enrichment
Value Proposition through collaboration
- Create your community(ies) with members who share mutual interests
- Each community will be able to increase the value of its datasets by applying its own perspectives based on its unique needs
- Upload your work and share it with your community
- Find other datasets, valuable for your work, uploaded by your community (Collaborate / Exchange / Ask / Provide)
- Combine their results with yours to make new datasets
Gateways and integrated tools
ScraperWiki API
CKAN API
Open Refine
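As an illustration of what a gateway to a CKAN catalogue might send, the sketch below builds a request URL for CKAN's `package_search` action, which accepts `q`, `rows` and `fq` parameters. The slides do not say how ENGAGE calls CKAN, so the helper `build_package_search_url`, the host `ckan.example.org` and the choice of the `res_format` filter are assumptions made for this example.

```python
from urllib.parse import urlencode


def build_package_search_url(base_url, query, rows=10, filters=None):
    """Build a CKAN Action API package_search URL (no request is sent)."""
    params = {"q": query, "rows": rows}
    if filters:
        # Filter queries are passed as space-separated "field:value" pairs.
        params["fq"] = " ".join(
            f"{key}:{value}" for key, value in sorted(filters.items())
        )
    return f"{base_url.rstrip('/')}/api/3/action/package_search?{urlencode(params)}"


# Placeholder host: replace with a real CKAN instance before sending a request.
url = build_package_search_url(
    "https://ckan.example.org/",
    "election results",
    rows=5,
    filters={"res_format": "XLS"},
)
print(url)
```

The function only constructs the URL, so it can be checked without network access; fetching and parsing the JSON response is left to the caller.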
User Interface
Django Framework
HTML / jQuery
ENGAGE Core Components
Python / Django Framework
Elasticsearch
PostgreSQL
Heroku
Storage Components
Virtuoso
PostgreSQL
Django Wiki
Translate
Apache Solr
Amazon S3
CERIF