Conference Presentations 2012

  • IASSIST 2012-Data Science for a Connected World: Unlocking and Harnessing the Power of Information, Washington, DC
    Host Institution: National Opinion Research Center (NORC)

C3: Technological approaches to enhancing data interoperability (Thu, 2012-06-07)

  • Going Local with a World Class Data Infrastructure: Enabling SDMX for Research Support
    Rob Grim (Tilburg University)

    [abstract]

    At Tilburg University, tools are needed to support the workflows of researchers. This paper reports on the use of SDMX to build the World Taxation Indicators portal. The project aims to fill in data gaps that limit research on taxation and to enhance the visibility of taxation research methods and concepts. SDMX is used to capture and register both metadata and research data that are collected in addition to data that are publicly available. An SDMX registry is used to populate a metadata repository. An SDMX repository is used to store the taxation indicators and the time series data that are collected by a macroeconomic research group. SDMX was chosen as the preferred technology because this standard interoperates with the existing infrastructure for statistical data exchange and can be used for cross-disciplinary research support. The CARDS (Controlled Access to Research Data Storage) project was funded by the SURFfoundation and ran from January to December 2011.

    Presentation:
  • Administrative and Survey Data: DDI-based Documentation for a Combined Analysis
    Marcel Hebing (German Institute for Economic Research (DIW Berlin))
    David Schiller (The Institute for Employment Research (IAB))

    [abstract]

    In the search for more powerful data resources, data from different sources are often combined, e.g. administrative and survey data. Such merged data sets can only serve the scientific community if they are of high quality. Data documentation is therefore of vital importance, and no easy task. Data drawn from two different sources need adjusted, standardized and easy-to-understand documentation. The DDI standard can fulfil these needs. The Institute for Employment Research (IAB) and the German Institute for Economic Research (DIW Berlin) are two major data providers in Germany, the IAB for administrative data and the DIW Berlin for survey data (German Socio-Economic Panel, SOEP). In this presentation the authors will show the challenges in implementing standardized metadata documentation, the importance of well-suited documentation for data quality, and the advantages of agreed data documentation for the comparison and combination of datasets. The focus will lie on the Data Documentation Initiative (DDI), a metadata standard for research data.

    Presentation:
  • Data without Boundaries: A DDI-Based Metadata Model for Supporting Cross-National Data Discovery
    Arofan T Gregory (Metadata Technology North America)

    [abstract]

    This presentation discusses the work of Data without Boundaries Work Package 8, exploring the requirements for a joint European-wide portal for the discovery of microdata held by statistical agencies and social science data archives across Europe. In support of this work, the metadata holdings of the various organizations have been surveyed, and work has been undertaken to produce a metadata model for implementation in Work Package 12. This metadata model will span both the Data Documentation Initiative (DDI) for documenting microdata and the Statistical Data and Metadata Exchange (SDMX) model for aggregate data holdings in the statistical offices. While European researchers may be familiar with the data holdings in their own national archives and statistical offices, they may not be as familiar with holdings in other European countries. Aggregate data will be indexed and linked to microdata holdings to provide improved discovery capabilities for European researchers. Similarities to the ongoing work on RDF expressions of SDMX and DDI are also explored.

    Presentation:
  • Supporting the Sharing of Longitudinal Health Data
    Veerle Van den Eynden (UK Medical Research Council)

    [abstract]

    The Data Support Service project of the UK Medical Research Council (MRC DSS) developed a Research Data Gateway to enable the deep discovery of MRC-funded population and patient studies and their datasets and variables. The Gateway enables researchers to find and explore variables across longitudinal cohort studies, to support data linkage for new research. A federated approach is used, whereby studies are responsible for storing, preserving, curating and disseminating data, and for publishing standardised metadata into the gateway. The system uses a Drupal content management system and Apache Solr search and browse functionality, with metadata organised into modular units representing studies, time periods, collection events and variables. Users can search and discover variables across studies and export baskets of variables to request access to data. The directory holds over 45,000 variables for four case studies: Avon Longitudinal Study of Parents and Children (ALSPAC), National Survey of Health and Development (NSHD), Southampton Women's Study (SWS) and Whitehall II. Variables for a further ten cohort studies are being incorporated. Development towards a DDI 3.1 metadata exchange standard is ongoing, enabling metadata from diverse formats and structures to be ingested into the gateway. MRC DSS also works closely with research units towards integrated data management planning.

    Presentation:

C4: Practical approaches of record linkage in RDCs (Thu, 2012-06-07)

  • Panel: Practical approaches of record linkage in RDCs
    Stefan Bender (Research Data Centre (FDZ), the Institute for Employment Research)
    Christopher Gürke (Federal Statistical Office)

    [abstract]

    The planned session deals with the application of record linkage by several German institutions. The paper "German Record Linkage Center (GRLC)" describes the GRLC, a long-term infrastructure facility whose main goal is to increase the number and quality of record-linkage applications in order to increase the analytical power of existing data and to unlock new data sources for research. Afterwards, two practical applications of record linkage are presented. The paper "The project 'combined firm data for Germany': Access to combined business micro data" is about a research project carried out by different institutions which provide researchers with enterprise-level micro data. In the course of the project, data of the participating institutions have been merged. Because no direct identifier was available, the process of data integration has been very complex and time-consuming, and record linkage was used. In this context, different string comparisons had to be tested and evaluated. The final paper "German census 2011 as a mixed method design" has a related background: for this year's census it was decided, for the first time, not to interview every household but to use a register-based design. Thus, the final data will be a mix of a register-based complete census and a sample survey. In addition to describing the assessment method used, the presentation will introduce the way the collected datasets are merged. Methodologically, this procedure is interesting because the data used have no common identifiers. Additionally, the presentation will introduce the statistical generation of the households. Overall, the session should be of interest to all conference participants dealing with data integration and the application of record linkage.
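
    The idea of linking records that share no direct identifier by comparing name strings can be illustrated with a minimal Python sketch. It is not the method used in the projects described above: the firm names, the list of legal-form suffixes, the two similarity measures and the 0.85 threshold are all assumptions chosen for illustration.

```python
from difflib import SequenceMatcher

# Assumed, incomplete list of legal-form suffixes to ignore when comparing names.
LEGAL_FORMS = {"gmbh", "ag", "kg", "co"}


def normalize(name: str) -> str:
    """Lower-case a name, replace punctuation with spaces, drop legal-form tokens."""
    cleaned = "".join(ch if ch.isalnum() or ch.isspace() else " " for ch in name.lower())
    return " ".join(tok for tok in cleaned.split() if tok not in LEGAL_FORMS)


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum number of insertions, deletions and substitutions."""
    previous = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        current = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def edit_similarity(a: str, b: str) -> float:
    """Edit distance rescaled to [0, 1], where 1.0 means identical strings."""
    longest = max(len(a), len(b))
    return 1.0 if longest == 0 else 1.0 - levenshtein(a, b) / longest


def ratio_similarity(a: str, b: str) -> float:
    """Similarity based on matching blocks, from the standard library."""
    return SequenceMatcher(None, a, b).ratio()


def best_match(name, candidates, similarity, threshold=0.85):
    """Return the most similar candidate, or None if none reaches the threshold."""
    if not candidates:
        return None
    target = normalize(name)
    score, match = max((similarity(target, normalize(c)), c) for c in candidates)
    return match if score >= threshold else None


# Hypothetical register entries and query names, invented for this sketch.
register = ["Müller Maschinenbau AG", "Schmidt Logistik KG"]
print(best_match("Mueller Maschinenbau GmbH", register, edit_similarity))
print(best_match("Mueller Maschinenbau GmbH", register, ratio_similarity))
print(best_match("Bäckerei Schulz", register, edit_similarity))
```

    Passing the similarity function as an argument makes it straightforward to run several comparison measures over the same candidate pairs and evaluate them against each other, which is the kind of testing the abstract alludes to.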

    Presentation:
  • German Census 2011 as a Mixed Method Design
    Anja Crössmann (Federal Statistical Office)

    [abstract]

    The planned session deals with the application of record linkage by several German institutions. The paper "German Record Linkage Center (GRLC)" describes the GRLC, a long-term infrastructure facility whose main goal is to increase the number and quality of record-linkage applications in order to increase the analytical power of existing data and to unlock new data sources for research. Afterwards, two practical applications of record linkage are presented. The paper "The project 'combined firm data for Germany': Access to combined business micro data" is about a research project carried out by different institutions which provide researchers with enterprise-level micro data. In the course of the project, data of the participating institutions have been merged. Because no direct identifier was available, the process of data integration has been very complex and time-consuming, and record linkage was used. In this context, different string comparisons had to be tested and evaluated. The final paper "German census 2011 as a mixed method design" has a related background: for this year's census it was decided, for the first time, not to interview every household but to use a register-based design. Thus, the final data will be a mix of a register-based complete census and a sample survey. In addition to describing the assessment method used, the presentation will introduce the way the collected datasets are merged. Methodologically, this procedure is interesting because the data used have no common identifiers. Additionally, the presentation will introduce the statistical generation of the households. Overall, the session should be of interest to all conference participants dealing with data integration and the application of record linkage.

    Presentation:

D1: Supporting online access to geospatial, micro-, and qualitative data (Thu, 2012-06-07)

  • VizLab: A Tool for the Interactive Exploration of Geospatial Election Data on the Web
    Adam Schaal (The Center for Socio-Political Data, Sciences Po University)

    [abstract]

    Advanced desktop tools for geospatial data management and visualization are nothing new. What is new are developments in advanced online tools that bring some of the same functionality to the web and, in doing so, make the data accessible to a wider audience. Each of these solutions has its strengths and weaknesses, depending on the goals of the application. With a lack of complete solutions (particularly non-commercial ones), custom application development is sometimes needed to meet an institution's goals. Requiring an online visualization and exploration tool for its election and demographic data holdings, the CDSP has developed an application that provides certain features key to its research community. Unlike with most existing tools, the user is able to create custom variables from those already available. Further, users can analyze multiple variables simultaneously through a combination of choropleths, proportional circles, charts, and tables. Also supported is the analysis of geospatial data across changing administrative boundaries. Additionally, the user is able to save the online application state for later analysis, as well as for sharing with colleagues. The online application was developed entirely with free and open source tools.

    Presentation:
  • Open Source Solutions for Open Microdata: The IHSN Tools
    Matthew Welch (World Bank)
    Olivier Dupriez (World Bank)

    [abstract]

    A key objective of the International Household Survey Network is to provide data management tools to producers of microdata. These cover all phases of survey implementation, from survey design to data dissemination. Products include a suite of free, open source, DDI compliant data curation tools, known as the IHSN Microdata Management Toolkit. The IHSN Microdata Management Toolkit is being used in National Statistics Offices, the main data producers in developing countries, as well as increasingly by Universities and large International Development Agencies. This presentation will discuss the latest version of our data dissemination application, the National Data Archive as well as our future roadmap for this tool. We will also introduce our Question Bank and our new citations tool and central survey catalogue. Usage examples will include those at a large International Development Agency, a University and some National Statistics Offices.

    Presentation:
  • Implementation of DDI in the National Institute of Statistics and Geography of Mexico
    Eric M Rodriguez (National Institute of Statistics and Geography)

    [abstract]

    The National Institute of Statistics and Geography (INEGI) of Mexico has more than 10 years of experience in producing metadata. This work has been developed to satisfy the different needs of information users. The adoption of DDI began in 2009, with the objective of integrating the metadata projects to produce more detailed metadata and to develop a metadata system. Adopting the standard involved reviewing the different metadata, producing a metadata vocabulary for INEGI, and developing materials to facilitate documentation. In this document we detail the activities carried out for the adoption of DDI at INEGI, as well as the progress made and the prospects for the future.

D2: Infrastructure to support research data management (Thu, 2012-06-07)

  • Managing and Sharing Data within the Collaborative Research Center SFB882: Concepts and Requirements of a VRE for the Social Sciences
    Johanna Vompras (University Library Bielefeld)
    Wolfram Horstmann (University Library Bielefeld)

    [abstract]

    The recently founded Collaborative Research Center (SFB) 882 "From Heterogeneities to Social Inequalities" at Bielefeld University provides a framework for 17 sub-projects on social inequalities. By promoting a Virtual Research Environment (VRE), the INF project (an integral part of the SFB) will provide advisory and development services in the domain of information infrastructure. The VRE combines a work platform and a project-specific research platform. The work platform bundles IT resources by bringing together various tools for administration, project management, and time- and location-independent collaborative work in a single environment adapted to researchers' specific working processes. The research component combines data management with further developments of social science methodologies. It provides services for archiving and re-use of datasets and is responsible for the infrastructural and methodological coordination of the data documentation. Almost the entire data life cycle is documented by implementing and applying DDI3 across all projects. In this talk, we present the challenges and barriers in designing and building a VRE for the social sciences. We mainly focus on documentation aspects, especially on the unification of the varying documentation requirements that arose. Additionally, various key issues of research collaboration, identified in the user requirement analysis, will be discussed.

    Presentation: