NIH 2011 Open Standards-Based Data Extraction Web Tool For Complex Longitudinal Datasets

Open Standards-Based Data Extraction Web Tool For Complex Longitudinal Datasets
Award last edited on: 10/7/11

Awarding Agency

NIH : NIA

Total Award Amount

$150,000

Award Phase

Solicitation Topic Code

-----

Principal Investigator

Dan Smith

Algenta Technologies LLC (AKA: Dragonmount Networks)

1428 Washington Avenue South Suite 203
Minneapolis, MN 55454

(608) 213-1637

jeremy@algenta.com

www.algenta.com

Location: Single
Congr. District: 05
County: Hennepin

Phase I

Contract Number: 1R43AG039898-01
Start Date: 4/1/11 Completed: 9/30/11

Phase I year

2011

Phase I Amount

$150,000

The NIH funds many long-running longitudinal studies that have collected massive amounts of data. These surveys continue to add data collection waves to their datasets, which increases the wealth of information collected. Unfortunately, as waves are added, the data becomes more complex for researchers to work with. This is especially true when a study contains many thousands of variables. Researchers are often interested in a subset of the data pertaining to their research question, but have to traverse multiple data files and many pages of documentation to find the variables associated with their topic. Replicating research is also challenging, since it often involves repeating these same burdensome steps. To address this challenge, researchers need advanced tools to navigate and extract data from large longitudinal studies. This project aims to create an open-standards-based web tool that provides data extracts from large public-use longitudinal surveys. The tool will allow researchers to select variables and variable groups to create data extracts. It will also create codebook documentation and standardized Data Documentation Initiative (DDI) 3 metadata for the extracts, enabling citation of an extract using the DDI standard. By using the DDI open standard for social science research, the tool will be generalized to work for multiple studies, which is an innovation over today's generation of one-off tools developed on a per-study basis. This Phase I feasibility study aims to analyze the data preparation and metadata creation workflow needed to prepare a study for online data extraction, to validate the use of the Data Documentation Initiative's DDI 3 standard as the basis of such a tool, and to create prototype web-based data extraction software. While the focus is on longitudinal surveys, the proposed system would also handle cross-sectional, time-series, and non-repeated studies.
The aim is to improve research methodologies through a simplification of the process used for discovering, retrieving, and analyzing data relevant to a researcher's investigation.

Public Health Relevance:
Researchers who wish to use public-use data from longitudinal studies or replicate others' research must currently navigate thousands of variables across multiple waves and datasets to answer simple analysis questions. The proposed web tool allows researchers to create data extracts that are directly related to their queries, allowing more time to be spent on public health research questions instead of data management.

Thesaurus Terms:
Address;Analysis, Data;Area;Benchmarking;Best Practice Analysis;Complex;Computer Programs;Computer Software;Custom;Data;Data Analyses;Data Banks;Data Bases;Data Collection;Data Files;Data Set;Databank, Electronic;Databanks;Database, Electronic;Databases;Dataset;Development;Documentation;Feasibility Studies;Funding;Generations;Goals;Ingestion;Internet;Investigation;Investigators;Language;Longitudinal Studies;Longitudinal Surveys;Maps;Measures;Metadata;Methodology, Research;Methods;Modeling;Nih;National Institutes Of Health;National Institutes Of Health (U.S.);On-Line Systems;Online Systems;Pain;Painful;Phase;Preparation;Process;Programs (Pt);Programs [publication Type];Research;Research Methodology;Research Methods;Research Personnel;Researchers;Running;Sbir;Sbirs (R43/44);Series;Services;Small Business Innovation Research;Small Business Innovation Research Grant;Social Sciences;Software;Solutions;Staging;Survey Instrument;Surveys;System;System, Loinc Axis 4;Testing;Time;United States National Institutes Of Health;Www;Work;Writing;Base;Clinical Data Repository;Clinical Data Warehouse;Computer Program /Software;Computer Program/Software;Data Management;Data Modeling;Data Repository;Improved;Innovate;Innovation;Innovative;Interest;Long-Term Study;Longitudinal Database;Model;Online Computer;Programs;Prototype;Public Health Research;Relational Database;Social Science Research;Tool;Web;Web Based;World Wide Web

Phase II

Contract Number: ----------
Start Date: 00/00/00 Completed: 00/00/00

Phase II year

----

Phase II Amount

----

SBIR-STTR Award
