2018
DataMed – an open source discovery index for finding biomedical datasets
Chen X, Gururaj A, Ozyurt B, Liu R, Soysal E, Cohen T, Tiryaki F, Li Y, Zong N, Jiang M, Rogith D, Salimi M, Kim H, Rocca-Serra P, Gonzalez-Beltran A, Farcas C, Johnson T, Margolis R, Alter G, Sansone S, Fore I, Ohno-Machado L, Grethe J, Xu H. DataMed – an open source discovery index for finding biomedical datasets. Journal Of The American Medical Informatics Association 2018, 25: 300-308. PMID: 29346583, PMCID: PMC7378878, DOI: 10.1093/jamia/ocx121.
Peer-Reviewed Original Research
Concepts: Ingestion pipeline, Biomedical datasets, Search engines, Biomedical domain, Advanced natural language processing, Relevant datasets, User-entered query, Data discovery system, Unified metadata model, Data ingestion pipelines, Natural language processing, Open-source package, Retrieval engine, Terminology services, Metadata model, Metadata information, Discovery system, Data reuse, DataMed, Benchmark datasets, Biomedical data, Data index, Average precision, Language processing, Source package
2017
User needs analysis and usability assessment of DataMed – a biomedical data discovery index
Dixit R, Rogith D, Narayana V, Salimi M, Gururaj A, Ohno-Machado L, Xu H, Johnson T. User needs analysis and usability assessment of DataMed – a biomedical data discovery index. Journal Of The American Medical Informatics Association 2017, 25: 337-344. PMID: 29202203, PMCID: PMC7378884, DOI: 10.1093/jamia/ocx134.
Peer-Reviewed Original Research
Concepts: Data discovery, Usability evaluation, Information needs, User interface, Biomedical data, Iterative usability evaluations, Information retrieval tools, User interface needs, High-quality metadata, Researchers information, Common search engines, Discovery system, Retrieval tools, DataMed, User study, Relevance judgments, Search engines, User needs, Dataset exploration, Usability assessment, Retrieval techniques, New retrieval technique, Incomplete metadata, Metadata, Users
Search Datasets in Literature: A Case Study of GWAS.
Dong X, Zhang Y, Xu H. Search Datasets in Literature: A Case Study of GWAS. AMIA Joint Summits On Translational Science Proceedings 2017, 2017: 40-49. PMID: 28815103, PMCID: PMC5543360.
Peer-Reviewed Original Research
Concepts: Recognition system, MEDLINE abstracts, Dataset search engine, Pattern-based rules, Text mining methods, Data sets, Underlying data set, Search datasets, Data discoverability, Use cases, Search engines, Dataset attributes, Mining methods, F-measure, Domain dictionary, Scalable approach, Hybrid approach, DatasetFinder, Retrieving literature, Discoverability, Ultimate goal, Case study, Set, Scientific publications
DATS, the data tag suite to enable discoverability of datasets
Sansone S, Gonzalez-Beltran A, Rocca-Serra P, Alter G, Grethe J, Xu H, Fore I, Lyle J, Gururaj A, Chen X, Kim H, Zong N, Li Y, Liu R, Ozyurt I, Ohno-Machado L. DATS, the data tag suite to enable discoverability of datasets. Scientific Data 2017, 4: 170059. PMID: 28585923, PMCID: PMC5460592, DOI: 10.1038/sdata.2017.59.
Peer-Reviewed Original Research
Finding useful data across multiple biomedical data repositories using DataMed
Ohno-Machado L, Sansone S, Alter G, Fore I, Grethe J, Xu H, Gonzalez-Beltran A, Rocca-Serra P, Gururaj A, Bell E, Soysal E, Zong N, Kim H. Finding useful data across multiple biomedical data repositories using DataMed. Nature Genetics 2017, 49: 816-819. PMID: 28546571, PMCID: PMC6460922, DOI: 10.1038/ng.3864.
Peer-Reviewed Original Research
Concepts: Biomedical data repositories, Health big data, Data sets, Knowledge discovery, Big data, Multiple repositories, Search engines, Data index, FAIR principles, DataMed, Data repository, Service providers, Knowledge initiatives, Knowledge experts, Biomedical research community, Research community, Repository, Science landscape, Useful data, Interoperability, Metadata, Findability, Set, Engine, Data