2023
Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions
Kuo T, Pham A, Edelson M, Kim J, Chan J, Gupta Y, Ohno-Machado L, Anderson D, Balacha C, Bath T, Baxter S, Becker-Pennrich A, Bell D, Bernstam E, Ngan C, Day M, Doctor J, DuVall S, El-Kareh R, Florian R, Follett R, Geisler B, Ghigi A, Gottlieb A, Hinske L, Hu Z, Ir D, Jiang X, Kim K, Kim J, Knight T, Koola J, Kuo T, Lee N, Mansmann U, Matheny M, Meeker D, Mou Z, Neumann L, Nguyen N, Nick A, Ohno-Machado L, Park E, Paul P, Pletcher M, Post K, Rieder C, Scherer C, Schilling L, Soares A, SooHoo S, Soysal E, Steven C, Tep B, Toy B, Wang B, Wu Z, Xu H, Yong C, Zheng K, Zhou Y, Zucker R. Blockchain-enabled immutable, distributed, and highly available clinical research activity logging system for federated COVID-19 data analysis from multiple institutions. Journal Of The American Medical Informatics Association 2023, 30: 1167-1178. PMID: 36916740, PMCID: PMC10198529, DOI: 10.1093/jamia/ocad049.Peer-Reviewed Original ResearchConceptsFederated data analysisUser activity logsSmart contract deploymentRun-time efficiencyData analysis systemData analysis activitiesActivity logsData discoveryQuerying timeBlockchain systemBlockchain technologyNetwork transactionsCOVID-19 data analysisMultiple institutionsLow deploymentBlockchainGitHub repositoryMultiple nodesLarge networksQueriesAnalysis activitiesHigh availabilityLanguage codeBaseline solutionData analysis
2022
The evolving privacy and security concerns for genomic data analysis and sharing as observed from the iDASH competition
Kuo T, Jiang X, Tang H, Wang X, Harmanci A, Kim M, Post K, Bu D, Bath T, Kim J, Liu W, Chen H, Ohno-Machado L. The evolving privacy and security concerns for genomic data analysis and sharing as observed from the iDASH competition. Journal Of The American Medical Informatics Association 2022, 29: 2182-2190. PMID: 36164820, PMCID: PMC9667175, DOI: 10.1093/jamia/ocac165.Peer-Reviewed Original ResearchConceptsSensitive personal informationGenomic data analysisPotential future research directionsPersonal informationSecurity concernsGenomics data repositoryData repositoryReport lessonsProtection techniquesFuture research directionsPrivacyResearch directionsData usePractical challengesGenomic dataData analysisAnonymizationCommunity effortsRepositorySecurityBiomedical researchInformationDataChallenges
2021
Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation
Kim M, Harmanci A, Bossuat J, Carpov S, Cheon J, Chillotti I, Cho W, Froelicher D, Gama N, Georgieva M, Hong S, Hubaux J, Kim D, Lauter K, Ma Y, Ohno-Machado L, Sofia H, Son Y, Song Y, Troncoso-Pastoriza J, Jiang X. Ultrafast homomorphic encryption models enable secure outsourcing of genotype imputation. Cell Systems 2021, 12: 1108-1120.e4. PMID: 34464590, PMCID: PMC9898842, DOI: 10.1016/j.cels.2021.07.010.Peer-Reviewed Original ResearchConceptsHomomorphic encryption techniqueResource-intensive computationsSecure outsourcingGenomic data analysisData securityEncryption modelEncryption techniquePrivacy concernsSource codeMemory requirementsGenetic data analysisData analysisComparable accuracyFundamental stepGenotype imputationImputationDownloadSecurityOutsourcingComputationCodeServicesRequirementsAccuracyMethod
2019
Secure and Differentially Private Logistic Regression for Horizontally Distributed Data
Kim M, Lee J, Ohno-Machado L, Jiang X. Secure and Differentially Private Logistic Regression for Horizontally Distributed Data. IEEE Transactions On Information Forensics And Security 2019, 15: 695-710. DOI: 10.1109/tifs.2019.2925496.Peer-Reviewed Original ResearchPrivacy-preserving modelHomomorphic encryption techniqueDifferential privacy methodReal-world datasetsPrivacy methodsPrivate dataSensitive dataEncryption techniqueSecurity methodsDifferential privacyInformation leakageNaive solutionPrivacyNatural wayGood accuracyScientific collaborationData analysisEncouraging resultsMajor concernSecurityDatasetPotential leakageComputationScenariosPracticability
2015
VERTIcal Grid lOgistic regression (VERTIGO)
Li Y, Jiang X, Wang S, Xiong H, Ohno-Machado L. VERTIcal Grid lOgistic regression (VERTIGO). Journal Of The American Medical Informatics Association 2015, 23: 570-579. PMID: 26554428, PMCID: PMC4901373, DOI: 10.1093/jamia/ocv146.Peer-Reviewed Original ResearchConceptsFederated data analysisReal-world medical classification problemsMedical classification problemsLogistic regression algorithmAccurate global modelData setsReal data setsClassification problemExchange of informationLR problemTime complexityComputational complexityExpensive operationRegression algorithmComputational costData analysisAlgorithmDual optimizationTechnical challengesLarge amountComplexityPatient recordsLR modelNovel techniqueHessian matrix
2014
Development of a Web Service for Analysis in a Distributed Network
Jiang X, Wu Y, Marsolo K, Ohno-Machado L. Development of a Web Service for Analysis in a Distributed Network. Healthcare 2014, 2: 1053. PMID: 25848586, PMCID: PMC4371401, DOI: 10.13063/2327-9214.1053.Peer-Reviewed Original ResearchWeb servicesCross-platform deploymentSoftware development processUser interface developmentNetwork security personnelUser interface designDevelopment processProject management systemGoogle CodeApache SubversionInterface developmentUser experiencePlatform deploymentSystem testingFunctional specificationManagement systemPotential usersSecurity personnelAlgorithm validationConcept proofDevelopment approachData analysisFeasibility of buildingModel constructionUsersMAGI: a Node.js web service for fast microRNA-Seq analysis in a GPU infrastructure
Kim J, Levy E, Ferbrache A, Stepanowsky P, Farcas C, Wang S, Brunner S, Bath T, Wu Y, Ohno-Machado L. MAGI: a Node.js web service for fast microRNA-Seq analysis in a GPU infrastructure. Bioinformatics 2014, 30: 2826-2827. PMID: 24907367, PMCID: PMC4173015, DOI: 10.1093/bioinformatics/btu377.Peer-Reviewed Original ResearchConceptsWeb servicesWeb reportsLarge input filesNovel feature extractionEnd performance improvementsExploration of resultsGPU infrastructureInteractive visualizationJavaScript frameworkParallel computingGPU devicesHypertext PreprocessorCUDA CFeature extractionDrop operationInput filesPlot generationSalient featuresPerformance improvementInfrastructureNodesServicesData analysisComputingBrowser
2004
A primer on gene expression and microarrays for machine learning researchers
Kuo W, Kim E, Trimarchi J, Jenssen T, Vinterbo S, Ohno-Machado L. A primer on gene expression and microarrays for machine learning researchers. Journal Of Biomedical Informatics 2004, 37: 293-303. PMID: 15465482, DOI: 10.1016/j.jbi.2004.07.002.Peer-Reviewed Reviews, Practice Guidelines, Standards, and Consensus StatementsConceptsNew algorithmSupervised learning modelUCI machineLearning modelMicroarray data analysisAlgorithmic developmentsTypes of dataMachineData setsMain challengesGene expression dataMain motivationAlgorithmData analysisBiomedical experimentsLarge numberExpression dataMicroarray dataResearchersRepositoryWebMicroarray experimentsNew waveDataSet
2002
Visualization and evaluation of clusters for exploratory analysis of gene expression data
Kim J, Kohane I, Ohno-Machado L. Visualization and evaluation of clusters for exploratory analysis of gene expression data. Journal Of Biomedical Informatics 2002, 35: 25-36. PMID: 12415724, DOI: 10.1016/s1532-0464(02)00001-1.Peer-Reviewed Original ResearchConceptsClustering algorithmDifferent clustering algorithmsPopular clustering algorithmNew clustering algorithmComprehensive data visualizationGene expression data analysisData visualization strategiesExpression data analysisEvaluation of clustersData visualizationSoftware toolsCluster qualityCluster consistencyAlgorithmActual implementationData setsGene expression dataQuality measuresVisualizationPromising resultsFrameworkData analysisObjective evaluationUsersExpression data