- Data Sets
- Training Resources
MetaSRA: normalized metadata for the Sequence Read Archive
MetaSRA is an annotation/re-coding of sample-specific metadata in the Sequence Read Archive using biomedical ontologies. Currently, MetaSRA maps biological samples to biologically relevent terms in the Disease Ontology, Experimental Factor Ontology, Cell Ontology, Uberon, and Cellosaurus.
This pipeline was used to construct the MetaSRA database which provides normalized metadata for the Sequence Read Archive .This pipeline re-annotates key-value descriptions of biological samples using biomedical ontologies.
Structure-leveraged methods in breast cancer risk prediction
Modeling the temporal evolution of postoperative complications
CPCP Privacy Symposium 2016: Privacy Preserving Federated Biomedical Data Analysis Symposium Video
Learn about the challenges associated with the technical approaches for utilizing data from multiple sources to build more accurate machine learning algorithms from Dr. Xiaoquian Jiang. We know that having more types of data and data from distributed sources provides a stronger platform for research and discovering with machine learning. To address privacy in this context, Dr. Jiang proposes a privacy-preserving distributed data framework and describes various models implemented to solve the such problems. Dr. Jiang's research group has produced versions of this framework in R and Java as well as an online web-service. All version of this framework are available for other researchers to use for their own analysis.
CPCP Privacy Symposium 2016: Privacy is an Essentially Contested Concept Symposium Video
What does privacy mean in the context of Big Data? Dr. Deirdre Mulligan discusses various definitions of privacy in law, philosophy and computer science. Traditional approaches to privacy in data place most of the responsibility for the control of private information flow on the individual with mechanisms such as consent. This idea, known as informational actualization, has limitations that have been exposed by machine learning on big data. These limitations cause violations of privacy such as uncovering identity of individuals where it has been withheld or unexpected inferences made from data that have been intentionally disclosed. Dr. Mulligan suggests new ways of viewing privacy that evolve as social life and technology change.
CPCP Privacy Symposium 2016: Proving that Programs Do Not Discriminate Symposium Video
As the field of Artificial Intelligence (AI) continues to advance, an increasing number of prediction are made by computer programs about humans. These predictions affect decisions made about humans in a wide variety for areas including decisions about: who should get the job, the bank loan, or early release from prison. As we increasingly rely on AI programs to help make decisions about peoples lives, it becomes vitally important that we are able to ensure the programs we are depending on do not have an unfairly biased against certain groups of people. Dr. Aws Albarghouthi of the University of Wisconsin - Madison Computer Sciences department uses his expertise in programming languages to address this issue of fairness.
CPCP Privacy Symposium 2016: Panel Discussion Symposium Video
Dr. Pilar Ossorio from Morgridge Institute for Research at the University of Wisconsin-Madison, and Dr. Peggy Peissig from the Biomedical Informatics Research Foundation join Dr. Aws Albarghouthi, Dr. Deirdre Mulligan, and Dr. Xiaoquian Jiang to answer questions from the audience about privacy and fairness in the context of computational analysis.
CPCP Privacy Symposium 2016: Welcome Symposium Video
Dr. Pilar Ossorio and Dr. Mark Craven welcome attendees to the 2016 Big Privacy Symposium. At the Center for Predictive Computational Phenotyping (CPCP) we have expertise in the ethical and legal aspects of analyzing complex datasets. Experts in these fields work with our experts in computational analysis to ensure that new methods for improving human health developed at CPCP are discovered and used in a legally and ethically manner.
Adaptive signal recovery on graphs via harmonic analysis for experimental design in neuroimaging