national family survey of pregancy
https://www.cdc.gov/nchs/nsfg/index.htm
todo: request to restrickted access variables.
This site is to serve as my note-book and to effectively communicate with my students and collaborators. Every now and then, a blog may be of interest to other researchers or teachers. Views in this blog are my own. All rights of research results and findings on this blog are reserved. See also http://youtube.com/c/hongqin @hongqin
national family survey of pregancy
https://www.cdc.gov/nchs/nsfg/index.htm
todo: request to restrickted access variables.
The Chan Zuckerberg Initiative (CZI) supports several projects involving Asian single-cell genomics data with age-related metadata through its CELLxGENE Discover platform and affiliated research networks. Here are key resources:
---
## **Asian Immune Diversity Atlas (AIDA)**
**Focus**: Immune cell variation across six Asian populations (Chinese, Japanese, Korean, Indian, Malay, Thai)[1][4].
**Methods**:
- Single-cell transcriptomics (5’ scRNA-seq)
- TCR/BCR profiling
- Enhancer RNA analysis
**Cohort**:
- Over 2,000 healthy individuals spanning 20 ancestrally diverse Asian groups
- Includes non-urban and underrepresented populations
- Analyzes effects of **age**, ancestry, sex, and environmental factors on immune phenotypes[4].
**Access**: Data will contribute to the Human Cell Atlas and likely integrate with CZ CELLxGENE Discover[1][4].
---
## **CZ CELLxGENE Discover Platform**
**Key Features**:
- Hosts >85 million single cells from 1,317 datasets
- Includes metadata filters for **age**, cell type, and tissue[5][6].
- Tools for analysis, visualization, and AI model training
**Relevant Data**:
- Search using filters like `organism: Homo sapiens`, `tissue: blood`, and `ethnicity: Asian`
- Age metadata available for many datasets (e.g., developmental stages, adult vs. aging)[5][6][9].
---
## **Billion Cells Project**
**Partners**: 10x Genomics, Ultima Genomics
**Scope**:
- Includes primary human cell models, potentially covering Asian cohorts
- Focuses on genetic perturbations across cell types and tissues[2][7].
---
## **How to Access**
1. Visit [CZ CELLxGENE Discover](https://chanzuckerberg.github.io/cellxgene-census/)
2. Use the Census API or PyTorch loaders for large-scale analysis[6][9].
3. Filter datasets using variables like `donor_age` and `ethnicity`.
For AIDA-specific data, monitor updates via the [Human Cell Atlas-Asia network](https://chanzuckerberg.com/science/programs-resources/cell-science/ancestry-networks/)[1][4].
Citations:
[1] https://chanzuckerberg.com/science/programs-resources/cell-science/seednetworks/asian-immune-diversity-atlas/
[2] https://www.insideprecisionmedicine.com/topics/precision-medicine/chan-zuckerberg-initiative-10x-genomics-and-ultima-genomics-launch-billion-cells-project/
[3] https://chanzuckerberg.com/science/programs-resources/cell-science/seednetworks/
[4] https://chanzuckerberg.com/science/programs-resources/cell-science/ancestry-networks/immune-cell-atlas-of-asian-populations/
[5] https://www.nature.com/articles/d41586-024-01217-y
[6] https://chanzuckerberg.github.io/cellxgene-census/
[7] https://www.prnewswire.com/news-releases/chan-zuckerberg-initiative-launches-billion-cells-project-with-10x-genomics-and-ultima-genomics-to-advance-ai-in-biology-302369647.html
[8] https://www.genomeweb.com/genetic-research/chan-zuckerberg-initiative-launches-new-york-biohub-immune-cell-based-early
[9] https://www.biorxiv.org/content/10.1101/2023.10.30.563174v1.full.pdf
[10] https://www.science.org/doi/10.1126/science.abf1970
[11] https://www.science.org/doi/10.1126/science.abf3041
---
Answer from Perplexity: pplx.ai/share
https://pmc.ncbi.nlm.nih.gov/articles/PMC9871912/
The YRBSS, conducted by the Centers for Disease Control and Prevention (CDC), monitors health-related behaviors in youth, including experiences with cyberbullying and its impact on mental health.
The HBSC study, a cross-national research study conducted in collaboration with the World Health Organization, collects data on adolescents' health and well-being, including experiences with cyberbullying and mental health outcomes.
Add Health is a nationally representative study that explores the health behaviors of adolescents and young adults in the U.S. It includes data on cyberbullying and mental health.
ESPAD collects data on substance use and other health-related behaviors among European adolescents, including questions related to cyberbullying and mental health.
The NCVS-SCS, conducted by the Bureau of Justice Statistics, collects data on school-related victimization, including experiences with cyberbullying and its impact on students' mental health.
Global Kids Online is an international research project that provides data on children's online activities, risks, and opportunities, including cyberbullying and its effects on mental health.
YISS, conducted by the Crimes Against Children Research Center, explores online experiences of youth, including cyberbullying and its psychological impact.
These datasets are valuable for researchers interested in studying the relationship between cyberbullying and mental health. To access these datasets, researchers typically need to follow specific protocols, which may include applying for access, providing a research proposal, and agreeing to terms of use.
Exploring these datasets can provide insights into the prevalence and effects of cyberbullying across different populations, helping to inform prevention and intervention strategies aimed at mitigating its impact on mental health.
https://www.nimhd.nih.gov/resources/schare/
Jackson Heart Study (JHS):
ScHARe Data Ecosystem:
1. **Social determinants of health data**:
- URL: [Social Determinants of Health Data](https://healthdata.gov/dataset/social-determinants-health)
2. **Genomic data**:
- URL: [National Center for Biotechnology Information (NCBI)](https://www.ncbi.nlm.nih.gov/)
- URL: [Ensembl Genome Browser](https://www.ensembl.org/)
3. **Data on PTSD and burnout among clinicians**:
- URL: [National Institute of Mental Health (NIMH) Data Archive](https://nda.nih.gov/)
4. **Referral networks data**:
- This data may be more specific and proprietary, typically found through healthcare providers or specific research collaborations.
5. **Psychiatric patient data**:
- URL: [National Institute of Mental Health (NIMH) Data Archive](https://nda.nih.gov/)
6. **Health records**:
- URL: [HealthData.gov](https://www.healthdata.gov/)
- URL: [Centers for Medicare & Medicaid Services (CMS)](https://www.cms.gov/Research-Statistics-Data-and-Systems/Research-Statistics-Data-and-Systems)
7. **Clinical trial data**:
- URL: [ClinicalTrials.gov](https://clinicaltrials.gov/)
8. **Population health data**:
- URL: [World Health Organization (WHO) Global Health Observatory](https://www.who.int/data/gho)
- URL: [HealthData.gov](https://www.healthdata.gov/)
9. **Electronic health records**:
- Typically proprietary, but some de-identified data sets can be found at:
- URL: [MIMIC-III Clinical Database](https://mimic.physionet.org/)
10. **Survey data**:
- URL: [CDC Behavioral Risk Factor Surveillance System (BRFSS)](https://www.cdc.gov/brfss/index.html)
- URL: [National Health Interview Survey (NHIS)](https://www.cdc.gov/nchs/nhis/index.htm)
11. **Public health datasets**:
- URL: [HealthData.gov](https://www.healthdata.gov/)
- URL: [Centers for Disease Control and Prevention (CDC) Data and Statistics](https://www.cdc.gov/datastatistics/)
12. **Machine learning datasets**:
- URL: [UCI Machine Learning Repository](https://archive.ics.uci.edu/ml/index.php)
- URL: [Kaggle Datasets](https://www.kaggle.com/datasets)
13. **Biomedical data**:
- URL: [Bioinformatics.org](https://www.bioinformatics.org/)
- URL: [NIH Database of Genotypes and Phenotypes (dbGaP)](https://www.ncbi.nlm.nih.gov/gap)
14. **Hospital records**:
- Typically proprietary, but aggregated data can be found at:
- URL: [HealthData.gov](https://www.healthdata.gov/)
- URL: [American Hospital Association (AHA) Data](https://www.aha.org/data)
15. **Mental health data**:
- URL: [National Institute of Mental Health (NIMH) Data Archive](https://nda.nih.gov/)
16. **AI-generated data**:
- This data is typically generated within research projects or specific AI applications, but some examples can be found in open repositories:
- URL: [OpenAI](https://openai.com/)
- URL: [Hugging Face Datasets](https://huggingface.co/datasets)
The National Institute of Mental Health Data Archive (NDA) makes available human subjects data collected from hundreds of research projects across many scientific domains. NDA provides infrastructure for sharing research data, tools, methods, and analyses enabling collaborative science and discovery. De-identified human subjects data, harmonized to a common standard, are available to qualified researchers. Summary data are available to all.
The NDA mission is to accelerate scientific research and discovery through data sharing, data harmonization, and the reporting of research results.
NIMH common data elements now available: Go to NIMH Common Data Elements
https://mobilize.stanford.edu/data/available-datasets/
https://simtk.org/frs/index.php?group_id=285
PUSLE PUF file
In the context of Census data, PUF stands for Public Use File1. A Public Use File (PUF) is a dataset that contains individual responses to survey questions1. These files are used to create custom tabulations and allow users to delve further into the rich detail collected in surveys like the Household Pulse Survey (HPS)1.
Each weekly HPS microdata file is released two weeks after the Household Pulse Survey Data Tables1. Each of these weekly releases includes a Public Use Data File (PUF), a replicate weight data file, and a data dictionary1.
The PUFs contain individual responses to the survey questions and can be downloaded in various formats like SAS and CSV1. They provide researchers and the public with access to detailed census data, while still protecting the confidentiality of respondents12.
Household Pulse Survey Public Use File (PUF) (census.gov)
synthetic patient data
https://synthetichealth.github.io/synthea/
https://www.mitre.org/news-insights/impact-story/mitre-created-synthea-designated-digital-public-good
electronic microscopy data bank
https://www.ebi.ac.uk/emdb/
https://data.cdc.gov/NCHS/Provisional-COVID-19-Deaths-by-Sex-and-Age/9bhg-hcku