This site is to serve as my note-book and to effectively communicate with my students and collaborators. Every now and then, a blog may be of interest to other researchers or teachers. Views in this blog are my own. All rights of research results and findings on this blog are reserved. See also http://youtube.com/c/hongqin @hongqin
Wednesday, September 30, 2020
Genome-wide analysis of SARS-CoV-2 virus strains circulating worldwide implicates heterogeneity
worldmet
Tuesday, September 29, 2020
EpiEstim cran pacakge
EpiNow2 package
Monday, September 28, 2020
possible criteria of a head.
Friday, September 25, 2020
down load ERA5 land hourly data
ERA5 land hours data was last updated on July 12.
I selected all 2020, all day, and all hours, it is 45.8 GB, which will take 12 hours to download at home.
I ssh into my simcenter workstation,
wget url,
which runs with an estimated time of a little over 2 hours.
Kalman filter versus smoothing splines
https://stats.stackexchange.com/questions/15070/kalman-filter-vs-smoothing-splines
Wednesday, September 23, 2020
beautiful soup
Beautiful Soup is a Python library for pulling data out of HTML and XML files. It works with your favorite parser to provide idiomatic ways of navigating, searching, and modifying the parse tree. It commonly saves programmers hours or days of work.
https://www.crummy.com/software/BeautifulSoup/bs4/doc/
Monday, September 21, 2020
notes, NSF AI institute meeting
Dear Panelists: Please be advised that you are in Listen Mode Only. Thank you. Link to FAQ: https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf20123 Link to FAQ: https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf20123
Link to FAQ: https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf20123
From MEM AV TECH-Angel Ntumy to All panelists and other attendees: (15:47)
https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf20123 https://www.nsf.gov/publications/pub_summ.jsp?ods_key=nsf20123
https://www.captionedtext.com/client/event.aspx?EventID=4573120&CustomerID=321
Dear attendees, more information of the Theme 5 - AI Institute in Dynamic Systems Webinar is available at https://www.nsf.gov/events/event_summ.jsp?cntn_id=301241&WT.mc_id=USNSF_13&WT.mc_ev=click
Friday, September 18, 2020
The DOD NDSEG Fellowship program
The DOD NDSEG Fellowship program is now open for applications. Please pass this opportunity along to your eligible students! https://ndseg.sysplus.com/NDSEG/About/
The fellowship lasts for 3 years and pays for full tuition and all mandatory fees; a monthly stipend ($38,400 annually); a $5,000 travel budget over the Fellow’s tenure for professional development; and up to $1,200 a year in health insurance. Applications are due November 2, 2020, and students pursuing any graduate degree in an appropriate field may apply (the field are what you might expect—math, physics, CS, pretty much any kind of engineering).
Eligibility
- The NDSEG Fellowship Program is open only to U.S. citizens or U.S. nationals. Dual citizens may apply.
- Eligible applicants are required to be enrolled in their final year of undergraduate studies through the second year of a traditional PhD program. See below for a breakout of eligibility criteria:
- Undergraduate Students (degree must be received prior to the Fellowship Start date)
- Recent Bachelor's Degree Recipients
- Graduate Students (1st year, 2nd year)
- Recent Master's Degree Recipients
- Pre-Doctoral Students (1st year, 2nd year)
- Dual MD-PHD Students
- Dual PSYD-PHD Students
- Applicants must plan to enroll in a full-time program at an accredited U.S. institution of higher education leading to graduate degrees in fields specified in the DoD services Broad Agency Announcements (BAAs) research and development discipline
Thursday, September 17, 2020
tidyverse, ggplot, FAQ
# Count the number of full duplicates
sum(duplicated(bike_share_rides))
# Remove duplicates
bike_share_rides_unique <- distinct(bike_share_rides)
# Count the full duplicates in bike_share_rides_unique
sum(duplicated(bike_share_rides_unique))
# Find duplicated ride_ids
bike_share_rides %>%
count(ride_id) %>%
filter(n > 1)
# Remove full and partial duplicates
bike_share_rides_unique <- bike_share_rides %>%
# Only based on ride_id instead of all cols
distinct(ride_id, .keep_all = TRUE)
bike_share_rides %>%
# Group by ride_id and date
group_by(ride_id, date) %>%
# Add duration_min_avg column
mutate(duration_min_avg = mean(duration_min)) %>%
# Remove duplicates based on ride_id and date, keep all cols
distinct(ride_id, date, .keep_all = TRUE) %>%
# Remove duration_min column
select(-duration_min)
# Find bad dest_size rows
sfo_survey %>%
# Join with dest_sizes data frame to get bad dest_size rows
anti_join(dest_sizes, by = "dest_size") %>%
# Select id, airline, destination, and dest_size cols
select(id, airline, destination, dest_size)
# Add new columns to sfo_survey
sfo_survey <- sfo_survey %>%
# dest_size_trimmed: dest_size without whitespace
mutate(dest_size_trimmed = str_trim(dest_size),
# cleanliness_lower: cleanliness converted to lowercase
cleanliness_lower = str_to_lower(cleanliness))
# Count values of dest_size_trimmed
sfo_survey %>%
count(dest_size_trimmed)
# Count values of cleanliness_lower
sfo_survey %>%
count(cleanliness_lower)
# Count categories of dest_region
sfo_survey %>%
count(dest_region)
# Categories to map to Europe
europe_categories <- c("EU", "eur", "Europ")
# Add a new col dest_region_collapsed
sfo_survey %>%
# Map all categories in europe_categories to Europe
mutate(dest_region_collapsed = fct_collapse(dest_region,
Europe = europe_categories)) %>%
# Count categories of dest_region_collapsed
count(dest_region_collapsed)
sfo_survey %>%
filter(str_detect(phone, "-"))
Wednesday, September 16, 2020
Outside Interest Disclosure Form, UTC
Outside Interest Disclosure Form, UTC
*** government response stringency index, COVID19
university of oxford
COVID19
https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker
https://github.com/OxCGRT/covid-policy-tracker/raw/master/data/timeseries/OxCGRT_timeseries_all.xlsx
https://github.com/OxCGRT/covid-policy-tracker
this is related to Twitter sentiment analysis
Tuesday, September 15, 2020
mass spec data base
MassIVE.quant: a community resource of quantitative mass spectrometry–based proteomics datasets
x
https://www.nature.com/articles/s41592-020-0955-0
Sunday, September 13, 2020
resampling white-box to examine deep learning
Deep learning for gravitational-wave data analysis: A resampling white-box approach
Manuel D. Morales, Javier M. Antelis, Claudia Moreno, Alexander I. Nesterov
x
https://arxiv.org/abs/2009.04088?fbclid=IwAR3INBZBVRi7JL0tyeoHc2yM2W0LhSRssaErdODwhfA664ht-vGS9OZGM-4
zhang genome wide SARS- cov2 protein structure modeling
https://zhanglab.ccmb.med.umich.edu/COVID-19/
Friday, September 11, 2020
mask rcnn, Okinawa group, github
https://github.com/oist/Usiigaci
Usiigaci: Instance-aware cell tracking in stain-free phase contrast microscopy enabled by machine learning
Thursday, September 10, 2020
Wednesday, September 9, 2020
Chattanooga open data
Chattanooga open data
https://www.chattadata.org/
policing and racial equipty
https://www.chattadata.org/stories/s/26bg-ejs3
Liu et al, Genome biology, 2020, Genome-wide studies reveal the essential and opposite roles of ARID1A in controlling human cardiogenesis and neurogenesis from pluripotent stem cells
in virto human embryonic stem cell study. hESC
ARID1A in H9 hESC were deleted using dual guided-RNA mediated CRISP-Cas9 method.
" Mutations in 4 different SWI/SNF subunits including ARID1A/B were identified in three congenital syndromes that include both neural and cardiac defects: Coffin-Siris syndrome (CSS), Nicolaides- Baraitser syndrome (NCBRS), and ARID1B-related intellectual disability (ID) syndrome Patients with these syndromes show severe intellectual deficits as well as cardiac defects such as atrial/ventricular septal defects, patent ductus
arteriosus (PDA), mitral and pulmonary atresia, aortic stenosis, and single right ventricle. These data indicate that abnormal ARID1A activity can lead to defective formation of both the heart and brain in humans. However, the molecular mechanisms by which ARID1A controls human cardiogenesis and neurogenesis still remain. elusive."
in hESC, Surprisingly, knockout-of-ARID1A in hESCs (ARID1A−/−) led to spontaneous neural. differentiation even under pluripotent stem cell culture conditions. Additionally, under conditions of targeted cardiac differentiation, ARID1A−/− hESCs gave rise to robustly increased numbers of neural cells, including neural stem cells and neurons, whereas cardiac differentiation was significantly suppressed
single-cell RNA reveals spontaneous differentiation neural differentiation in ARIA-/- cells.
So, scRNA data are available for WT and KO in cardiac differential and neural differentiation. Based on my understanding of Liu, GB, 2020, there are WT and ARID1A-/- hESC cells, and the hESC cells are induced for cardiac differentation in CDM3 and neural differentation in N2B27 medium. The scRNA results show ARIDA-/- lead to different clusters of ScRNA in WT and KO in both differentation conditions. So, it seems to me that these data sets can be used to contruct weight single-cell gene network to study network control.
The noise levels seem to be good testing data sets on noises and weight in controllability analysis.
https://genomebiology.biomedcentral.com/articles/10.1186/s13059-020-02082-4
Tuesday, September 8, 2020
Shor's factorizing quantum computing algorithm
Shor's factorizing quantum computing algorithm
https://en.wikipedia.org/wiki/Shor%27s_algorithm
A good video: https://youtu.be/lvTqbM5Dq4Q
Shor's video (confusing) https://youtu.be/hOlOY7NyMfs
IBM https://youtu.be/yy6TV9Dntlw
Shor code, quantaum error correction
https://en.wikipedia.org/wiki/Quantum_error_correction#The_Shor_code
Monday, September 7, 2020
student created videos
Learning through videos involved trials and errors, lessons and failure.
https://www.edutopia.org/article/student-created-videos-classroom
making thinking visible
visible learning (this is different from my SLVideo concept).
Friday, September 4, 2020
covid policy data set
https://healthdata.gov/dataset/covid-19-state-and-county-policy-orders
Thursday, September 3, 2020
yeast aging, somatic mutation, Dr. Dang
Dr. Dang Modulus aging model discussion
DNA double strand break. Increased in old cells. Gamma H2A, phophlylation. A double strand breaker marker. Conserved in mammalian cells. It does mark single strand break.
https://www.nature.com/articles/leu20106#:~:text=Phosphorylation%20of%20the%20Ser%2D139,DNA%20damage%20initiation%20and%20resolution.
Yeast only has H2A (not H2AX).
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3889172/#:~:text=In%20Saccharomyces%20cerevisiae%2C%20histone%20H2A,about%2050%20kb%20in%20yeast).
Mitosis crisis, short lived. Otherwise, long-lived. G1-death long0-lived. G2-death short lived.
SIR2-OE give third mode.
Rapamycin can be absolved by microfluidic device matrix, so it has been studied by microfluidics, according to Dang's conversation with N Hao
TOR1Delta and SIR2-OE are additive in RLS, based on Dang’s own data. Hong needs to check Kaberlein lab data sets.