Open-source deep-learning software for bioimage segmentation
https://carpenterlab.broadinstitute.org/files/anne/files/mbc.e20-10-0660.pdf
This site is to serve as my note-book and to effectively communicate with my students and collaborators. Every now and then, a blog may be of interest to other researchers or teachers. Views in this blog are my own. All rights of research results and findings on this blog are reserved. See also http://youtube.com/c/hongqin @hongqin
Ready 4 R
https://ready4r.netlify.app/schedule/
change point analysis can be applied.
https://www.nature.com/articles/s41598-017-19067-2
It seems that regression was discussed by treating the event as an intervention. So, a before-and-after t-test can be used.
For covid19 analysis, we can co-integrate deaths ~ mobility with a window around holiday events.
single cell gene-networks, topological robustness of gene networks, cellular life in different tissues, blood versus neurons
https://tabula-microcebus-cellxgene.ds.czbiohub.org/all/
When the random walk is linear, cross-correlation over the time lag gives only a gradual trend. When a periodic time series is given, cross-correlation gives an obvious cycling effect.
see https://github.com/hongqin/cointegration-sandbox/blob/main/random-perioditic-walk.pdf
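A minimal sketch of the contrast noted above, using toy series that I made up (a shared linear trend plus noise, and a sine wave with a period of 20 steps). It is not the code from the linked sandbox repository, only an illustration of lagged Pearson correlation with the standard library.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def cross_correlation(x, y, max_lag):
    """Correlation between x[t] and y[t + lag] for lag = 0..max_lag."""
    n = len(x)
    return [pearson(x[: n - lag], y[lag:]) for lag in range(max_lag + 1)]

random.seed(0)
n = 200
# Toy pair 1 (assumption): a shared linear trend plus independent noise.
trend_x = [0.1 * t + random.gauss(0, 1) for t in range(n)]
trend_y = [0.1 * t + random.gauss(0, 1) for t in range(n)]
# Toy pair 2 (assumption): the same sine wave with a period of 20 steps.
period = 20
wave = [math.sin(2 * math.pi * t / period) for t in range(n)]

cc_trend = cross_correlation(trend_x, trend_y, 30)
cc_wave = cross_correlation(wave, wave, 30)
for lag in (0, 10, 20, 30):
    print(lag, round(cc_trend[lag], 2), round(cc_wave[lag], 2))
```

In this toy setup the trend pair stays highly correlated at every lag, while the periodic pair swings between strongly negative (half a period) and strongly positive (a full period).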
If you have unit roots in your time series, a series of successive differences, d, can transform the time series into one with stationarity. The differences are denoted by I(d), where d is the order of integration. Non-stationary time series that can be transformed in this way are called series integrated of order d. Usually, the order of integration is either I(0) or I(1); it’s rare to see values for d that are 2 or more.
From: https://www.statisticshowto.com/order-of-integration/
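A small sketch of what the order of integration means in practice, on a toy random walk of my own (±1 steps, not real data): differencing once recovers the stationary increments of an I(1) series, and a twice-integrated series needs d = 2.

```python
import random
from itertools import accumulate

def difference(series, d=1):
    """Apply successive differencing d times."""
    for _ in range(d):
        series = [b - a for a, b in zip(series, series[1:])]
    return series

random.seed(1)
steps = [random.choice([-1, 1]) for _ in range(500)]  # stationary increments, I(0)
walk = list(accumulate(steps))                        # random walk, I(1)
walk2 = list(accumulate(walk))                        # integrated twice, I(2)

print(difference(walk, 1) == steps[1:])    # True
print(difference(walk2, 2) == steps[2:])   # True
```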
"Thus in theory you can test for cointegration either between
Reference:
https://stats.stackexchange.com/questions/285582/cointegration-with-lagged-variables
So, the lag is best analyzed from cross correlation analysis.
Co-dominant neutralizing epitopes make anti-measles immunity resistant to viral evolution
https://www.cell.com/action/showPdf?pii=S2666-3791%2821%2900073-2
Measles elicits a polyclonal antibody response against multiple epitopes, so antigenic shift has a very small chance of escaping immunity.
SARS-CoV-2 and influenza elicit a focused antibody response, so antigenic shift has a higher chance.
Theoretically, a vaccine that elicits a poly-epitope response would be a better vaccine.
PROGRAMMING QUANTUM COMPUTERS: A PRIMER WITH IBM Q AND D-WAVE EXERCISES
https://sites.google.com/ncsu.edu/qc-tutorial/
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6930685/
CVPR 2020, The 2nd Tutorial on Learning Representations via Graph-structured Networks
Slides and recorded videos are provided on this webpage. Sunday afternoon (1PM - 4:30PM PDT), June 14, 2020
https://xiaolonw.github.io/graphnnv2/
ignatieva
https://www.biorxiv.org/content/10.1101/2021.01.21.427579v1.full.pdf
Classification: Yeast deletion with CR effect: extend or shorten lifespan
Input: double deletion genetic interactions
neural networks: DCell or a hypothesis-based-graph model
Oldies but goldies: A. Barron, Universal Approximation Bounds for Superpositions of a Sigmoidal Function, 1993. Proves that 1 hidden layer perceptrons break the curse of dimensionality to approximate a class of smooth functions. https://en.wikipedia.org/wiki/Universal_approximation_theorem… https://en.wikipedia.org/wiki/Multilayer_perceptron
https://twitter.com/gabrielpeyre/status/1384371246461329409
zoom
Socrative
utc anonymous survey
deep learning illustrated
Student interested in research.
key concepts:
loss function
activation function
regularization
epoch
There are several GitHub repos with intrusion detection code:
https://github.com/
https://github.com/cstub/ml-
https://github.com/rambasnet/
https://github.com/
We only need to pick 2 of these methods that work for us.
There is a Kaggle competition on intrusion detection; it provides training and testing data at
https://www.kaggle.com/c/
For the MS thesis, Artis may try two ML methods on the Kaggle data set and compare their performance, which would be good for your thesis. You can start by trying to run the GitHub sample code.
GitHub
https://raw.githubusercontent.com/CynthiaKoopman/Network-Intrusion-Detection/master/KDDTrain%2B_2.csv
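A rough sketch of the "try two methods and compare" workflow suggested above. Everything here is an assumption for illustration: the data are synthetic 2-D points standing in for normal (label 0) and attack (label 1) records, not the Kaggle or KDD data, and the two methods (nearest centroid and 1-nearest neighbor) are simple stand-ins for whichever two ML methods are picked.

```python
import math
import random

def make_data(n_per_class, rng):
    """Synthetic stand-in for traffic records: two 2-D Gaussian clusters."""
    data = []
    for label, center in ((0, (0.0, 0.0)), (1, (4.0, 4.0))):
        for _ in range(n_per_class):
            point = (rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0))
            data.append((point, label))
    rng.shuffle(data)
    return data

def nearest_centroid(train):
    """Method 1: classify by the closest class mean."""
    centroids = {}
    for label in {lab for _, lab in train}:
        pts = [p for p, lab in train if lab == label]
        centroids[label] = tuple(sum(c) / len(pts) for c in zip(*pts))
    return lambda p: min(centroids, key=lambda lab: math.dist(p, centroids[lab]))

def one_nearest_neighbor(train):
    """Method 2: classify by the label of the closest training point."""
    return lambda p: min(train, key=lambda item: math.dist(p, item[0]))[1]

def accuracy(predict, test):
    """Fraction of test points whose predicted label matches the true label."""
    return sum(predict(p) == lab for p, lab in test) / len(test)

rng = random.Random(42)
data = make_data(150, rng)
train, test = data[:200], data[200:]   # same split for both methods
results = {
    "nearest centroid": accuracy(nearest_centroid(train), test),
    "1-nearest neighbor": accuracy(one_nearest_neighbor(train), test),
}
for name, acc in results.items():
    print(f"{name}: accuracy = {acc:.2f}")
```

The point is the shape of the comparison: one fixed train/test split, two classifiers, one shared metric. With the real data set, the feature loading and the methods would be replaced.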
Some of the best multi-view data are at NCBI GTEx site.
https://www.gtexportal.org/home/datasets
These data sets, however, are quite complicated and need substantial analysis before they can be fed into deep learning models.
nano -w GTEx_Analysis_2017-06-05_v8_RNASeQCv1.1.9_gene_reads.gct
This file seems to show Ensembl gene ids and counts
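A minimal reader for this kind of file, assuming it follows the GCT v1.2 layout (a version line, a line with the number of rows and columns, then a tab-separated table whose first two columns are Name and Description). The toy rows, gene names, and sample names below are made up; read_gct is a hypothetical helper, not part of any GTEx tooling.

```python
import csv
import io

def read_gct(handle):
    """Parse a GCT v1.2 table into (sample_ids, {gene_id: (description, counts)})."""
    version = handle.readline().strip()            # first line, e.g. "#1.2"
    if not version.startswith("#1."):
        raise ValueError("not a GCT v1.x file")
    n_genes, n_samples = map(int, handle.readline().split()[:2])
    reader = csv.reader(handle, delimiter="\t")
    header = next(reader)                          # Name, Description, sample ids...
    samples = header[2:]
    genes = {}
    for row in reader:
        if not row:
            continue                               # skip blank lines
        genes[row[0]] = (row[1], [float(v) for v in row[2:]])
    if len(genes) != n_genes or len(samples) != n_samples:
        raise ValueError("dimensions do not match the GCT header")
    return samples, genes

# Made-up toy content in the assumed layout: 2 genes x 3 samples.
toy = (
    "#1.2\n"
    "2\t3\n"
    "Name\tDescription\tS1\tS2\tS3\n"
    "GENE_A.1\tgeneA\t0\t5\t12\n"
    "GENE_B.2\tgeneB\t7\t0\t3\n"
)
samples, genes = read_gct(io.StringIO(toy))
print(samples, genes["GENE_A.1"])
```

For the real file, the same function could be called on open(path, newline=""); since the GTEx table is large, reading only the needed rows may be preferable.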
plotting with pylab.
Subplot position is a bit tricky. Line type is 'r--' (red dashed line).
2:20pm ->
cipher coding review
https://www.scholastic.com/pathways/techlab/index.html
The International Society for Computational Biology is pleased to announce the HPC-AI Advisory Council (HPCAIAC) and National Supercomputing Centre (NSCC) Singapore 2021 APAC HPC-AI Competition.
High-performance computing and artificial intelligence are the most essential tools fueling the advancement of science. In order to handle the ever-growing demands for higher computation performance and the increase in the complexity of research problems, the world of scientific computing continues to re-innovate itself at a fast pace.
The competition encourages international teams in the APAC region to showcase their HPC and AI expertise in a friendly yet spirited competition that builds critical skills, professional relationships, competitive spirit and lifelong camaraderie.
To become part of a team – register here - http://www.hpcadvisorycouncil.
encoder, imputing,
Use a GAN to generate a diverse data set, e.g. start from data on European people and use the GAN to generate under-represented data.
There are things that we know we don't know, and there are things that we do not know we don't know. Calibrate, test, re-calibrate.
race, socioeconomic status, lifestyle,
convert Shor's algorithm:
https://qiskit.org/textbook/ch-algorithms/shor.html
Shor's algorithm uses some kind of transformation (the quantum Fourier transform, for period finding) for prime number factorization, and starts from a good guess, to my intuitive understanding.
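To make the "guess plus period" idea concrete, here is a classical sketch of the number-theoretic part only. The period (order) is found by brute force here, which is the step a quantum computer would speed up; the function names are my own, and the guess a is assumed to satisfy 1 < a < n.

```python
from math import gcd

def find_order(a, n):
    """Smallest r > 0 with a**r % n == 1, by brute force (the quantum step in Shor)."""
    r, value = 1, a % n
    while value != 1:
        value = (value * a) % n
        r += 1
    return r

def factor_from_guess(n, a):
    """Try to split n using the guess a; return a factor pair or None."""
    g = gcd(a, n)
    if g != 1:
        return tuple(sorted((g, n // g)))   # lucky guess already shares a factor
    r = find_order(a, n)
    if r % 2 == 1:
        return None                         # odd order: try another guess
    half = pow(a, r // 2, n)
    if half == n - 1:
        return None                         # trivial square root: try another guess
    p = gcd(half - 1, n)
    return tuple(sorted((p, n // p)))

print(factor_from_guess(15, 7))   # (3, 5)
```

For n = 15 and the guess a = 7, the order is 4, so 7**2 % 15 = 4 and gcd(4 - 1, 15) = 3 gives the factor.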
controlling edge dynamics in complex networks
https://www.nature.com/articles/nphys2327
Nepusz, Vicsek
"We also find that transcriptional regulatory networks are particularly easy to control. Analytic calculations show that networks with scale-free degree distributions have better controllability properties than uncorrelated networks, and positively correlated in- and out-degrees enhance the controllability of the proposed dynamics."
Qin: interesting point that positively correlated in- and out-degrees enhance controllability. Any implications for the evolution of biological networks?
https://book-wright-ma.github.io/
My lab has multiple PhD positions open. Research directions are in Data Science, machine learning, and biomedical big data. One research direction is to develop multi-view deep learning neural networks to integrate heterogeneous genomics data sets to predict aging and diseases. The second research direction is to develop MASK-RCNN models to detect and quantify cell objects, and develop graph-based algorithms to infer cell division events. The third research direction is to apply algebraic graph theory and develop deep-learning methods for single-cell genomics data analysis. Lab GitHub projects can be seen at github.com/hongqin
Please contact hong-qin@utc.edu or qinstat@gmail.com with your resume, transcripts, personal statement, and references.
https://www.utc.edu/apply/
Select semester "Spring 2022"
create an account
In Enrollment, select "PhD Computational Science: Computer Science".
https://towardsdatascience.com/analyzing-and-interpreting-data-from-rating-scales-d169d66211db
utc course evaluation
Poker example with video recording? (Poker card does not have zero, I used Joker card instead).
Power point example?
For selection sort, my poker card demo and Python output are not consistent, likely due to the implementation of the inner loop.
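One possible explanation, sketched with two hypothetical inner-loop variants (neither is necessarily the code used in class): finding the minimum first and swapping once per pass, versus swapping whenever a smaller card is seen. Both end up sorted, but the hands after each pass can differ.

```python
def selection_sort_min_then_swap(cards):
    """Find the smallest remaining card, then swap once per pass."""
    a, trace = list(cards), []
    for i in range(len(a) - 1):
        smallest = i
        for j in range(i + 1, len(a)):
            if a[j] < a[smallest]:
                smallest = j
        a[i], a[smallest] = a[smallest], a[i]
        trace.append(list(a))               # state of the hand after this pass
    return a, trace

def selection_sort_swap_as_you_go(cards):
    """Swap inside the inner loop whenever a smaller card is seen."""
    a, trace = list(cards), []
    for i in range(len(a) - 1):
        for j in range(i + 1, len(a)):
            if a[j] < a[i]:
                a[i], a[j] = a[j], a[i]
        trace.append(list(a))               # state of the hand after this pass
    return a, trace

cards = [3, 2, 1]
print(selection_sort_min_then_swap(cards)[1])    # [[1, 2, 3], [1, 2, 3]]
print(selection_sort_swap_as_you_go(cards)[1])   # [[1, 3, 2], [1, 2, 3]]
```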
I used bisection search as a reverse analogy for merge sort. However, bisection search is O(log2(n)), whereas merge sort is O(n log2(n)).
poker cards, unorganized, organized, search for heart ace.
bisection search
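A sketch that counts the work in both procedures on a 52-card deck, to show the log2(n) versus n log2(n) gap. Encoding the cards as the numbers 1..52 is my own assumption for the demo.

```python
import random

def bisection_search(sorted_cards, target):
    """Return (index or None, number of halving steps)."""
    lo, hi, steps = 0, len(sorted_cards) - 1, 0
    while lo <= hi:
        steps += 1
        mid = (lo + hi) // 2
        if sorted_cards[mid] == target:
            return mid, steps
        if sorted_cards[mid] < target:
            lo = mid + 1
        else:
            hi = mid - 1
    return None, steps

def merge_sort(cards):
    """Return (sorted list, number of comparisons made while merging)."""
    if len(cards) <= 1:
        return list(cards), 0
    mid = len(cards) // 2
    left, c_left = merge_sort(cards[:mid])
    right, c_right = merge_sort(cards[mid:])
    merged, i, j, comparisons = [], 0, 0, 0
    while i < len(left) and j < len(right):
        comparisons += 1
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    merged.extend(left[i:])
    merged.extend(right[j:])
    return merged, c_left + c_right + comparisons

random.seed(3)
deck = list(range(1, 53))        # 52 cards encoded as 1..52 (hypothetical encoding)
random.shuffle(deck)             # unorganized deck
ordered, comparisons = merge_sort(deck)
index, steps = bisection_search(ordered, 1)   # e.g. card 1 stands for the ace of hearts
print("search steps:", steps, "sort comparisons:", comparisons)
```

Searching the organized deck takes at most 6 halving steps, while sorting the unorganized deck takes on the order of 52 * 6 comparisons.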
https://www.nature.com/articles/s41467-021-22168-2
RNA was extracted and sequenced from muscle biopsies collected from 53 healthy individuals (22–83 years old) of the GESTALT study of the National Institute on Aging–NIH
1. The cost function should be expressible as an average (or sum) over the costs of individual samples.
2. The cost function should not depend on any activation values other than those of the output layer, which is required for back-propagation to work.
https://stats.stackexchange.com/questions/154879/a-list-of-cost-functions-used-in-neural-networks-alongside-applications
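A tiny sketch of the two properties using the quadratic cost as an example. The output and target values are made up.

```python
def quadratic_cost_per_sample(output, target):
    """Cost of one sample; depends only on the output-layer values and the target."""
    return 0.5 * sum((o - t) ** 2 for o, t in zip(output, target))

def total_cost(outputs, targets):
    """The total cost is the average of the per-sample costs."""
    costs = [quadratic_cost_per_sample(o, t) for o, t in zip(outputs, targets)]
    return sum(costs) / len(costs)

# Made-up network outputs and one-hot targets for two samples.
outputs = [[0.9, 0.1], [0.2, 0.8]]
targets = [[1.0, 0.0], [0.0, 1.0]]
print(total_cost(outputs, targets))
```

Hidden-layer activations never appear as arguments, which is the second property.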
https://youtu.be/TrdevFK_am4
A transformer is a self-attention model, which seems to be my gene-network model based on gene expression!!!
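A bare-bones sketch of scaled dot-product self-attention, to make the analogy concrete. Treating rows as genes and columns as expression features is my own framing here, the numbers are made up, and the learned projections of a real transformer are left out.

```python
import math

def softmax(row):
    """Numerically stable softmax of a list of scores."""
    m = max(row)
    exps = [math.exp(v - m) for v in row]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(x):
    """Scaled dot-product self-attention with queries = keys = values = x.

    A real transformer first applies learned projections W_Q, W_K, W_V;
    they are omitted in this sketch.
    """
    d = len(x[0])
    scores = [[sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in x] for q in x]
    weights = [softmax(row) for row in scores]   # one row of attention weights per item
    out = [[sum(w * v[c] for w, v in zip(row, x)) for c in range(d)] for row in weights]
    return weights, out

# Made-up expression profiles for three genes (rows) over two features (columns).
expression = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
weights, mixed = self_attention(expression)
for row in weights:
    print([round(w, 2) for w in row])
```

The weight matrix can be read as a dense, weighted gene-by-gene graph: each row sums to 1, and genes with similar profiles attend to each other more strongly.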
For herbarium and plant images, segmented leaves, bark, and flowers can be separated into views, which can then be fed into feature extraction layers for features such as shapes and vines, followed by typical neural networks or graph networks, and multi-task prediction of family, genus, and species.
Multi-task training makes sense here because the predicted family, genus, and species are hierarchical by nature.
https://en.wikipedia.org/wiki/Species
Q: Is ImageNet classification inherently multi-task?
Vision transformers with attention: an image is worth 16x16 words
https://arxiv.org/abs/2010.11929
multitask deep learning on yeast fitness and lifespan, morphology, integrated learning
Multi-task learning on similar tasks can mitigate missing data. This is in contrast to transfer learning.
basically, a vector output for multiple outcomes,
https://www.partek.com/webinar-registration-follow-up/
cpsc 4180
https://youtu.be/TEpD7aP9m3I
https://youtu.be/qcEjJGjnIcA
https://youtu.be/maK7uYSK2Bk
https://youtu.be/gTAnRNaIBps
https://youtu.be/zrZXhRmGaMU
cpsc5180
Zoom recording,
// no Breakout room in spring 2021
* Basic running time types:
Constant, linear, log-linear, quadratic, polynomial, exponential.
* Dominant terms in Big O notation (additions and subtractions)
* law of multiplication
Why can recursion be exponential?
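One way to answer the recursion question in class is to count the calls made by the naive recursive Fibonacci function. This is a sketch of my own choosing; other recursive functions, such as bisection search, are not exponential.

```python
def fib_calls(n):
    """Return (fib(n), number of calls made by the naive recursion)."""
    if n < 2:
        return n, 1
    a, calls_a = fib_calls(n - 1)
    b, calls_b = fib_calls(n - 2)
    return a + b, calls_a + calls_b + 1   # two sub-calls plus this call

for n in (5, 10, 15, 20):
    value, calls = fib_calls(n)
    print(n, value, calls)
# 5 5 15
# 10 55 177
# 15 610 1973
# 20 6765 21891
```

Each call spawns two more calls, so the number of calls grows roughly by a constant factor for every increment of n.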
socrative test on linear, log-linear, quadratic etc // converted to Canvas multiple-choice questions
TODO: in unit 13 plot,
review computational complexity with plot
with plotting from unit 13plotting.ipynb.
# poker cards as an example, organized vs unorganized for search and sort.