Tuesday, November 26, 2019

REU projects


REU
mixture Gompertz, network aging fitting

RLS deep learning, prediction


Friday, November 22, 2019

GPU request for data science

RE: GPU workstation for enhancing experiential learning of artificial intelligence in MSDA

We would like to request a Linux GPU workstation to enrich students' experiential learning of artificial intelligence (AI) in the Master of Science in Data Analytics (MSDA) program. GPU-based deep learning methods are the state-of-the-art AI methods in data science. Training deep-learning models on real-world big data is time-consuming on CPUs or low-end GPUs. The lack of GPU computing power has kept many UTC students from applying deep learning methods to the kind of big data typical of the business world. The proposed Linux workstation will improve GPU access for students in several MSDA courses, including CPSC 5440 Introduction to Machine Learning, CPSC 5180 Programming Languages for Advanced Data Analytics, CPSC 5530 Data Visualization and Exploration, and CPSC 5240 Principles of Data Analytics.


Precision 7920 Tower Workstation

Intel Xeon Gold 6130 2.1GHz, 3.7GHz Turbo, 16C, 10.4GT/s 3UPI, 22MB Cache, HT (125W) DDR4-2666

Windows 10 Pro for Workstations (4 Cores Plus) Multi - English, French, Spanish

NVIDIA® Quadro® P2000, 5GB, 4 DP (7X20T)

32GB 4x8GB DDR4 2666MHz RDIMM ECC

3.5" 2TB 7200rpm SATA Hard Drive

$4,699.00

https://www.dell.com/en-us/work/shop/desktops-all-in-one-pcs/precision-7920-tower-workstation/spd/precision-7920-workstation/xctopt7920us_3

https://www.dell.com/al/business/p/precision-desktops?~ck=bt

Deep learning-based projects are popular choices for many undergraduate and graduate students. Almost all students in CPSC 4180/5180 chose deep-learning-related course projects.

Some of my students are having trouble getting their deep learning models implemented and trained, given the limited GPU computing resources and support we have. If the CSE department had its own Linux GPU workstations, our students could be more efficient and productive. Given that we expect more and more data science MS students, increasing GPU computing support seems strategically important for both education and research.

The need for machine learning and artificial intelligence is reflected in the recent Blue Sky initiative in our department and in our new joint Data Analytics program with the College of Business. GPU-based deep learning is an important skill that our students should be trained in for their future employability. To provide experiential learning to students in our department and the college, we need to offer state-of-the-art deep learning training in artificial intelligence. Even with current cloud and virtual machine technology, a GPU is still hard-linked to the physical machine behind any virtual machine. So, to give our students more GPU learning experience, we literally need to purchase more GPU hardware. It does not matter whether these GPUs are hosted in a cloud or in workstations because, to the best of our knowledge, GPUs cannot be virtualized. Because training deep-learning models on real-world data typically requires long computing times, dedicated Linux nodes or workstations are the most practical way to let students use real-world data in deep-learning projects. Our computers in 312 can be used by students to analyze toy data, but they are not sufficient for real-world data sets. In short, to provide real-world experiential learning of AI to our students, we need to provide them with the necessary GPU hardware.





CPSC 5180 Programming Languages for Advanced Data Analytics
CPSC 5200 Automata, Complexity, and Computability
CPSC 5210 Design and Analysis of Computer Algorithms
CPSC 5230 Decision Support and Business Intelligence
CPSC 5240 Principles of Data Analytics
CPSC 5250 Medical Informatics
CPSC 5260 Introduction to Parallel Algorithms
CPSC 5270 Advanced Database and Database Security
CPSC 5400 Topics in Simulation
CPSC 5410 Model Analysis and Simulation
CPSC 5420 Programming with SAS
CPSC 5440 Introduction to Machine Learning
CPSC 5450 Advanced Topics in Artificial Intelligence
CPSC 5460 Pattern Recognition
CPSC 5500 Computer Graphics Applications and Algorithms
CPSC 5510 Advanced Computer Graphics
CPSC 5530 Data Visualization and Exploration
CPSC 5560 Computer Data Communications
CPSC 5570 Internetworking
CPSC 5580 Software Defined Networks
CPSC 5590 Advanced Computer Networks
CPSC 5600 Advanced Biometrics and Cryptography
CPSC 5610 Advanced Information Security Management
CPSC 5620 Computer Network Security
CPSC 5640 Internet Security Protocols
CPSC 5660 System Vulnerability Analysis and Auditing
CPSC 5680 Computer Forensics
CPSC 5700 Advanced Computer Architecture
CPSC 5710 Microcomputer Systems Architecture
CPSC 5720 Real-Time Embedded Systems
CPSC 5800 Advanced Topics in Systems Software
CPSC 5820 Legacy Computing Systems

advantage of temporal networks

Li, ..., Barabasi, Science, 2017,
temporal network advantages.

Energy needed to go from initial state vector x0 to final state xf:
  E(x0, xf) = (1/2) d^T W_eff^{-1} d

where W_eff encodes the energy structure of the network.
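
A minimal sketch of how the energy expression can be evaluated numerically, assuming it is the usual minimum-energy quadratic form in the inverse of W_eff and that d is an already-computed difference vector (the notes do not define d, and the S1.1 construction of W_eff is not reproduced here). The function names and the 2x2 numbers are made up for illustration. Instead of forming the inverse explicitly, the code solves W_eff y = d and returns (1/2) d^T y.

```python
from typing import List, Sequence


def solve_linear(W: Sequence[Sequence[float]], d: Sequence[float]) -> List[float]:
    """Solve W y = d by Gaussian elimination with partial pivoting."""
    n = len(d)
    # Augmented matrix [W | d], copied so the inputs are not modified.
    aug = [list(map(float, W[i])) + [float(d[i])] for i in range(n)]
    for col in range(n):
        # Pick the row with the largest entry in this column as the pivot.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("W_eff is singular (or numerically close to it)")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        # Eliminate the entries below the pivot.
        for row in range(col + 1, n):
            factor = aug[row][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[row][k] -= factor * aug[col][k]
    # Back substitution on the upper-triangular system.
    y = [0.0] * n
    for row in range(n - 1, -1, -1):
        tail = sum(aug[row][k] * y[k] for k in range(row + 1, n))
        y[row] = (aug[row][n] - tail) / aug[row][row]
    return y


def control_energy(W_eff: Sequence[Sequence[float]], d: Sequence[float]) -> float:
    """Return (1/2) d^T W_eff^{-1} d, assuming W_eff is invertible."""
    y = solve_linear(W_eff, d)  # y = W_eff^{-1} d
    return 0.5 * sum(di * yi for di, yi in zip(d, y))


# Hypothetical example: a diagonal W_eff, so the inverse is easy to check by hand.
example_energy = control_energy([[2.0, 0.0], [0.0, 0.5]], [1.0, 1.0])
print(example_energy)  # 0.5 * (1/2 + 2) = 1.25
```

In this toy case the direction with the smaller W_eff entry (0.5) contributes most of the energy, which is one way to read W_eff as encoding the energy structure.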

I did not follow S1.1 method




logical puzzles


Logical puzzle YouTube, jellologic
https://www.youtube.com/watch?v=L_eTNclIKbQ

https://www.google.com/imgres?imgurl=https%3A%2F%2Fwww.woojr.com%2Fwp-content%2Fuploads%2F2018%2F08%2Fdifficult-logic-puzzle-kids-232x300.jpg&imgrefurl=https%3A%2F%2Fwww.woojr.com%2Fprintable-logic-puzzles-for-kids%2F&docid=s2zU7g6K-OmQ4M&tbnid=3r4TSXUQqlMTtM%3A&vet=10ahUKEwiLx4rxzf7lAhXOxFkKHbeTD64QMwhUKAcwBw..i&w=232&h=300&bih=852&biw=1870&q=logic%20puzzle%20examples%20with%20answers&ved=0ahUKEwiLx4rxzf7lAhXOxFkKHbeTD64QMwhUKAcwBw&iact=mrc&uact=8#h=300&imgdii=l2hOqZ9Mt4HwDM:&vet=10ahUKEwiLx4rxzf7lAhXOxFkKHbeTD64QMwhUKAcwBw..i&w=232


Monday, November 18, 2019

clonal haematopoiesis

Nature. 2018 Jul;559(7714):350-355. doi: 10.1038/s41586-018-0321-x. Epub 2018 Jul 11.

Insights into clonal haematopoiesis from 8,342 mosaic chromosomal alterations

The alterations were uncovered in blood-derived DNA from 151,202 UK Biobank participants using phase-based computational techniques (estimated false discovery rate, 6-9%).

This seems to be the postdoctoral work of the first author, Loh.

Saturday, November 16, 2019

remove large Google Drive File Stream cache

uninstall Google Drive File Stream

cd ~hqin/Library/Application\ Support/Google
mv DriveFS DriveFS.old

reinstall Google Drive File Stream.

restart computer
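
The rename step above can also be scripted. This is only a sketch: the helper name set_aside_cache is hypothetical, and the DriveFS location under Application Support is taken from the note above and may differ on other machines. It refuses to overwrite an existing backup, and it should only be run after File Stream has been uninstalled or quit.

```python
from pathlib import Path


def set_aside_cache(cache_dir: Path, suffix: str = ".old") -> Path:
    """Rename cache_dir to cache_dir + suffix and return the new path."""
    backup = cache_dir.with_name(cache_dir.name + suffix)
    if not cache_dir.is_dir():
        raise FileNotFoundError(f"no cache directory at {cache_dir}")
    if backup.exists():
        # Do not clobber an earlier backup; let the user decide what to do.
        raise FileExistsError(f"{backup} already exists; remove or rename it first")
    cache_dir.rename(backup)
    return backup


# Assumed macOS location, following the note above. Uncomment to run for real:
# drivefs = Path.home() / "Library" / "Application Support" / "Google" / "DriveFS"
# print(set_aside_cache(drivefs))
```

Once the reinstalled File Stream is working, the DriveFS.old directory can be deleted to reclaim the disk space.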


Tuesday, November 5, 2019

*** Qin lab funding acknowledgments


For HYSAA:
We acknowledge the support of NSF Career award #1453078 and #1720215, BD Spoke #1761839, and internal support of the University of Tennessee at Chattanooga.


For yeast aging:

We acknowledge the support of NSF Career award #1453078 and #1720215, BD Spoke #1761839, REU #1852042, and internal support of the University of Tennessee at Chattanooga.

For REU: 
REU #1852042



For Machine Learning
We acknowledge the support of NSF Career award #1453078 and #1720215, BD Spoke #1761839, and internal support of the University of Tennessee at Chattanooga. TP and DM acknowledge the support of a DoD capacity building grant.

Cody: We acknowledge the support of NSF Career award #1453078 and #1720215, BD Spoke #1761839, and internal support of the University of Tennessee at Chattanooga.

Syed: BD Spoke #1761839

Allison: NSF Career award #1453078 and #1720215, BD Spoke #1761839



Monday, November 4, 2019

NIH R25 aging and undergraduate education



NIA MSTEM: Advancing Diversity in Aging Research through Undergraduate Education (R25)


word2vec natural language processing


Word2Vec — a baby step in Deep Learning but a giant leap towards Natural Language Processing



https://medium.com/explore-artificial-intelligence/word2vec-a-baby-step-in-deep-learning-but-a-giant-leap-towards-natural-language-processing-40fe4e8602ba

NIH data harmonization


Data Harmonization, Curation and Secondary Analysis of Existing Clinical Datasets (R61/R33 Clinical Trial Not Allowed)

https://grants.nih.gov/grants/guide/rfa-files/RFA-NS-20-007.html

Examples of potential research topics include, but are not limited to:
  • Validation of diagnostic and/or prognostic models of outcome
  • Comparative effectiveness hypotheses
  • Mediation analyses of biological, cultural and environmental factors that affect treatment response or course of disease
  • Evaluation of biomarker validity (including clinical outcome assessments (COAs)), in new populations and/or new context of use
  • Discovery or validation of multi-domain clinical and/or biological measures for diagnosis, prognosis and/or treatment response using existing genetic and biological samples along with clinical and physiological assessments
  • Extended characterization or validation of natural history disease course
  • Novel methods for improving patient stratification
Examples of research that this RFA will not support include:
  • Studies using non-human animal models
  • Studies using data from a single clinical research study
  • Multiple single-site clinical research studies
  • Studies of harmonization, curation, and analysis where the primary data source is Electronic Health Record (EHR) data
  • Studies of harmonization, curation, and analysis where the primary data source is metadata