https://depts.washington.edu/uwruca/ruca-approx.php?utm_source=chatgpt.com
This site is to serve as my note-book and to effectively communicate with my students and collaborators. Every now and then, a blog may be of interest to other researchers or teachers. Views in this blog are my own. All rights of research results and findings on this blog are reserved. See also http://youtube.com/c/hongqin @hongqin
https://depts.washington.edu/uwruca/ruca-approx.php?utm_source=chatgpt.com
All of Us survey, data codebooks
https://docs.google.com/spreadsheets/d/1pODkE2bFN-kmVtYp89rtrJg7oXck4Fsex58237x47mA/edit?usp=sharing
The paper "A model for the assembly map of bordism-invariant functors" by Levin, Nocera, and Saunier (2025) develops advanced categorical frameworks for algebraic topology, particularly through oplax colimits of stable/hermitian/Poincaré categories and bordism-invariant functors123. While not directly addressing machine learning (ML) or large language models (LLMs), its contributions could indirectly influence these fields through three key pathways:
The paper's formalization of oplax colimits and Poincaré-Verdier localizing invariants13 provides new mathematical tools for structuring compositional systems. This could advance:
Model Architecture Design: By abstracting relationships between components (e.g., neural network layers) as bordism-invariant functors, enabling more rigorous analysis of model behavior under transformations5.
Geometric Deep Learning: Topological invariants and assembly maps could refine methods for learning on non-Euclidean data (e.g., graphs, manifolds) by encoding persistence of features under deformations5.
The bordism-invariance concept—where structures remain unchanged under continuous deformations—offers a mathematical foundation for invariance principles in ML:
Data Augmentation: Formalizing "bordism equivalence" could guide the design of augmentation strategies that preserve semantic content (e.g., image rotations as "topological bordisms")5.
Robust Feature Extraction: Kernels of Verdier projections13 might model noise subspaces to exclude during feature learning, improving adversarial robustness.
The paper’s explicit decomposition of complex functors (e.g., Shaneson splittings with twists13) parallels challenges in LLM-based reasoning:
Program Invariant Prediction: LLMs that infer program invariants6 could adopt categorical decompositions to handle twisted or hierarchical constraints (e.g., loop invariants in code).
Categorical Data Embeddings: LLM-generated numerical representations of categorical data4 might leverage bordism-invariance to ensure embeddings respect equivalence classes (e.g., "color" as a deformation-invariant attribute).
The work is highly theoretical, with no direct ML/LLM applications in the paper. Bridging this gap requires:
Translating topological bordisms into data-augmentation pipelines.
Implementing Poincaré-Verdier invariants as regularization terms in loss functions.
Extending LLM-based invariant predictors6 to handle categorical assembly maps.
While speculative, these connections highlight how advanced category theory could enrich ML’s theoretical foundations and LLMs’ reasoning capabilities.
In Mendely, if an article has unspecified type, it often list it as "preprint".
To fix this, just change the document type to 'jounral article' or 'conference proceedings' or other appropriate type.
Beyond Attention: Toward Machines with Intrinsic Higher Mental States
https://techxplore.com/news/2025-05-architecture-emulates-higher-human-mental.html#google_vignette
USDA
https://www.ers.usda.gov/data-products/rural-urban-commuting-area-codes?utm_source=chatgpt.com
Based on our discussion, I have looked further into the ‘360’ zipcode region, and found it contain 14 counties below:
counties = [ "Autauga County", "Barbour County", "Bullock County", "Butler County",
"Chilton County", "Coosa County", "Covington County", "Crenshaw County",
"Elmore County", "Lowndes County", "Macon County", "Montgomery County",
"Pike County", "Tallapoosa County"]
Amon them, there are “Barbour”, “Bullock”, and “Macon” counties.
So, not sure how useful the three-digit zip code of ‘360’ might be relevant to TU MCH project.
https://pastebin.com/uBcFUXCA
All of us research platform, data explore for rural health research.
Zip3, observational table
Workspaces > Beginner Intro to AoU Data and the Workbench > Analysis >
In ‘test-state_data_v060523’, found sample on state, county, and zip3.
https://catalog.odu.edu/courses/cs/#graduatecoursestext
https://catalog.odu.edu/courses/dasc/
This course explores the application of AI in health sciences, focusing on machine learning, NLP, computer vision, generative AI techniques for diagnostics, treatment planning, patient monitoring, and biomedical research. It covers precision medicine, ethical AI, and the integration of AI into practice. Students will gain a deep understanding and practical skills to develop innovative AI solutions that address real-world challenges in health sciences.
This course provides a deep dive into the foundations and current advancements in generative AI. It covers key concepts such as transformer models, GANs, VAEs, LLMs, and their applications across various fields, emphasizing both theory and hands-on learning, including ethical considerations such as fairness and bias mitigation. Students will develop a comprehensive understanding of generative AI and gain practical experience.
This course explores the application of AI in health sciences, focusing on machine learning, NLP, computer vision, generative AI techniques for diagnostics, treatment planning, patient monitoring, and biomedical research. It covers precision medicine, ethical AI, and the integration of AI into practice. Students will gain a deep understanding and practical skills to develop innovative AI solutions that address real-world challenges in health sciences.
This course provides a deep dive into the foundations and current advancements in generative AI. It covers key concepts such as transformer models, GANs, VAEs, LLMs, and their applications across various fields, emphasizing both theory and hands-on learning, including ethical considerations such as fairness and bias mitigation. Students will develop a comprehensive understanding of generative AI and gain practical experience.
This course explores the application of AI in health sciences, focusing on machine learning, NLP, computer vision, generative AI techniques for diagnostics, treatment planning, patient monitoring, and biomedical research. It covers precision medicine, ethical AI, and the integration of AI into practice. Students will gain a deep understanding and practical skills to develop innovative AI solutions that address real-world challenges in health sciences.
This course provides a deep dive into the foundations and current advancements in generative AI. It covers key concepts such as transformer models, GANs, VAEs, LLMs, and their applications across various fields, emphasizing both theory and hands-on learning, including ethical considerations such as fairness and bias mitigation. Students will develop a comprehensive understanding of generative AI and gain practical experience.
This course explores the application of AI in health sciences, focusing on machine learning, NLP, computer vision, generative AI techniques for diagnostics, treatment planning, patient monitoring, and biomedical research. It covers precision medicine, ethical AI, and the integration of AI into practice. Students will gain a deep understanding and practical skills to develop innovative AI solutions that address real-world challenges in health sciences.
This course provides a deep dive into the foundations and current advancements in generative AI. It covers key concepts such as transformer models, GANs, VAEs, LLMs, and their applications across various fields, emphasizing both theory and hands-on learning, including ethical considerations such as fairness and bias mitigation. Students will develop a comprehensive understanding of generative AI and gain practical experience.
step-by-step through how the sigmoid function turns a logit into a probability, with both math and plain English at each step.
The logit of a probability is defined as:
This is called the log-odds, because it’s the natural log of the odds.
Let’s say we are given a number , and we know it’s a logit.
That means:
We want to solve for , the probability.
To get rid of the logarithm, we exponentiate both sides:
This tells us the odds in terms of .
Now we solve this equation for .
Start with:
Multiply both sides by :
Distribute the left-hand side:
Now get all the 's on one side:
Factor out on the right-hand side:
Now divide both sides by :
We now simplify that expression:
That’s the sigmoid function:
So if you’re given a logit value , the sigmoid function:
turns it into a probability .
In logistic regression, we compute:
Then use the sigmoid to convert that score (logit) into a probability of class 1:
This way, the model’s output is interpretable as a probability — suitable for binary classification.
Let me know if you want a plot or visualization to make it even clearer!
The 25th IEEE International Conference on Data Mining (IEEE ICDM 2025) includes half- or full-day workshops
https://www3.cs.stonybrook.edu/~icdm2025/cfw.html
https://2025-icdm-ughs.github.io/
Important Dates:
All submission deadlines are end-of-day in the Anywhere on Earth (AoE) time zone.
In the AAAI
AIIDE – Artificial Intelligence and Interactive Digital Entertainment: Focuses on AI in games and entertainment media.
EAAI – Educational Advances in Artificial Intelligence: Emphasizes AI education, especially innovative teaching techniques and curricula.
FLAIRS – Florida Artificial Intelligence Research Society Conference: A regional AI conference that covers a broad spectrum of AI research.
HCOMP – Human Computation and Crowdsourcing: Explores systems that rely on human intelligence, often combined with AI (e.g., crowdsourced labeling).
IAAI – Innovative Applications of Artificial Intelligence: Focuses on real-world applications and deployed AI systems.
ICAPS – International Conference on Automated Planning and Scheduling: Covers planning, scheduling, and decision-making problems.
ICWSM – International Conference on Web and Social Media: Focuses on computational social science and data-driven studies of social media.
KR – Principles of Knowledge Representation and Reasoning: Covers formal representations of knowledge and reasoning mechanisms.
UAI – Uncertainty in Artificial Intelligence: Emphasizes probabilistic reasoning, Bayesian networks, and managing uncertainty in AI systems.
ODURF project reports -> Project ID -> transaction + view report.
https://hera.odurf.odu.edu/RFPortal/Reports/
you can pay with a credit card and seek reimbursement using this form here: Consultant-Honorarium-Reimbursement-Form.pdf