
Wednesday, October 18, 2023

contrastive learning

 

https://builtin.com/machine-learning/contrastive-learning

Contrastive learning is a form of self-supervised learning.
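A minimal numpy sketch of an InfoNCE-style contrastive loss between two augmented "views" of the same batch. The function name, data, and temperature are illustrative assumptions, not taken from the linked article.

```python
import numpy as np

def info_nce_loss(z1, z2, temperature=0.5):
    """z1, z2: (batch, dim) embeddings of two views of the same examples."""
    # L2-normalize so the dot product is cosine similarity
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / temperature          # (batch, batch) similarity matrix
    # The matching pair (i, i) is the positive; other columns act as negatives.
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))

rng = np.random.default_rng(0)
z1 = rng.normal(size=(8, 16))
z2 = z1 + 0.05 * rng.normal(size=(8, 16))     # "positive" view: small perturbation
print(info_nce_loss(z1, z2))
```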



Tuesday, September 27, 2022

Thursday, August 25, 2022

Federated learning (book)

Book series

Federated Learning (FL) requires an aggregator and parties to exchange model updates. (Page 285)

The exchanged model updates are vulnerable to the inference of private data.

System entities of an FL system

The term "attack surface" refers to the exposed parameters and data.

Defenses against data leakage

FL-specific attacks often exploit the information transmitted during FL training.

Differential privacy: can be applied at the party side or at the aggregator side.
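A minimal sketch (not from the book) of federated averaging with per-party clipping and Gaussian noise added by the aggregator, as one way to realize aggregator-side differential privacy. The clipping bound and noise scale are made-up values, not calibrated to a formal (epsilon, delta) budget.

```python
import numpy as np

def aggregate(party_updates, clip_norm=1.0, noise_std=0.1, rng=None):
    """Average party model updates, then add aggregator-side Gaussian noise."""
    if rng is None:
        rng = np.random.default_rng(0)
    clipped = []
    for u in party_updates:
        norm = np.linalg.norm(u)
        clipped.append(u * min(1.0, clip_norm / (norm + 1e-12)))  # bound each party's influence
    avg = np.mean(clipped, axis=0)
    return avg + rng.normal(scale=noise_std, size=avg.shape)       # noise at the aggregator

updates = [np.random.default_rng(i).normal(size=5) for i in range(3)]
print(aggregate(updates))
```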

For healthcare data and personal information, there are regulatory and compliance requirements [14, 63].


page 285: In FL, training data is not explicitly shared. 

§13.3.1 Secure Aggregation

Tuesday, December 14, 2021

Saturday, November 27, 2021

training set, validation set, and test set for machine learning / deep learning

 

from: https://machinelearningmastery.com/difference-test-validation-datasets/

– Training set: A set of examples used for learning, that is to fit the parameters of the classifier.

– Validation set: A set of examples used to tune the parameters of a classifier, for example to choose the number of hidden units in a neural network.

– Test set: A set of examples used only to assess the performance of a fully-specified classifier.
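A small sketch of one common way to carve a dataset into the three sets defined above, here as a 70/15/15 split of index sets; the proportions and seed are illustrative choices.

```python
import numpy as np

def split_indices(n, frac_train=0.7, frac_val=0.15, seed=0):
    idx = np.random.default_rng(seed).permutation(n)
    n_train = int(frac_train * n)
    n_val = int(frac_val * n)
    train = idx[:n_train]                     # fit model parameters
    val = idx[n_train:n_train + n_val]        # tune hyper-parameters / choose the model
    test = idx[n_train + n_val:]              # final, one-time performance estimate
    return train, val, test

train, val, test = split_indices(1000)
print(len(train), len(val), len(test))
```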



Wednesday, June 2, 2021

Hebbian, Oja learning

 14:16:01 From Hong Qin to Everyone:

https://en.wikipedia.org/wiki/Lasso_(statistics)

14:24:48 From Trevor Peyton to Everyone:

https://en.wikipedia.org/wiki/Hebbian_theory

14:25:25 From Trevor Peyton to Everyone:

https://en.wikipedia.org/wiki/Generalized_Hebbian_algorithm

14:26:08 From Trevor Peyton to Everyone:

https://en.wikipedia.org/wiki/Oja%27s_rule
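A small numpy sketch of Oja's rule from the links above: a Hebbian update with a decay term that keeps the weight vector bounded, so it converges toward the first principal component of the inputs. The synthetic data and learning rate are arbitrary choices for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(5000, 2)) @ np.array([[2.0, 0.0], [1.5, 0.5]])  # correlated 2-D data

w = rng.normal(size=2)
eta = 0.01
for x in X:
    y = w @ x
    w += eta * y * (x - y * w)   # Oja's rule: Hebbian term minus a normalizing decay

# Compare with the leading eigenvector of the sample covariance (sign may differ)
evals, evecs = np.linalg.eigh(np.cov(X.T))
print(w / np.linalg.norm(w), evecs[:, -1])
```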





Tuesday, April 20, 2021

universal approximation

 Oldies but goldies: A. Barron, Universal Approximation Bounds for Superpositions of a Sigmoidal Function, 1993. Proves that 1 hidden layer perceptrons break the curse of dimensionality to approximate a class of smooth functions. en.wikipedia.org/wiki/Universal en.wikipedia.org/wiki/Multilaye

https://twitter.com/gabrielpeyre/status/1384371246461329409
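A toy illustration of the one-hidden-layer setting the note refers to (not Barron's construction or bound): a small tanh network fit to a smooth 1-D target by plain gradient descent. The width, learning rate, and iteration count are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(-np.pi, np.pi, 200).reshape(-1, 1)
t = np.sin(x)

h = 20                                   # hidden units
W1, b1 = rng.normal(scale=1.0, size=(1, h)), np.zeros(h)
W2, b2 = rng.normal(scale=0.1, size=(h, 1)), np.zeros(1)
lr = 0.05

for _ in range(5000):
    a = np.tanh(x @ W1 + b1)             # hidden activations
    y = a @ W2 + b2                      # network output
    err = y - t
    # Backpropagation for the (mean) squared-error loss
    gW2 = a.T @ err / len(x)
    gb2 = err.mean(axis=0)
    da = (err @ W2.T) * (1 - a ** 2)
    gW1 = x.T @ da / len(x)
    gb1 = da.mean(axis=0)
    W2 -= lr * gW2; b2 -= lr * gb2; W1 -= lr * gW1; b1 -= lr * gb1

print(float(np.mean((y - t) ** 2)))      # mean squared error after training
```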






Tuesday, February 16, 2021

Galois_theory, Langlands program

 


https://en.wikipedia.org/wiki/Fundamental_theorem_of_Galois_theory

https://zh.wikipedia.org/wiki/%E4%BC%BD%E7%BD%97%E7%93%A6%E7%90%86%E8%AE%BA%E5%9F%BA%E6%9C%AC%E5%AE%9A%E7%90%86


I have an intuition that, on the engineering side, the roadmap people follow for tinkering with optimization methods is rather weak. For young people, I think research under a theme like "Group Theory and Optimization" would be more general and could lift data-science research to another level. In general, the number of paths by which something can be transformed arbitrarily from one state to another is enormous; the most intuitive examples are the various board-game problems. In fact, the ideas these problems require are not essentially different from the ideas Galois used to solve equations (more than a hundred years later, very few people truly grasp their essence). If people could skillfully wield group-theoretic methods, the fate of artificial intelligence would be rewritten, with no need for brute-force methods. Klein's program back then and the Langlands program today both cannot escape Galois's groups...


https://en.wikipedia.org/wiki/Langlands_program

https://en.wikipedia.org/wiki/Felix_Klein

https://en.wikipedia.org/wiki/Erlangen_program



Friday, September 20, 2019

regularization, L1 and L2 norm


Regularization helps to choose a preferred model complexity, so that the model is better at predicting. Regularization is nothing but adding a penalty term to the objective function and controlling the model complexity with that penalty term. It can be used with many machine learning algorithms.
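A small sketch of the "penalty term" idea: ordinary least squares versus ridge regression (an L2 penalty), which shrinks the fitted coefficients. The data and penalty strength below are made up for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(50, 5))
true_w = np.array([3.0, 0.0, -2.0, 0.0, 1.0])
y = X @ true_w + rng.normal(scale=0.5, size=50)

lam = 5.0                                                        # regularization strength
ols   = np.linalg.solve(X.T @ X, X.T @ y)                        # minimizes ||y - Xw||^2
ridge = np.linalg.solve(X.T @ X + lam * np.eye(5), X.T @ y)      # adds lam * ||w||^2 penalty

print("OLS:  ", np.round(ols, 2))
print("Ridge:", np.round(ridge, 2))      # same data, but coefficients pulled toward zero
```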

Wednesday, May 2, 2018

machine learning, hyper parameters

Hyper-parameters, such as how to partition the data into training, validation, and test sets, or the number of iterations, in general have no theoretically fixed way of being chosen.

Hyper-parameters are those that need to be fixed before learning starts.

Hyperparameter optimization may be done by comparing tuples of hyperparameters, based on a predefined loss function evaluated on held-out data.
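A minimal sketch of that idea: try a small grid of values for one hyper-parameter (here the ridge penalty "lam") and keep the one with the lowest loss on a held-out validation set. The grid, split, and synthetic data are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)
X = rng.normal(size=(200, 5))
y = X @ np.array([3.0, 0.0, -2.0, 0.0, 1.0]) + rng.normal(scale=0.5, size=200)

X_tr, y_tr = X[:150], y[:150]               # training set: fit parameters
X_val, y_val = X[150:], y[150:]             # validation set: compare hyper-parameters

def fit_ridge(X, y, lam):
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)

best = min(
    (np.mean((X_val @ fit_ridge(X_tr, y_tr, lam) - y_val) ** 2), lam)
    for lam in [0.01, 0.1, 1.0, 10.0, 100.0]
)
print("best validation MSE, lam:", best)
```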


Friday, April 14, 2017

deep learning tools

Deep neural networks 

List of potential R packages for deep learning:

DeepNet

Deep learning libraries in Python:
Torch (torch.ch): academic use, flexibility (used by DeepSEA)
Caffe, developed by the Berkeley Vision and Learning Center
Theano, transparent use of the GPU

Keras https://keras.io/ (runs on a Theano or TensorFlow backend); see the sketch after this list.

Lasagne

Nolearn

Mocha
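A minimal example of the Keras API flavor, assuming the standalone keras package (with a TensorFlow or Theano backend) is installed: define a small network, compile it, and fit it. The random data, layer sizes, and training settings are placeholders.

```python
import numpy as np
from keras.models import Sequential
from keras.layers import Dense

X = np.random.rand(100, 20)
y = np.random.randint(0, 2, size=(100, 1))

model = Sequential([
    Dense(16, activation="relu", input_shape=(20,)),
    Dense(1, activation="sigmoid"),
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])
model.fit(X, y, epochs=5, batch_size=16, verbose=0)
print(model.evaluate(X, y, verbose=0))      # [loss, accuracy] on the same random data
```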


Wednesday, January 18, 2017

Wisconsin breast cancer diagnostic data set, machine learning analysis

This must be an old data set.

http://archive.ics.uci.edu/ml/datasets/Breast+Cancer+Wisconsin+%28Diagnostic%29
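For a quick look, scikit-learn ships a copy of this diagnostic data set as load_breast_cancer, so a simple train/test fit takes only a few lines. This is an illustrative sketch (the classifier choice and split are arbitrary), not an analysis from the papers listed below.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3, random_state=0)

clf = LogisticRegression(max_iter=5000).fit(X_tr, y_tr)   # large max_iter since features are unscaled
print("test accuracy:", clf.score(X_te, y_te))
```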


Gavin Brown. Diversity in Neural Network Ensembles. The University of Birmingham. 2004.

Krzysztof Grabczewski and Włodzisław Duch. Heterogeneous Forests of Decision Trees. ICANN. 2002.

András Antos and Balázs Kégl and Tamás Linder and Gábor Lugosi. Data-dependent margin-based generalization bounds for classification. Journal of Machine Learning Research, 3. 2002.

Kristin P. Bennett and Ayhan Demiriz and Richard Maclin. Exploiting unlabeled data in ensemble methods. KDD. 2002.

Hussein A. Abbass. An evolutionary artificial neural networks approach for breast cancer diagnosis. Artificial Intelligence in Medicine, 25. 2002.

Baback Moghaddam and Gregory Shakhnarovich. Boosted Dyadic Kernel Discriminants. NIPS. 2002.

Robert Burbidge and Matthew Trotter and Bernard F. Buxton and Sean B. Holden. STAR - Sparsity through Automated Rejection. IWANN (1). 2001.

Nikunj C. Oza and Stuart J. Russell. Experimental comparisons of online and batch versions of bagging and boosting. KDD. 2001.

Endre Boros and Peter Hammer and Toshihide Ibaraki and Alexander Kogan and Eddy Mayoraz and Ilya B. Muchnik. An Implementation of Logical Analysis of Data. IEEE Trans. Knowl. Data Eng, 12. 2000.

Yuh-Jeng Lee. Smooth Support Vector Machines. Preliminary Thesis Proposal, Computer Sciences Department, University of Wisconsin. 2000.

Justin Bradley and Kristin P. Bennett and A. Demiriz. Constrained K-Means Clustering. Microsoft Research / Dept. of Mathematical Sciences / Dept. of Decision Sciences and Eng. Sys. 2000.

Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Improved Generalization Through Explicit Optimization of Margins. Machine Learning, 38. 2000.

P. S. Bradley and K. P. Bennett and A. Demiriz. Constrained K-Means Clustering. Microsoft Research / Dept. of Mathematical Sciences / Dept. of Decision Sciences and Eng. Sys. 2000.

Chun-Nan Hsu and Hilmar Schuschel and Ya-Ting Yang. The ANNIGMA-Wrapper Approach to Neural Nets Feature Selection for Knowledge Discovery and Data Mining. Institute of Information Science. 1999.

Huan Liu and Hiroshi Motoda and Manoranjan Dash. A Monotonic Measure for Optimal Feature Selection. ECML. 1998.

Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Direct Optimization of Margins Improves Generalization in Combined Classifiers. NIPS. 1998.

W. Nick Street. A Neural Network Model for Prognostic Prediction. ICML. 1998.

Yk Huhtala and Juha Kärkkäinen and Pasi Porkka and Hannu Toivonen. Efficient Discovery of Functional and Approximate Dependencies Using Partitions. ICDE. 1998.

Prototype Selection for Composite Nearest Neighbor Classifiers. Department of Computer Science, University of Massachusetts. 1997.

Kristin P. Bennett and Erin J. Bredensteiner. A Parametric Optimization Method for Machine Learning. INFORMS Journal on Computing, 9. 1997.

Rudy Setiono and Huan Liu. NeuroLinear: From neural networks to oblique decision rules. Neurocomputing, 17. 1997.

Erin J. Bredensteiner and Kristin P. Bennett. Feature Minimization within Decision Trees. National Science Foundation. 1996.

Ismail Taha and Joydeep Ghosh. Characterization of the Wisconsin Breast Cancer Database Using a Hybrid Symbolic-Connectionist System. Proceedings of ANNIE. 1996.

Jennifer A. Blue and Kristin P. Bennett. Hybrid Extreme Point Tabu Search. Department of Mathematical Sciences, Rensselaer Polytechnic Institute. 1996.

Geoffrey I. Webb. OPUS: An Efficient Admissible Algorithm for Unordered Search. J. Artif. Intell. Res. (JAIR), 3. 1995.

Chotirat Ann and Dimitrios Gunopulos. Scaling up the Naive Bayesian Classifier: Using Decision Trees for Feature Selection. Computer Science Department, University of California.

Włodzisław Duch and Rudy Setiono and Jacek M. Zurada. Computational intelligence methods for rule-based data understanding.

Rafael S. Parpinelli and Heitor S. Lopes and Alex Alves Freitas. An Ant Colony Based System for Data Mining: Applications to Medical Data. CEFET-PR, CPGEI, Av. Sete de Setembro, 3165.

Włodzisław Duch and Rafał Adamczak (duchraad@phys.uni.torun.pl). Statistical methods for construction of neural networks. Department of Computer Methods, Nicholas Copernicus University.

Rafael S. Parpinelli and Heitor S. Lopes and Alex Alves Freitas. PART FOUR: ANT COLONY OPTIMIZATION AND IMMUNE SYSTEMS, Chapter X: An Ant Colony Algorithm for Classification Rule Discovery. CEFET-PR, Curitiba.

Adam H. Cannon and Lenore J. Cowen and Carey E. Priebe. Approximate Distance Classification. Department of Mathematical Sciences, The Johns Hopkins University.

Andrew I. Schein and Lyle H. Ungar. A-Optimality for Active Learning of Logistic Regression Classifiers. Department of Computer and Information Science, Levine Hall.

Bart Baesens and Stijn Viaene and Tony Van Gestel and J. A. K. Suykens and Guido Dedene and Bart De Moor and Jan Vanthienen and Katholieke Universiteit Leuven. An Empirical Assessment of Kernel Type Performance for Least Squares Support Vector Machine Classifiers. Dept. Applied Economic Sciences.

Adil M. Bagirov and Alex Rubinov and A. N. Soukhojak and John Yearwood. Unsupervised and supervised data classification via nonsmooth and global optimization. School of Information Technology and Mathematical Sciences, The University of Ballarat.

Rudy Setiono and Huan Liu. Neural-Network Feature Selector. Department of Information Systems and Computer Science, National University of Singapore.

Huan Liu. A Family of Efficient Rule Generators. Department of Information Systems and Computer Science, National University of Singapore.

Rudy Setiono. Extracting M-of-N Rules from Trained Neural Networks. School of Computing, National University of Singapore.

Jarkko Salojarvi and Samuel Kaski and Janne Sinkkonen. Discriminative clustering in Fisher metrics. Neural Networks Research Centre, Helsinki University of Technology.

Włodzisław Duch and Rafał Adamczak and Krzysztof Grabczewski and Grzegorz Zal. A hybrid method for extraction of logical rules from data. Department of Computer Methods, Nicholas Copernicus University.

Charles Campbell and Nello Cristianini. Simple Learning Algorithms for Training Support Vector Machines. Dept. of Engineering Mathematics.