Beard length A/B testing on #Tinder
https://appsciencing.wordpress.com/2018/11/19/beard-studies/
#abtest #statistics #practicalML
https://appsciencing.wordpress.com/2018/11/19/beard-studies/
#abtest #statistics #practicalML
Application Science
How Do You Test Out A New Look? Dating Apps!
People have described a dystopian future where machines control your dating life by presenting you with images of single women and convincing you that feeding the machines will lead to a life (or aโฆ
โโBuilding Automated Feature Rollouts on Robust Regression Analysis
Nice article on important thing โ statistical analysis of hypothesis testing. Every new feature or change made to existent one is basically an experiment. Article covers how #Uber team handles this in live system.
Link: https://eng.uber.com/autonomous-rollouts-regression-analysis/
#Uber #statistics #production #truestory
Nice article on important thing โ statistical analysis of hypothesis testing. Every new feature or change made to existent one is basically an experiment. Article covers how #Uber team handles this in live system.
Link: https://eng.uber.com/autonomous-rollouts-regression-analysis/
#Uber #statistics #production #truestory
A visual exploration of Gaussian Processes: beautiful interactive plots and a brief tutorial to make GPs more approachable
Link: https://www.jgoertler.com/visual-exploration-gaussian-processes/
#Statistics #GP #GaussianProcesses
Link: https://www.jgoertler.com/visual-exploration-gaussian-processes/
#Statistics #GP #GaussianProcesses
Jochen Gรถrtler
A Visual Exploration of Gaussian Processes
How to turn a collection of small building blocks into a versatile tool for solving regression problems.
Evaluating gambles using dynamics
Link: https://aip.scitation.org/doi/10.1063/1.4940236
#Statistics #Gambling
Link: https://aip.scitation.org/doi/10.1063/1.4940236
#Statistics #Gambling
AIP Publishing
Evaluating gambles using dynamics
Gambles are random variables that model possible changes in wealth. Classic decision theory transforms money into utility through a utility function and defines
Valuing Life as an Asset, as a Statistic and at Gunpoint
Ever wondered, how much your life is worth? This is an article about Life as an asset evaluation. It is extremely useful for insuarance companies and as a metric to calculate compensations in case of tragic events, but it is also a key to understand, how valuable (or not) life is.
Math is beautiful.
Link: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3156911
#math #life #insurance #statistics
Ever wondered, how much your life is worth? This is an article about Life as an asset evaluation. It is extremely useful for insuarance companies and as a metric to calculate compensations in case of tragic events, but it is also a key to understand, how valuable (or not) life is.
Math is beautiful.
Link: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3156911
#math #life #insurance #statistics
โโIQ is largely a pseudoscientific swindle
Note by Nassim Taleb on how IQ works. He shows that high-IQ is not well-correlated with wealth or overall cognitive performance.
Link: https://medium.com/incerto/iq-is-largely-a-pseudoscientific-swindle-f131c101ba39
#statistics #iq #fallacy
Note by Nassim Taleb on how IQ works. He shows that high-IQ is not well-correlated with wealth or overall cognitive performance.
Link: https://medium.com/incerto/iq-is-largely-a-pseudoscientific-swindle-f131c101ba39
#statistics #iq #fallacy
โโFair Regression for Health Care Spending
What happens, if fairness built into the objective function for continuous outcomes & see large improvements in group undercompensation?
This is the most interesting & potentially impactful analysis of fairness in #ML for #healthcare, which can lead to significant improvement in the life of millions.
ArXiV: https://arxiv.org/abs/1901.10566
GitHub: https://github.com/zinka88/Fair-Regression
#statistics #regression
What happens, if fairness built into the objective function for continuous outcomes & see large improvements in group undercompensation?
This is the most interesting & potentially impactful analysis of fairness in #ML for #healthcare, which can lead to significant improvement in the life of millions.
ArXiV: https://arxiv.org/abs/1901.10566
GitHub: https://github.com/zinka88/Fair-Regression
#statistics #regression
โโWhy Financial Planning is Excitingโฆ At Least for a Data Scientist
Great introduction into the finance world and what data scientist can lack diving into the topic.
Link: https://eng.uber.com/financial-planning-for-data-scientist/
#Financial #statistics #Uber
Great introduction into the finance world and what data scientist can lack diving into the topic.
Link: https://eng.uber.com/financial-planning-for-data-scientist/
#Financial #statistics #Uber
Probabilistic foundations of econometrica: part 1
Great intro into #statistics basics.
Link: https://freakonometrics.hypotheses.org/57649
#beginner #novice #entrylevel
Great intro into #statistics basics.
Link: https://freakonometrics.hypotheses.org/57649
#beginner #novice #entrylevel
Freakonometrics
Probabilistic Foundations of Econometrics, part 1
In a series of posts, I wanted to get into details of the history and foundations of econometric and machine learning models. It will be some sort of online version of our joint paper with Emmanuel Flachaire and Antoine Ly, Econometrics and Machine Learningโฆ
Analyzing Experiment Outcomes: Beyond Average Treatment Effects
Good #statistics article on why tail distribution and #experimentdesign matters. Quantile treatment effects (QTEs) helps to capture the inherent heterogeneity in treatment effects when riders and drivers interact within the #Uber marketplace.
Link: https://eng.uber.com/analyzing-experiment-outcomes/
Good #statistics article on why tail distribution and #experimentdesign matters. Quantile treatment effects (QTEs) helps to capture the inherent heterogeneity in treatment effects when riders and drivers interact within the #Uber marketplace.
Link: https://eng.uber.com/analyzing-experiment-outcomes/
โโPseudo-extended Markov chain Monte Carlo
Pseudo-Extended #MC for easier sampling from multimodal posteriors. Extend the target distribution and then run your favourite sampler (f.e. #HMC).
ArXiV: https://arxiv.org/abs/1708.05239
#statistics
Pseudo-Extended #MC for easier sampling from multimodal posteriors. Extend the target distribution and then run your favourite sampler (f.e. #HMC).
ArXiV: https://arxiv.org/abs/1708.05239
#statistics
โโImportant article in Nature about statistical significance
Scientists rise up against statistical significance โ about motion to move from widely using and quoting statistical significance to confindence intervals.
Link: https://www.nature.com/articles/d41586-019-00857-9
#statistics #statsignificance #nature #science
Scientists rise up against statistical significance โ about motion to move from widely using and quoting statistical significance to confindence intervals.
Link: https://www.nature.com/articles/d41586-019-00857-9
#statistics #statsignificance #nature #science
Ranking Items With Star Ratings and How Not To Sort By Average Rating
Two absolute must read articles for proper sorting handling. Sorting items with just an average score is wrong and there is some good classic statistics explanation why.
Link: https://www.evanmiller.org/ranking-items-with-star-ratings.html
Link2: https://www.evanmiller.org/how-not-to-sort-by-average-rating.html
#Statistics #rating #scoring #ranking
Two absolute must read articles for proper sorting handling. Sorting items with just an average score is wrong and there is some good classic statistics explanation why.
Link: https://www.evanmiller.org/ranking-items-with-star-ratings.html
Link2: https://www.evanmiller.org/how-not-to-sort-by-average-rating.html
#Statistics #rating #scoring #ranking
www.evanmiller.org
How Not To Sort By Average Rating
Users are rating items on your website. How do you know what the highest-rated items are?
The female problem: how male bias in medical trials ruined women's health
Intersting article on #bias in #medical trials and how proper #statistics training is still important.
Link: https://www.theguardian.com/lifeandstyle/2019/nov/13/the-female-problem-male-bias-in-medical-trials
Intersting article on #bias in #medical trials and how proper #statistics training is still important.
Link: https://www.theguardian.com/lifeandstyle/2019/nov/13/the-female-problem-male-bias-in-medical-trials
the Guardian
The female problem: how male bias in medical trials ruined women's health
Centuries of female exclusion has meant womenโs diseases are often missed, misdiagnosed or remain a total mystery
๐Guest post on great example of book abandonment at GoodReads
An excellent new article from Gwern on analyzing abandoned (hard to finish, hard to read) books on Goodreads. This write up includes step by step instructions with source code, even the way he parsed the data from the website without an API.
Itโs a shame analysis like this does not come from an online book subscription service like Bookmate or MyBook. They have vastly superior datasets and many able data scientists. I am quite sure amazon kindle team does prepare internal reports like that for some evil business purposes, but thatโs a whole different story.
During my time at video game database company RAWG.io weโve compiled โmost abandonedโ and โmost addictiveโ reports for video games.
Do you make a popular service with valuable user behavior data? Funny data analysis reports are a good way to get some attention to your product. Take a lead from Pornhub, they are great at publicizing their data.
Link: https://www.gwern.net/GoodReads
Pornhub Insights: https://www.pornhub.com/insights/
โ
This is a guest post by Samat Galimov, who writes about technology, programming and management in Russian on @ctodaily.
#DataAnalysis #GoodReads #statistics #greatstats #talkingnumbers
An excellent new article from Gwern on analyzing abandoned (hard to finish, hard to read) books on Goodreads. This write up includes step by step instructions with source code, even the way he parsed the data from the website without an API.
Itโs a shame analysis like this does not come from an online book subscription service like Bookmate or MyBook. They have vastly superior datasets and many able data scientists. I am quite sure amazon kindle team does prepare internal reports like that for some evil business purposes, but thatโs a whole different story.
During my time at video game database company RAWG.io weโve compiled โmost abandonedโ and โmost addictiveโ reports for video games.
Do you make a popular service with valuable user behavior data? Funny data analysis reports are a good way to get some attention to your product. Take a lead from Pornhub, they are great at publicizing their data.
Link: https://www.gwern.net/GoodReads
Pornhub Insights: https://www.pornhub.com/insights/
โ
This is a guest post by Samat Galimov, who writes about technology, programming and management in Russian on @ctodaily.
#DataAnalysis #GoodReads #statistics #greatstats #talkingnumbers
gwern.net
The Most โAbandonedโ Books on GoodReads
Which books on GoodReads are most difficult to finish? Estimating
proportions in December 2019 gives an entirely different result than
absolute counts.
proportions in December 2019 gives an entirely different result than
absolute counts.
P-value, explained, one more time with demos
Article includes not only great explanation of what is #pvalue, but how it works and how it can be used to make a correct conclusions.
Link:https://www.freecodecamp.org/news/what-is-statistical-significance-p-value-defined-and-how-to-calculate-it/
#entrylevel #dsformanagers #tutorial #explained #interactive #statistics
Article includes not only great explanation of what is #pvalue, but how it works and how it can be used to make a correct conclusions.
Link:https://www.freecodecamp.org/news/what-is-statistical-significance-p-value-defined-and-how-to-calculate-it/
#entrylevel #dsformanagers #tutorial #explained #interactive #statistics
freeCodeCamp.org
What is Statistical Significance? P Value Defined and How to Calculate It
By Peter Gleeson P values are one of the most widely used concepts in statistical analysis. They are used by researchers, analysts and statisticians to draw insights from data and make informed decisions. Along with statistical significance, they are...
Tail risk of contagious diseases
Fresh N. Talebโs and Pasquale Cirilloโs article on risks of fat tail distribution.
Article: https://www.nature.com/articles/s41567-020-0921-x
#statistics #fattail
Fresh N. Talebโs and Pasquale Cirilloโs article on risks of fat tail distribution.
Article: https://www.nature.com/articles/s41567-020-0921-x
#statistics #fattail
Nature
Tail risk of contagious diseases
Nature Physics - This Perspective argues that an approach called extreme value theory is appropriate for understanding the so-called tail risk of epidemic outbreaks, in particular by demonstrating...
โโOverview of Open Source projects growth metrics
Quantative analytics of top starred repositories.
Link: https://medium.com/runacapital/open-source-growth-benchmarks-and-the-20-fastest-growing-oss-startups-d3556a669fe6
#opensource #analytics #statistics #growth
Quantative analytics of top starred repositories.
Link: https://medium.com/runacapital/open-source-growth-benchmarks-and-the-20-fastest-growing-oss-startups-d3556a669fe6
#opensource #analytics #statistics #growth