由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Joke版 - 革千老祖师爷的命了:iris dataset
相关主题
谁在下载盗版paper?真的,人类不可以再吃猪肉了
我现在在国内,上mitbbs毫无问题从挖车事件看国男怎么拱手把MM送给老外的 (转载)
Ping Pong Penguin真正的高素質人民!不留一絲垃圾在廣場 (转载)
马甲的祖师爷: 鲁迅大爷 ZZ美国拍A片赚学费女大学生将在著名色情网站Pornhub实习ZT
怎么突然被封了说说我比较欣赏的一些二线ID (转载)
企鹅居然会飞!(BBC视频)企鹅带路软件
【笑死了】When the Bruins Lost, Boston Turned to Porn (转载)老毛真是民科的祖师爷啊
美国遗传学家:人类是公猪与母猩猩杂交产物海豹强奸企鹅就像琐男撸管一样 (转载)
相关话题的讨论汇总
话题: dataset话题: iris话题: eugenicist话题: github话题: penguins
进入Joke版参与讨论
1 (共1页)
w*****g
发帖数: 16352
1
GitHub上看来的。学习了。还是PornHub社区祥和。
So given recent events there has been a lot of discussion about the Iris
data, which is a standard for demonstrating classification and other
statistical techniques. The problem with it is that was first published by
Ronald Fisher (a Eugenicist) in the Annals of Eugenics and I don't think I
need to point out why that history is extremely problematic for a dataset
that has been used for classification problems. It's also incredibly
overused and many people will roll their eyes just seeing it.
Thankfully there is a new dataset a lot of people seem to be coalescing
around, specifically this open source dataset about penguins: https://github
.com/allisonhorst/palmerpenguins, which has several advantages:
It wasn't popularized by a eugenicist
It has more interesting dimensions to explore
It's about penguins
I therefore propose we replace the Iris data in sampledata and all the
associated examples with equivalent examples using the penguin data.
Libraries like seaborn have already taken this step and I think we should
too.
★ 发自iPhone App: ChinaWeb 1.1.5
H********g
发帖数: 43926
2
优生学是反动的资产阶级科学

github

【在 w*****g 的大作中提到】
: GitHub上看来的。学习了。还是PornHub社区祥和。
: So given recent events there has been a lot of discussion about the Iris
: data, which is a standard for demonstrating classification and other
: statistical techniques. The problem with it is that was first published by
: Ronald Fisher (a Eugenicist) in the Annals of Eugenics and I don't think I
: need to point out why that history is extremely problematic for a dataset
: that has been used for classification problems. It's also incredibly
: overused and many people will roll their eyes just seeing it.
: Thankfully there is a new dataset a lot of people seem to be coalescing
: around, specifically this open source dataset about penguins: https://github

1 (共1页)
进入Joke版参与讨论
相关主题
【大数据】民主党、共和党和毛片怎么突然被封了
中国人观看成人视频的时间最长(zz)企鹅居然会飞!(BBC视频)
台媒:中国大陆人看色情片时长排世界第一 (转载)【笑死了】When the Bruins Lost, Boston Turned to Porn (转载)
not your usual penguin美国遗传学家:人类是公猪与母猩猩杂交产物
谁在下载盗版paper?真的,人类不可以再吃猪肉了
我现在在国内,上mitbbs毫无问题从挖车事件看国男怎么拱手把MM送给老外的 (转载)
Ping Pong Penguin真正的高素質人民!不留一絲垃圾在廣場 (转载)
马甲的祖师爷: 鲁迅大爷 ZZ美国拍A片赚学费女大学生将在著名色情网站Pornhub实习ZT
相关话题的讨论汇总
话题: dataset话题: iris话题: eugenicist话题: github话题: penguins