请教大家一道ML设计题 - JobHunting版

本页内容为未名空间相应帖子的节选和存档，一周内的贴子最多显示50字，超过一周显示500字访问原贴

JobHunting版 - 请教大家一道ML设计题

相关主题
● yelp skype面	● 离成功转码还有多远？ (转载)
● What is the worst part about working at Google?	● [合集] 那个Google random generate 1-7的题怎么做啊？
● 报个offer，统计	● 讨论个idea题
● 发一个MSFT bing的onsite面经	● 问一道题
● 大家来讨论一下 software engineer-machine learning \|\| data mining 的要求把。	● 一道面试碰到的概率题
● g家店面	● 求教Careercup 150 上的一道题目
● Machine learning / data science 面经以及一些总结	● MS on-site
● 东岸 data science /CS内推机会	● 问个关于Epic的题目：Mingo

相关话题的讨论汇总
话题: upvotes话题: review话题: ml话题: let话题: yelp

进入JobHunting版参与讨论

(共1页)

x*****0
发帖数: 452

I've encountered this question online. How to design an algorithm to predict
how many upvotes a review would get in a certain time period after
publishing. Let's say the review comes from Yelp. So the problem becomes the
following:
> Give millions of reviews (text) and their associated upvotes in a
> certain time period, how do you design a ML predictive model?
Definitely, a lot of prior information may affect the upvotes of a certain
review would got. For example:
(1) Who wrote this review, elite or a regular user?
(2) The business of the review. A review for a hot restaurant may get
more upvotes than for a car mechanic. Generally speaking, when you have a
Yelp's review record, it contains a business id.
(3) ...
Let's pass these prior information and focus on the text features. Can
anyone give me some suggestions. I'm thinking using LDA (topic model) to
generate a topics dictionary but still did not come out a complete solution.
关于如何从text中generate features，大家有什么好的想法吗？

(共1页)

进入JobHunting版参与讨论

相关主题
● 问个关于Epic的题目：Mingo	● 大家来讨论一下 software engineer-machine learning \|\| data mining 的要求把。
● 吾波第三季亏损再破记录	● g家店面
● [合集] 绿卡重要，还是career重要	● Machine learning / data science 面经以及一些总结
● 问一下关于CPT的事情	● 东岸 data science /CS内推机会
● yelp skype面	● 离成功转码还有多远？ (转载)
● What is the worst part about working at Google?	● [合集] 那个Google random generate 1-7的题怎么做啊？
● 报个offer，统计	● 讨论个idea题
● 发一个MSFT bing的onsite面经	● 问一道题

相关话题的讨论汇总
话题: upvotes话题: review话题: ml话题: let话题: yelp

#	版面	帖数(主题数)
-	全站	4871 (796)
1	Military	3777 (569)
2	Stock	341 (51)
3	Joke	117 (17)
4	History	116 (3)
5	Automobile	100 (9)
6	USANews	55 (9)
7	Midlife	45 (1)
8	Headline	41 (41)
9	Dreamer	33 (13)
10	FleaMarket	32 (20)
11	Living	30 (7)

boards

未名新帖统计// 7月16日

历史上的今天