由买买提看人间百态

boards

本页内容为未名空间相应帖子的节选和存档,一周内的贴子最多显示50字,超过一周显示500字 访问原贴
Programming版 - Interview questions about hash function
相关主题
贡献一下:本版上搜集的 Google 面试题 (转载)Re: 请教一道题目
百度面试题,any idea?Re: [转载] how would you do this?
有大牛可以解释下bloom filter是在什么条件下使用最好perl array|hash question
又一道面试题,我是不是想多了?hashtable question
一个关于unordered_map/hashmap的问题这个条件语句如何写?
STL map一道c++ 题, 找出duplicate numbers
slack got hacked, 没一个安全的intel icc hash_map 求救!
consistent hashing实际应用Check if the sum of two integers in an integer array eqauls to the given number
相关话题的讨论汇总
话题: hash话题: function话题: interview话题: about话题: questions
进入Programming版参与讨论
1 (共1页)
l******e
发帖数: 94
1
Phone screen question:
You have a billion urls , where each has a huge page, how to detect the
duplicate documents?
I said hashing the document contents, so the interviewer asker do I know
which hash function should I used? I have no clue about what specific
function can hash a large file into a small key that takes relatively less
space.
Anybody give can give me some hint?
D*******a
发帖数: 3688
2
you can use any hash functions, e. g. sum all characters mod 2^32-1

less

【在 l******e 的大作中提到】
: Phone screen question:
: You have a billion urls , where each has a huge page, how to detect the
: duplicate documents?
: I said hashing the document contents, so the interviewer asker do I know
: which hash function should I used? I have no clue about what specific
: function can hash a large file into a small key that takes relatively less
: space.
: Anybody give can give me some hint?

1 (共1页)
进入Programming版参与讨论
相关主题
Check if the sum of two integers in an integer array eqauls to the given number 一个关于unordered_map/hashmap的问题
map是用什么data structure来implement的?STL map
C下有没有好用的hash table函数库?slack got hacked, 没一个安全的
c+= 怎么实现 hashtable 的?consistent hashing实际应用
贡献一下:本版上搜集的 Google 面试题 (转载)Re: 请教一道题目
百度面试题,any idea?Re: [转载] how would you do this?
有大牛可以解释下bloom filter是在什么条件下使用最好perl array|hash question
又一道面试题,我是不是想多了?hashtable question
相关话题的讨论汇总
话题: hash话题: function话题: interview话题: about话题: questions