3
邮件地址的频数统计
source link: https://yihui.org/cn/2009/04/frequency-of-email-addresses/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
邮件地址的频数统计
谢益辉 / 2009-04-13
受 Zhang H. 启发,自个儿看了看 Gmail 邮箱的联系人,导出为 Outlook 的 CSV 格式,然后读进 R,用正则表达式去掉 *@部分,table () 了一下。Hotmail 邮箱的频数比我想象的要高,然后也没想到中科院植物所也排在前面,估计是那次 R 会议所致。剩下的联系人就分布在五湖四海了,结果请看:
x = read.csv("contacts.csv", stringsAsFactor = FALSE)
y = gsub("^.*@", "", x$E.mail.Address)
y = sort(table(y))
Disqus
Utterances
Preferences
© Yihui Xie 2005 - 2020
Recommend
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK