3

邮件地址的频数统计

 3 years ago
source link: https://yihui.org/cn/2009/04/frequency-of-email-addresses/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
neoserver,ios ssh client

邮件地址的频数统计

谢益辉 / 2009-04-13


受 Zhang H. 启发,自个儿看了看 Gmail 邮箱的联系人,导出为 Outlook 的 CSV 格式,然后读进 R,用正则表达式去掉 *@部分,table () 了一下。Hotmail 邮箱的频数比我想象的要高,然后也没想到中科院植物所也排在前面,估计是那次 R 会议所致。剩下的联系人就分布在五湖四海了,结果请看:

CSV 频数文件下载

x = read.csv("contacts.csv", stringsAsFactor = FALSE)
y = gsub("^.*@", "", x$E.mail.Address)
y = sort(table(y))

又摔上跤了 世上最痛苦的事

Disqus Utterances Preferences

© Yihui Xie 2005 - 2020

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK