Resource page for BAEIR Members and Friends
Feel free to suggest useful stuff to be listed here.
Lab Meeting Material
- 2019 Spring (restricted access)
- 2018 Fall (restricted access)
- 2018 Spring (restricted access)
- 2017 Fall (restricted access)
- 2017 Spring (restricted access)
- 2016 Fall (restricted access)
- 2016 Spring (restricted access)
- 2015 Fall (restricted access)
- 2015 Summer Bootcamp (restricted access)
- 2015 Spring (restricted access)
- 2014 Fall (restricted access)
Important Conferences:
- KDD; ACM SigKDD Conference on Knowledge Discovery and Data Mining; http://www.kdd.org/
- KDD 2013, http://www.kdd.org/kdd2013/, local archive at http://weiwei.lu.im.ntu.edu.tw/kdd2013/kdd2013.htm (username/password: kdd2013/2013kdd)
- KDD 2012, http://kdd2012.sigkdd.org/, http://weiwei.lu.im.ntu.edu.tw/kdd2012/
- NIPS, http://nips.cc/
- ICML, http://icml.cc/2015/
- ICDM; IEEE International Conference on Data Mining
- ICIS 2014, http://icis2014.aisnet.org/
Important Journals (IS oriented):
- MIS Quarterly
- Information Systems Research
- Journal of Management Information Systems
- Decision Support Systems
Import Journals (Technical):
- ACM Transactions on Information Systems
- IEEE Transactions Knowledge and Data Engineering
Real Time Status (cluster, network traffic)
You are welcome to apply for an account on common.lu.im.ntu.edu.tw (Ubuntu 12 LTS)
- Lab Torque Cluster Status (originally created by 鈺嫻)
- Traffice flow: 資管系
Opensource Tools
- R (statistical inference platform), http://www.r-project.org/
- Rcpp: a good way to improve the speed of you R code. You may also need RcppArmadillo.
- Writing C extension for R: The standard way to to improve the speed of your R code.
- Natural language processing and text mining:
- OpenNLP (https://opennlp.apache.org/),
- Lingpipe (http://alias-i.com/lingpipe/),
- Standford NLP tools (http://nlp.stanford.edu/software/index.shtml)
- Mallet (http://mallet.cs.umass.edu/)
- NLTK (http://www.nltk.org/), a Python library
Dataset: Public
- UCI Machine Learning Repository
- Kaggle
- Open Government Data
- KDnugget
- StatLib
- University of Edinburgh
- KDD Cup (1997 - 2010); use Google to locate newer KDD CUP datasets.
- Enron Email Dataset
- MovieLens
- 467 Million Twitter tweets; also see the local dataset
- Wikipedia Downloads
- Wikipedia pagecount; also see the local dataset
- WRDS: Accounting data, stock return, high frequency trading, analysts forecasts (you need to apply for an account through the college); also see the local dataset