BMI Students

Tuesday, November 29, 2005

Using Curl for fast downloads

This is nice, although I am rarely waiting on downloads these days.
From here.

For example, suppose you want to download the Mandrake 8.0 ISO from the following three locations:
url1=http://ftp.eecs.umich.edu/pub/linux/mandrake/iso/Mandrake80-inst.iso
url2=http://ftp.rpmfind.net/linux/Mandrake/iso/Mandrake80-inst.iso
url3=http://ftp.wayne.edu/linux/mandrake/iso/Mandrake80-inst.iso

The file is 677281792 bytes long, so start three simultaneous downloads, each fetching a different byte range with curl's "--range" (-r) option:
bash$ curl -r 0-199999999 -o mdk-iso.part1 $url1 &
bash$ curl -r 200000000-399999999 -o mdk-iso.part2 $url2 &
bash$ curl -r 400000000- -o mdk-iso.part3 $url3 &
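
The excerpt stops once the three downloads are running, so here is a rough sketch of the remaining bookkeeping: working out evenly sized ranges and gluing the parts back together. The helper names byte_ranges and join_parts and the output name Mandrake80-inst.iso are my own placeholders, not part of the original tip, and it only works if every mirror honors range requests and serves a byte-identical file.

```shell
# Hypothetical helper: print one "start-end" range per line for splitting
# a file of $1 bytes into $2 parts. The last range is left open-ended
# ("start-") so it runs to the end of the file, as in the third curl call.
byte_ranges() (
    length=$1
    parts=$2
    chunk=$(( length / parts ))   # floor; the last part absorbs the remainder
    i=0
    while [ "$i" -lt "$parts" ]; do
        start=$(( i * chunk ))
        if [ "$i" -eq $(( parts - 1 )) ]; then
            echo "${start}-"
        else
            echo "${start}-$(( start + chunk - 1 ))"
        fi
        i=$(( i + 1 ))
    done
)

# Hypothetical helper: concatenate the parts, in the order given, into $1.
join_parts() (
    out=$1
    shift
    cat "$@" > "$out"
)

# Usage (commented out because it needs the downloads from above):
# wait    # block until the background curl jobs have finished
# join_parts Mandrake80-inst.iso mdk-iso.part1 mdk-iso.part2 mdk-iso.part3
```

Calling byte_ranges 677281792 3 gives three ranges that can be passed to -r in place of the hand-picked ones above. The parts have to be joined in order, and it is worth comparing the size of the result against the expected 677281792 bytes before trusting it.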

Monday, November 14, 2005

Machine learning blog

The authors of this machine learning blog have scanned a number of old papers, including:

"Why isn't everyone a Bayesian?" by B. Efron, The American Statistician, 1986. Examines the reasons why not everybody was a Bayesian as of 1986, with a scorching reply from Lindley.
"Axioms of Maximum Entropy" by Skilling, MaxEnt 1988 proceedings. Sets up four practically motivated axioms, and uses them to derive maximum entropy as the unique method for picking a single probability distribution from the set of valid probability distributions.

The other posts are worth a look too.
Here

Personal Genome Project

George Church comments on a Personal Genome Project...
Here

Friday, November 04, 2005

The FDR

The false discovery rate (FDR) is really important for most of us. This paper (lecture notes, actually) covers most of what you need to know. Gil Chu recently gave a talk on local FDR, so I am linking to that paper too. The first link's server seems to be down right now (hopefully temporarily). Does anyone know what the difference between local FDR and PER is?

Multiple Hypothesis Correction
Review from Genome Res.
Local FDR