Difference between revisions of "Main Page/Research/MSB/Data processing"
From phurvitz
Phil Hurvitz (talk | contribs) |
Phil Hurvitz (talk | contribs) (→Data processing scripts) |
||
(2 intermediate revisions by the same user not shown) | |||
Line 68: | Line 68: | ||
==Data processing scripts== | ==Data processing scripts== | ||
− | To download MSB files: [[/ | + | To download MSB files: [[../Scripts/msb.get.data.pl|msb.get.data.pl]] |
− | To read MSB files: [[/ | + | To read MSB files: [[../Scripts/read.msb.files.R|read.msb.files.R]] |
− | To remove duplicates in the ''class.csv'' file: [[/ | + | To remove duplicates in the ''class.csv'' file: [[../Scripts/msb_remdupes.pl|msb_remdupes.pl]] |
− | To conflate timestamps between the the phone log file and the MyExperience file: [[/ | + | To conflate timestamps between the the phone log file and the MyExperience file: [[../Scripts/conflate_timestamp.R|conflate_timestamp.R]] |
+ | |||
+ | A master script to download and process data | ||
==How the hell does it work?== | ==How the hell does it work?== | ||
[[/how it works|how it works]] | [[/how it works|how it works]] |
Latest revision as of 17:18, 27 January 2009
Contents
Jonathan's processing stuff
Job submit: http://ubi.cs.washington.edu/infrastructure/submitJob_phurvitz.html
Job status: http://ubi.cs.washington.edu/assist/index.php/Job_Status
Output data: http://www.cs.washington.edu/research/projects/ubicomp3/phurvitz/gis.washington.edu/phurvitz/msb/data/
Pushing files
use rsync: e.g., to copy from the SD card to the HD
rsync -rivh --ignore-existing --size-only /cygdrive/g/msp-data/ /cygdrive/c/users/phurvitz/htdocs/phurvitz/msb/data/pmh04_20071012/
e.g., to copy from the HD to the web server
rsync -rivh --ignore-existing --size-only /cygdrive/c/users/phurvitz/htdocs/phurvitz/msb/data/pmh04_20071012/ /cygdrive/p/public_html/msb/data/pmh04_20071012/
A script to automate this pushing process (assuming the SD card is mounted on drive G:
#! /usr/bin/perl use strict; # get the args if ($#ARGV == -1) { print "Usage: $0 <subject_id>\n"; exit; } # get the dirs my $sddata = "g:/msp-data/"; my $hddata = "c:/users/phurvitz/htdocs/phurvitz/msb/data/$ARGV[0]/"; my $msbdata = "p:/public_html/msb/data/$ARGV[0]/"; # do the dirs exist? if (! -d $sddata) { print "$sddata does not exist\n"; exit; } if (! -d $hddata) { print "$hddata does not exist\n"; exit; } if (! -d $msbdata) { print "$msbdata does not exist\n"; exit; } # munge to cygdir pathnames $sddata =~ s/://g; $sddata =~ s/^/\/cygdrive\// ; $hddata =~ s/://g; $hddata =~ s/^/\/cygdrive\// ; $msbdata =~ s/://g; $msbdata =~ s/^/\/cygdrive\// ; # copy from the SD card to the HD system ("rsync -rivh --ignore-existing --size-only $sddata $hddata"); # copy from the HD to the webdir system ("rsync -rivh --ignore-existing --size-only $hddata $msbdata");
Data processing scripts
To download MSB files: msb.get.data.pl
To read MSB files: read.msb.files.R
To remove duplicates in the class.csv file: msb_remdupes.pl
To conflate timestamps between the the phone log file and the MyExperience file: conflate_timestamp.R
A master script to download and process data