Difference between revisions of "Main Page/Research/MSB/Data processing/how it works"

From phurvitz
Jump to: navigation, search
 
Line 9: Line 9:
 
* Downloaded data are stored in separate subdirectories for each subject, named '''part_00''', '''part_01''', etc., per MSB power-cycle session
 
* Downloaded data are stored in separate subdirectories for each subject, named '''part_00''', '''part_01''', etc., per MSB power-cycle session
 
** Each part should be downloaded and processed separately.   
 
** Each part should be downloaded and processed separately.   
** Use the script [[/msb.get.data|msb.get.data]] to download all the parts in one step.
+
** Use the script [[../../Scripts/msb.get.data.pl|msb.get.data.pl]] to download all the parts in one step.
  
* Once downloaded, use the R script [[/read.msb.files|read.msb.files]] to generate csv files.
+
* Once downloaded, use the R script [[../../Scripts/read.msb.files.R|read.msb.files.R]] to generate csv files.
  
* There may be records with duplicate timestamps.  To remove duplicate records from the class.csv file, use the perl script [[/msb_remdupes.pl|msb_remdupes.pl]] script.
+
* There may be records with duplicate timestamps.  To remove duplicate records from the class.csv file, use the perl script [[../../Scripts/msb_remdupes.pl|msb_remdupes.pl]].
  
 
* The csv files can be related by timestamps (''i.e.'', '''phone_log''' and '''class''' by the '''date''' field).
 
* The csv files can be related by timestamps (''i.e.'', '''phone_log''' and '''class''' by the '''date''' field).

Latest revision as of 17:12, 27 January 2009


  • Each time the MSB starts, a new session is started. Files on the MSB increment with the pattern
    • log.uwar
    • log-0001.uwar
    • log-0002.uwar
    • ....
  • Downloaded data are stored in separate subdirectories for each subject, named part_00, part_01, etc., per MSB power-cycle session
    • Each part should be downloaded and processed separately.
    • Use the script msb.get.data.pl to download all the parts in one step.
  • There may be records with duplicate timestamps. To remove duplicate records from the class.csv file, use the perl script msb_remdupes.pl.
  • The csv files can be related by timestamps (i.e., phone_log and class by the date field).