Difference between revisions of "Main Page/Research/MSB/Data processing/how it works"

From phurvitz

Revision as of 22:04, 12 October 2007


  • Each time the MSB starts, it begins a new session. Log files on the MSB are numbered incrementally, one per session, following the pattern:
    • log.uwar
    • log-0001.uwar
    • log-0002.uwar
    • ....
  • For each subject, downloaded data are stored in separate subdirectories named part_00, part_01, etc., one per MSB power-cycle session.
    • Each part should be downloaded and processed separately.
    • Use the script msb.get.data to download all the parts in one step.
  • Once downloaded, use the R script read.msb.files to generate csv files.
  • There may be records with duplicate timestamps. To remove duplicate records from the class.csv file, use the Perl script msb_remdupes.pl.
  • The csv files can be related to one another by their timestamps (e.g., phone_log and class can be joined on the date field).
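
The naming pattern, duplicate removal, and timestamp join described above can be sketched in Python. This is a minimal illustration, not the actual msb.get.data, read.msb.files, or msb_remdupes.pl code: the function names, the sample rows, and the activity and event columns are all hypothetical, and keeping the first record for each duplicated timestamp is an assumption about how duplicates should be resolved. Only the date field and the log file pattern come from the notes above.

```python
import csv
import io


def session_filename(n):
    """Name of the n-th session log, following log.uwar, log-0001.uwar, ..."""
    return "log.uwar" if n == 0 else f"log-{n:04d}.uwar"


def remove_duplicate_timestamps(rows, key="date"):
    """Keep only the first row seen for each timestamp (an assumed policy)."""
    seen = set()
    unique = []
    for row in rows:
        if row[key] not in seen:
            seen.add(row[key])
            unique.append(row)
    return unique


def join_on_timestamp(left, right, key="date"):
    """Inner-join two lists of row dicts on a shared timestamp column."""
    right_by_key = {row[key]: row for row in right}
    joined = []
    for row in left:
        match = right_by_key.get(row[key])
        if match is not None:
            # Columns from the left row win if both sides share a name.
            joined.append({**match, **row})
    return joined


# Hypothetical sample data; only the "date" column is named in the notes.
CLASS_CSV = """date,activity
2007-10-12 22:04:01,walking
2007-10-12 22:04:01,standing
2007-10-12 22:04:02,walking
"""
PHONE_LOG_CSV = """date,event
2007-10-12 22:04:01,call_start
2007-10-12 22:04:03,call_end
"""

class_rows = remove_duplicate_timestamps(
    list(csv.DictReader(io.StringIO(CLASS_CSV)))
)
phone_rows = list(csv.DictReader(io.StringIO(PHONE_LOG_CSV)))
merged = join_on_timestamp(class_rows, phone_rows)
```

Duplicates are removed from the class rows before joining so that each timestamp contributes at most one class record to the merged result. Timestamps that appear in only one of the two files are dropped by the inner join.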