Extracted Data

Hello ODK Community,

I work for a public health NGO in Lusaka, Zambia, and have recently finished programming a very long, very complex survey.

When the data collected has few observations, it seems that the data is extracted in normal CSV columns.

However, when the level of data collection and field work intensifies to the normal, expected large amount of data collection, the extracted CSVs look very strange, with all the data in one column, separated by quotation marks and commas- the commas are not always singular, either, but often repeat strangely, i.e. "x",,,,,"y",,,,...

The analysis and research teams are at a loss as to what to do; the extracted data as is is difficult if not impossible to load into SAS. Is there a way to get ODK to extract the data normally? Any help would be greatly appreciated. Thank you so much!

Sincerely,
Jonathan

Excel is too smart for its own good. This is the problem:

If the delimiter character occurs between text qualifiers, Excel omits the
qualifiers in the imported value. If no delimiter character occurs between
text qualifiers, Excel includes the qualifier character in the imported
value. Hence, "Dallas Texas" (using the quotation mark text qualifier) is
imported into one cell as "Dallas Texas".

When Exporting CSV from ODK Aggregate, we are just blindly writing "value".
That gets imported as "value" when it should be imported as value.

There are two work-arounds:

(1) instead of using ODK Aggregate to export your data, use ODK Briefcase
to pull your data down to your local computer, and Export to CSV from
there, that CSV export emits the text qualifier (") only when required and
the file will import correctly into Excel (you still need to open Excel
first, then import -- you cannot double-click the csv file and have it read
automatically). This scales better and is the recommended approach.

or
(2) use an alternative spreadsheet program to import the data. e.g,

ยทยทยท On Tue, Mar 10, 2015 at 1:56 AM, wrote:

Hello ODK Community,

I work for a public health NGO in Lusaka, Zambia, and have recently
finished programming a very long, very complex survey.

When the data collected has few observations, it seems that the data is
extracted in normal CSV columns.

However, when the level of data collection and field work intensifies to
the normal, expected large amount of data collection, the extracted CSVs
look very strange, with all the data in one column, separated by quotation
marks and commas- the commas are not always singular, either, but often
repeat strangely, i.e. "x",,,,,"y",,,,...

The analysis and research teams are at a loss as to what to do; the
extracted data as is is difficult if not impossible to load into SAS. Is
there a way to get ODK to extract the data normally? Any help would be
greatly appreciated. Thank you so much!

Sincerely,
Jonathan

--

Post: opendatakit@googlegroups.com
Unsubscribe: opendatakit+unsubscribe@googlegroups.com
Options: http://groups.google.com/group/opendatakit?hl=en


You received this message because you are subscribed to the Google Groups
"ODK Community" group.
To unsubscribe from this group and stop receiving emails from it, send an
email to opendatakit+unsubscribe@googlegroups.com.
For more options, visit https://groups.google.com/d/optout.

--
Mitch Sundt
Software Engineer
University of Washington
mitchellsundt@gmail.com