Let say I have a HH survey with a repeat part. I can use $filter option to download only those submissions I need https://{{myserver}}/v1/projects/{{projectId}}/forms/{{xmlFormId}}.svc/Submissions?$filter=__system/submissionDate le 2020-01-31T23:59:59.999Z Now I need to download the correspondin…

Hi @Odil ! There's currently only limited support for filtering OData, particularly for filtering subtables. In general, we don't plan to support filtering on form fields (like age) in the near future. See this forum post for some of the reasoning behind that. However, we support filtering the prima…

Thank you @Matthew_White for these comments. It's helpful. I was trying to use $filter to download only the latest data. If we can use __system/submissionDate also for subtables in the new future that would be really great. For the time being, to download the latest data I will use OData document p…

Hi @Odil Depending on the amount of data that you expect in your repeat groups, it may also be an option to use the ?expand=* query parameter in order to obtain repeated data directly with the main submissions query. See: https://odkcentral.docs.apiary.io/#reference/odata-endpoints/odata-form-servi…

[image] vlehn: it may also be an option to use the ?expand=* query parameter Yes, you are right. But this is not a solution when you want to download each table separately.

FWIW for my ETL pipeline I download all submissions every day with ruODK::odata_submission_get, which also downloads media attachments to a local folder. Each run only adds new media attachments, and the actual data only a few seconds (we have 100k+ records, so nowhere near LSHTM scales). This appr…

Hi @Matthew_White , I am still trying to pull only fresh data from central in order to not download every hour all the data collected since march ( Propagate submission date to child tables - #3 by mathieubossaert ). As @Florian_May said, getting all data every day or every hour works fine, but I am …

@Matthew_White you can forget my question :wink: [image] Mathieu Bossaert: Am I wrong ? No. but each "parent" table contains the path to repeat tables. The "subtablename@odata.navigationLink" attribute/column gives me the information I need to pull the data form the subtable ! /v1/projects/…

As a alternative solution to filter data subtables , and as a continuation of this start-up , I now have a first version of a pyODK function that meets our needs (getting only fresh data from Central) For a given filter, it returns a dictionary containing for each "table" of the form (submission tabl…

A first version of this new set of PostgreSQL fonctions is out :slight_smile: [image] Pl-pyodk : use of pyODK with PL/Python PostgreSQL functions to pull data from Central into your own database Showcase Two years ago, I spent time during the third COVID lock-down working …

Using $filter with repeat tables to download OData

Support

mathieubossaert May 18, 2023, 5:41am 10

A first version of this new set of PostgreSQL fonctions is out