City Pulse (Pvt) Ltd. has launched the very first website containing very informative resource materials on Mobile based Data Collection and Management in Pakistan. Currently it presents background information on traditional vs mobile based data collection and benefits of mobile based data collection, its areas of application and conceptual stages in designing mobile based data collection and management system. More is expected to come soon.
Category Archives: Open Data Kit (ODK)
Using ODK Briefcase to fetch data from Aggregate Server
This tutorial has been written during the implementation of a mobile based data collection and management project in Charsadda, Pakistan.
ODK Briefcase is an integral part of mobile based data collection and management system. We have been using this simple but very useful tool to push and pull the collected data for our aggregate servers (a web based server holding data being sent from mobiles). ODK Briefcase can also be used to fetch data from ODK Collect when you are offline.
ODK Briefcase can be used for:
1. Export forms with submissions from ODK Aggregate – PULL
2. Make bulk submission to ODK Aggregate – PUSH
3. CSV exports of submission data
4. Pull forms and submissions collected by ODK Collect from a mobile phone
Use ODK Briefcase to fetch data from server:
1. Install Java 6 or higher on your computer.
2. Download ODK briefcase from http://code.google.com/p/opendatakit/ and open it.
3. Specify the location of the ODK storage area on your computer this will create the ODK Briefcase Storage folder which will hold all the blank forms and finalized forms. For example, we kept it to H:\CharsaddaData. It will automatically create a subfolder named ‘ODK Briefcase Storage’ there
4. On the tab “Pull,” click on the bar to the right of “Pull data from:” (top left) and choose “Aggregate 1.0”.
5. Click on “Connect” to pull data from aggregate
6. Specify the server url, enter the username and password to the server account. Click “Connect” to pull forms from server. In this case server address will be https://citypulsedata.appspot.com
7. In the main screen, find the form you want to download and tick off on the check box to the left of it. In Charsadda project case, you need to select ElectricityConsumer_RCA_F.
8. Hit “Pull” button at bottom right of page. It will start fetching the form and you will be able to see the progress. ODK Briefcase automatically see if any form is already downloaded in the given folder. In that case it will quickly go through those form and will fetch only the new records.
9. Wait until you see the “SUCCESS!” message on the “Pull status”.
Using ODK Briefcase to export data in csv format:
1. Go to “Export” tab. Pull down the bar to the right of “Form:” (top left) and choose the relevant form. In case of our example it will be ElectricityConsumer_RCA_F
2. Click on the “Choose” button to select the export directory. For example in current case its same H:\CharsaddaData
3. Click “Export” button on the bottom-most right of the application. This will load quickly and will give you the same “SUCCESS!” message.
4. Go to the directory where you placed the data (e.g. H:\CharsaddaData) and you will find a folder title media along with a csv file titled same as the form (e.g. ElectricityConsumer_RCA_F.csv). Use this csv file along with media folder for any further operations if necessary.
Mobile Data Collection with Preloaded data
Background:
Recently I was asked to design a mobile based data collection and management system to collect data of sugarcane growers in Pakistan. The objective was to develop a data collection and management system which can be used to collect data from sugar cane growers using android based devices. The collected data required to contain text, numeric and pictures along with geographic shapes (polygons) of the sugarcane fields.
The field enumerators were expected to have low education level so it was desired to make the mobile data collection as intuitive and user friendly as possible. The weather and field conditions were tough requiring considerations.
The collected data was to be audited by supervisors through a web interface where they wanted to view and verify the collected data as well as the location, shape and size of sugarcane field. They wanted to view the polygon of sugarcane filed overlaid on satellite image with the ability to modify coordinates of polygon shape.
Additionally, it was required to have some mechanism in which existing data of sugarcane growers is made available on mobile data collection devices so that enumerator does not need to fill in all data fields. Instead they can simply verify if the existing data was correct.
Form Design with Preloaded data:
We decided to base our work on Open Data Kit with customized data collection forms and reporting server. ODK Collect 1.4.3 allows the data preloading in new round of survey. We took advantage of that and created a survey form with associated database of existing grower information. Some key technical aspects in designing such forms include the following:
- Create a .csv file containing the data you want to use as pre-loaded in your questions. For example our csv name is SCGDV1.csv
- The .csv file must contain a column with name ending with “_key”. This column will be used for lookup. For example in our case we used “grower_id_key”
- The column names for other columns should also be short and unique.
- Create a simple form using ODK build or any other xml form builder of your choice
- Open the xml for in note pad or any other xml editor for advance changes
- Search for “<bind nodeset” and you will reach in the part of form containing data nodes
- Initially they will look like:
<bind nodeset="/data/grower_id" type="int"/>
<bind nodeset="/data/name" type="string" required="true()"/>
<bind nodeset="/data/father_name" type="string"/>
<bind nodeset="/data/nic" type="int" required="true()"/>
<bind nodeset="/data/land" type="int"/>
<bind nodeset="/data/location" type="geopoint"/>
- Add pulldata() function to desired nodesets where you want to have preloaded data.
- The syntax will be like
calculate="pulldata('SCGDV1', 'name', 'grower_id_key', /data/grower_id)"
calculate=”pulldata(‘SCGDV1’, ‘name’, ‘grower_id_key’, /data/grower_id)”
- Where SCGDV1 is name of csv file, “name” is the column heading whose value you want to pull against grower_id given in /data/grower_id while “grower_id_key” will be used for searching that name.
- Suppose you entered 15 as a grower_id in a question and you use pulldata() function to fetch name of the grower having id 15 from the csv file. So it will search for 15 in “grower_id_key” column and will find the corresponding name for that record and fill the Name question with what it found.
- pulldata() function is used with a calculate command with each nodeset and resultantly code looks like this
<bind nodeset="/data/grower_id" type="int"/>
<bind calculate="pulldata('SCGDV1', 'name', 'grower_id_key', /data/grower_id)" nodeset="/data/name" type="string" required="true()"/>
<bind calculate="pulldata('SCGDV1', 'father_name', 'grower_id_key', /data/grower_id)" nodeset="/data/father_name" type="string"/>
<bind calculate="number(pulldata('SCGDV1', 'nic', 'grower_id_key', /data/grower_id))" nodeset="/data/nic" type="int" required="true()"/>
<bind calculate="number(pulldata('SCGDV1', 'land', 'grower_id_key', /data/grower_id))" nodeset="/data/land" type="int"/>
<bind nodeset="/data/location" type="geopoint"/>
- Even when they are numbers, data fields pulled from a .csv file are considered to be text strings. Thus, you may sometimes need to use the int() or number() functions to convert a pre-loaded field into numeric form. In my case int() did not worked but number() works fine as it can be seen above. I had to use this function for each and every nodeset where data type was integer. Otherwise it gives error.
- Once form is complete, test is using ODK Validate and upload in your aggregate along with csv file. Deploy on your mobile and it works perfect.