Factly Data Devthon


The Prototyping Stage of the Public Data Devthon was conducted at T-Hub on the 7th and 8th of May. The event was supported by the Government of Telangana and T-Hub. Rakesh Kumar Dubbudu, founder of FACTLY and Co-convenor of NCPRI, was the curator for this edition of Devthon. The focus of this event was exploring the possibilities of Public Data.

Many of the participants were attending the Devthon for the first time. After a long wait, the event was finally about to start. By 10 AM, the hall was full of participants eager to learn and explore the possibilities of Open Public Data.

At 10 AM, Rakesh started by welcoming everyone present to the Public Data Devthon. He went on to explain the scope of Open Data in India. Harish Krishnan, founder of Devthon, took a few minutes to explain the goals of Devthon and how this event came to be. This event was conducted jointly by FACTLY and Devthon, with support from Manoj and Uday from FACTLY and Abraham from Devthon.

Then came the time to discuss the datasets that were going to be used. Rakesh helped the audience understand the five datasets:

GHMC data: Based on grievance data from GHMC for the last one year (April 2015 to March 2016), can we find the most common grievances, the officials who are overloaded with work, the areas from which the most grievances come, etc.?

Village Dashboard based on Maa Bhoomi Portal: Based on the Village level Pahani on the Maa Bhoomi portal, can we build a dashboard for micro level planning at a village level on land holding size, type of land, water sources, etc.?

NITI Aayog district data: Based on district data from NITI Aayog, can we build a dashboard for comparing districts on various parameters and find sister districts?

MNREGA Data: Based on the MNREGA works done in a district since the inception of the scheme, can we find outliers in terms of wage/material expenditure, works done, etc.?

PDS data: Based on closing balance reports and key register reports of FPS, can we throw light on the FPS that may be diverting food grain meant for the poor?


At 11 AM, all the participants were given the choice to join any of the five groups, each working with one of the five datasets. There was a round of introductions where each team member shared their name, experience and skill sets. After that, the teams jumped into the datasets on offer and discussed the basic ideas that each member had in mind.

But the interesting part was the unique perspective each team member had to offer. There were programmers, UX designers, database developers, journalists, activists and data enthusiasts who had come together to create diverse teams. A participant, Sanjay Y, shared, “I was able to meet people from divergent backgrounds and exchange ideas.”

GHMC Data Analysis Team
MNREGA Dataset Analysis Team
NITI Aayog District Dashboard Team
Village Dashboard (Maa Bhoomi Portal) Team
PDS Data Analysis Team

After two hours of brainstorming the different ideas that could be pursued, all teams took a break, and the discussions continued over lunch. When they came back at 2 PM, prototyping started in some teams while others were still finalizing their ideas. In each team, one or two members focused on cleaning the data and ensuring that it was uniform in structure. Rakesh and Srinivas moved from one team to the next, checking progress and offering feedback.



After two hours of sketching and tinkering, the teams reached their first checkpoint. They had to share the progress they had made and could ask for feedback from other teams. The experts at the venue chipped in with inputs to help the teams get better clarity on their goals. The representatives from each team stepped forward to explain the prototypes that they were building. There was excitement in the room as the participants listened to what the other teams were up to and looked for ideas that they could incorporate into their own prototypes.

As the checkpoint ended, the team members huddled together to discuss what changes they wanted to make to their prototype. Also, the responsibilities were divided equally among the team members. Each one started working on the part they were assigned and constantly consulted their team members for feedback. Two hours flew by and everyone was happy with the work that they were able to do in a single day.

Day 1 ended with the teams more than halfway through completing their prototypes. A participant, Vishal Pallerla, said, “This is a good initiation by Devthon team to let the government know the outstanding possibilities that we have with the existing data to make decisions that can help government reduce the wastage of resources.”


Day 2 started at 10 AM with teams working on the presentations that they would use to explain their prototypes. The work on the prototypes was also in full swing. Checkpoint #2 was at 1 PM, where the teams again shared their progress and took a final round of feedback. After lunch, the teams spent a good two hours refining their work and getting it ready for presentation.


At 3 PM, all the teams were ready for the presentation practice. They came forward to explain the goals that they were trying to achieve with their respective prototypes and the users who would benefit. They also elaborated on the need for each product and the impact it would have. The experts at the venue suggested some changes and corrections that were carefully noted by the team members. The teams had just two hours to make the final changes before the final presentation to the IT Secretary, Government of Telangana, Mr. Jayesh Ranjan.

At 5 PM, the stage was set for the outcomes of the event to be presented. The chief guest, Mr. Jayesh Ranjan, reached the venue. A short video was played to showcase the problem statements that were being tackled at this event.


Rakesh gave a brief introduction on the work put in by the teams and the goal of this Public Data Devthon. He also spoke about the datasets that were used and welcomed Mr. Jayesh Ranjan and Mr. Dileep Konatham.

Team #1 Village Dashboard with data on Maa Bhoomi Portal

The first team to present was the one using the data from the Village Pahani records. They created a dashboard that could show, compare and calculate figures from the Village Pahani records. It could calculate the average land holding per person in a village, the highest and lowest land holdings, the types of land, the water resources and the barren/cultivable land in a village. This dashboard could be used by the common man as well as by an official like the district collector to measure rural development metrics and find insights.
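The kind of calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the team's actual code: the input format (a list of holder and acreage pairs, one per plot), the function name `summarize_holdings` and the sample values are all hypothetical.

```python
from collections import defaultdict


def summarize_holdings(plots):
    """Summarize land holdings for one village.

    `plots` is a hypothetical list of (holder, acres) pairs, one per plot.
    A holder may own several plots, so acreage is totalled per holder first.
    """
    if not plots:
        raise ValueError("no plots given")

    per_holder = defaultdict(float)
    for holder, acres in plots:
        per_holder[holder] += acres

    return {
        "average": sum(per_holder.values()) / len(per_holder),
        # max/min over (holder, total) pairs, compared by total acreage
        "largest": max(per_holder.items(), key=lambda item: item[1]),
        "smallest": min(per_holder.items(), key=lambda item: item[1]),
    }


# Made-up plots for illustration only.
sample_plots = [("A", 2.0), ("B", 1.5), ("A", 1.0), ("C", 0.5)]
summary = summarize_holdings(sample_plots)
print(summary)
```

Totalling per holder before averaging matters because a Pahani-style record may list several plots for the same person.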

Team #2 GHMC Grievance Dataset Analysis

The team worked to visualize the complaints that the GHMC had received in the last year. Their visualization could show which areas registered the most complaints and which types of work were requested most often.

They were also able to build a demo of an app that would visualize the status of complaints in a particular region. This would be a handy tool for decision makers in the GHMC. They could easily monitor whether their efforts were showing positive results and make changes accordingly.

Team #3 PDS Dataset Analysis

The main problem that the team focused on was the identification of leakages in the Public Distribution System. To achieve this goal, they created visualizations that highlighted ration shops whose leftover food grain varied hugely from month to month. Using this visualization, supply variations across shops can easily be spotted. The correlation between supply and the number of ration cards can also be seen. This can be used by officials to find fraudulent practices and investigate these shops wherever required.
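One simple way to surface the month-to-month variation described above is to compute a variation score per shop and flag the shops that exceed a cut-off. The sketch below is an assumption-laden illustration, not the team's method: it uses the coefficient of variation as the score, and the shop IDs, balances, the `flag_shops` name and the 0.5 threshold are all made up.

```python
from statistics import mean, pstdev


def closing_balance_variation(balances):
    """Return the coefficient of variation (std dev / mean) of monthly balances."""
    avg = mean(balances)
    if avg == 0:
        return 0.0  # a shop that always closes at zero shows no variation
    return pstdev(balances) / avg


def flag_shops(monthly_balances, threshold=0.5):
    """Return the sorted IDs of shops whose variation exceeds the threshold.

    `monthly_balances` maps a shop ID to its list of monthly closing balances.
    The threshold is an arbitrary illustrative value, not an official norm.
    """
    return sorted(
        shop
        for shop, balances in monthly_balances.items()
        if closing_balance_variation(balances) > threshold
    )


# Made-up closing balances (in kg) for four consecutive months.
sample_balances = {
    "FPS-001": [100, 105, 98, 102],
    "FPS-002": [20, 300, 10, 250],
    "FPS-003": [0, 0, 0, 0],
}
flagged = flag_shops(sample_balances)
print(flagged)
```

A flag from a score like this is only a prompt for a closer look; a shop can show large swings for legitimate reasons.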

Team #4 District Dashboard using NITI Aayog Data

This team presented the visualizations they had created to analyze District Level Quality of Life Data. The visualizations compared districts on key sectors of development like health, education, water and electricity. Combining this information, they wished to rank districts on quality of life. These visualizations could help NGOs, activists, and district and state administrations find the deficiencies in key development areas and work towards filling the gaps.

Team #5 MNREGA Dataset Analysis

Team NREGA had two prototypes to showcase. The first one was a decision tree that would be used to check whether all the criteria were being met, and thereby whether the final goals of the NREGA scheme were being achieved. All the data would be checked against the four rules that were part of the decision tree, making it easy to analyze whether the scheme was being effective or not.

The team also created a visualization comparing wage and material expenditure. With the help of the visualization, they were able to clearly show the number of villages that deviated from the average. They found more than 300 villages that had spent more on materials than on wages, thereby defeating the whole purpose of the scheme. These visualizations could be used to analyze the way villages were using their NREGA funds and easily spot the places where they were being misused.
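The wage-versus-material check described above comes down to a simple comparison per village. The following is a minimal sketch, assuming a hypothetical record layout with `village`, `wages` and `material` fields; the names and figures are invented for illustration and do not come from the team's data.

```python
def material_share(wages, material):
    """Return the fraction of total expenditure that went to materials."""
    total = wages + material
    return material / total if total else 0.0


def material_heavy_villages(records):
    """Return the names of villages that spent more on materials than on wages."""
    return [r["village"] for r in records if r["material"] > r["wages"]]


# Made-up expenditure figures (in rupees) for illustration only.
sample_records = [
    {"village": "Village A", "wages": 600000, "material": 400000},
    {"village": "Village B", "wages": 300000, "material": 700000},
    {"village": "Village C", "wages": 500000, "material": 500000},
]
heavy = material_heavy_villages(sample_records)
print(heavy)
print(material_share(300000, 700000))
```

The same `material_share` value could also be compared against the average across villages to show how far each one deviates.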


Throughout the presentations, Mr. Jayesh shared his inputs and asked questions. After the presentations ended, he spoke about the importance of such an event, the government's interest in such ideas and their power to create better governance. He said he liked all the presentations and hoped they would be improved over further events like this. He wished the participants the best for their efforts in Open Data and promised them more such opportunities to implement their ideas.

The feedback that Mr. Jayesh gave was keenly noted by the participants. For the Village Pahani prototype, he pointed out the gaps and the nature of the deficiencies in the data. He also asked the team to look at other datasets that could be correlated with this data, like banking, electricity and crop productivity, for a more holistic understanding of rural development. He also suggested a pilot with multiple datasets to better understand its overall efficiency.

In the discussions with the GHMC team, the points that came up included the tracking and filing of closure reports. Resolution data also needed to be opened up as a next step. The grievance form on the website needed to be reordered, especially the address entry, to make it easier for users. There was also no facility for reopening a grievance ID, which was a big problem for users. He agreed with the team that finding correlations was going to be the main step forward in using open data to its maximum capacity.

For the PDS data, he spoke about the importance of overlaying vigilance data. He also suggested that social audits be conducted whenever spikes are observed so that such incidents can be avoided. With respect to the NREGA data, he spoke about the time of work availability and the importance of capturing all the outliers and presenting them later.


By the end of the event, the participants had been able to cover only a few, but very important, possibilities with the datasets that were available. There are many other possibilities waiting to be explored with more access to public data, and many correlations waiting to be discovered. The participants were excited at the prospect of more such events, where they would get access to more Public Data with which to implement many more of their ideas. Technology can be used in a big way to improve governance, and Public Data is one of the areas with the most impact.

You can join the Public Data Hyderabad Facebook Group to stay updated about all upcoming events and activities related to public data in Hyderabad.

By Abraham V