Case study

Unlocking the potential of structured datasets: establishing a data structure and providing data wrangling services for the upstream subsurface sedimentological reports of a major O&G company


The client successfully attained a structured format for its legacy sedimentological reports in less than 8 weeks, achieving a 60% increase in efficiency when accessing and searching for information.


The client faced challenges in accessing the information in its reports because most of the legacy reports existed only as non-searchable PDFs, images, and similar formats, which heavily affected its correlation, study, and research processes. The technical challenges stemmed from the volume (more than 200 documents) and from the information being spread across multiple document types of varying formats, originating from typewritten, machine-printed, and scanned documents. To enable users to proceed with their studies, the client needed the information to be identified, cleaned, extracted, validated, and structured before analytics could begin.


The client engaged Net Geometry for its DataGeometry-BPA Data Wrangling Services to structure the data residing in multiple legacy documents.

Following certified Project Management Institute (PMI) and Project Management Body of Knowledge (PMBOK) project phases and processes, and leveraging DataGeometry-BPA, we made the documents searchable, automatically identified the targeted information as defined by the SME, then digitized and extracted it and stored it in a structured database once it had been validated. The scope of digitization also covered the graphs, charts, and logs present in the reports.
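The identify, extract, validate, and store flow described above can be illustrated with a minimal Python sketch. It is not the DataGeometry-BPA implementation: the field names (`well_name`, `depth_m`, `lithology`), the text patterns, the `samples` table, and every function name are hypothetical, and the sketch assumes the report has already been converted to searchable text.

```python
import re
import sqlite3

# Hypothetical target fields; in practice the SME defines what to look for.
FIELD_PATTERNS = {
    "well_name": re.compile(r"Well\s*Name\s*:\s*(?P<value>[A-Za-z0-9\-]+)"),
    "depth_m": re.compile(r"Depth\s*:\s*(?P<value>\d+(?:\.\d+)?)\s*m\b"),
    "lithology": re.compile(r"Lithology\s*:\s*(?P<value>[A-Za-z]+)"),
}


def extract_fields(text):
    """Identify and extract each targeted field from searchable report text."""
    record = {}
    for name, pattern in FIELD_PATTERNS.items():
        match = pattern.search(text)
        record[name] = match.group("value") if match else None
    return record


def validate(record):
    """Return a list of problems; an empty list means the record passed."""
    problems = [f"missing {name}" for name, value in record.items() if value is None]
    if record.get("depth_m") is not None and float(record["depth_m"]) <= 0:
        problems.append("depth_m must be positive")
    return problems


def store(conn, doc_id, record):
    """Write a validated record to a structured (here: SQLite) table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS samples "
        "(doc_id TEXT, well_name TEXT, depth_m REAL, lithology TEXT)"
    )
    conn.execute(
        "INSERT INTO samples VALUES (?, ?, ?, ?)",
        (doc_id, record["well_name"], float(record["depth_m"]), record["lithology"]),
    )
    conn.commit()


def process(conn, doc_id, text):
    """Extract, validate, and store only if validation passes."""
    record = extract_fields(text)
    problems = validate(record)
    if not problems:
        store(conn, doc_id, record)
    return problems
```

In this sketch, records that fail validation are returned with their problems rather than stored, which leaves room for the SME review step before anything reaches the structured database.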

High-Level Scope of Work for Extraction:

High-Level Scope of Work for Digitization:

Business Impacts

DataGeometry-BPA’s solution has allowed the client’s team to make good use of the non-searchable, static data that had been trapped in the reports. It first makes the documents searchable, so that targeted information is easier to detect. This information is then extracted, verified, and stored in a structured format, which makes it easier to access. In scalability and performance, DataGeometry-BPA’s Data Wrangling Services significantly outperform manual processes.

  • Reduces the cost of the wrangling services by up to 30% compared with the manual process.
  • Reduces operational time for extraction by almost 60%; extraction for more than 200 documents was completed within 8 weeks.
  • Incorporates multistage verification involving the SME, which increases the accuracy and reliability of the extracted data.
  • Eliminates human errors during the review and extraction process.


    Keep in touch with us

    Suites C-5-16, Metropolitan Square Commerce, Damansara Perdana, 47820, Petaling Jaya, Selangor, Malaysia

    +60 37625 3153
