Machine Learning to Support Technical Document Indexing, a Case Study on Seismic Acquisition Reports

H. Blondelle; P. Neri; J. Micaelli

doi:10.3997/2214-4609.201801219

Machine Learning to Support Technical Document Indexing, a Case Study on Seismic Acquisition Reports
Authors H. Blondelle¹, P. Neri², J. Micaelli¹
View Affiliations Hide Affiliations

Affiliations: ¹ Agile Data Decisions ² Energistics
Publisher: European Association of Geoscientists & Engineers
Source: Conference Proceedings, 80th EAGE Conference and Exhibition 2018, Jun 2018, Volume 2018, p.1 - 5
DOI: https://doi.org/10.3997/2214-4609.201801219

Abstract

Summary

From the drill floor to the top floor, all exploration decisions are based on data. Today, industry standards formats proposed by the SEG or Energistics are structured and facilitate the transfer and archiving of the measurements, together with associated metadata. The xml formats proposed by Energistics such as WITSML™ also make it possible to stream the information in support of real-time decisions.

Nevertheless, to have a full understanding of the context of a survey, geoscientists still have to go to the acquisition reports. These reports are available in PDF or TIFF unstructured formats which are very difficult to index automatically at a large scale.

Various attempts to apply some deterministic data mining approaches have been disappointing due to the high variability of reports formats and layout styles.

In order to illustrate the potential of machine learning systems to index automatically subsurface related documents, we have built a learning models to detect 20 metadata items among seismic acquisition, QAQC, HSE and navigation reports. This has confirmed the capacity of ML to index on demand large volumes of documents. This also opens the possibility to extract data from unstructured documents prior to applying classical modelling or data analytic.

Article metrics loading...

/content/papers/10.3997/2214-4609.201801219

2018-06-11

2024-04-24

From This Site

/content/papers/10.3997/2214-4609.201801219

dcterms_title,dcterms_subject,pub_keyword

-contentType:Journal -contentType:Contributor -contentType:Concept -contentType:Institution

10

5

Full text loading...

References

Blinston, K., H.Blondelle
, 2017, Machine learning systems open up access to large volumes of valuable information lying dormant in unstructured documents:The Leading Edge. March 2017, p257–261
[Google Scholar]
Juneja, A., J.Micaelli and J.Johnston
, 2017, Method and system for extracting, verifying and cataloging technical information from unstructured documents: US patent 20170169103 A1, www.google.com/patents/US20170169103
[Google Scholar]
Su, F., et al.
, 2015, Attribute Extracting from Wikipedia Pages in Domain AutomaticallyinV. E.Balas, L. C.Jain, XZhao, eds., Information Technology and Intelligent Transportation Systems:Springer International Publishing, 433–440.
[Google Scholar]
Vapnik, V.N.
, 1999, An overview of statistical learning theory:IEEE Transactions On Neural Networks, 10, no. 5, 988–999, http://web.mit.edu/6.962/www/www_spring_2001/emin/slt.pdf.
[Google Scholar]
Zhong, B., J.Liu, Y.Du, Y.Liaozheng, and J.Pu
, 2016, Extracting attributes of named entity from unstructured text with deep belief network:International Journal of Database Theory and Application9.5, no. 5, 187–196, http://dx.doi.org/10.14257/ijdta.2016.9.5.19.
[Google Scholar]

http://instance.metastore.ingenta.com/content/papers/10.3997/2214-4609.201801219

Machine Learning to Support Technical Document Indexing, a Case Study on Seismic Acquisition Reports

Conference Proceedings 2018, 1 (2018); https://doi.org/10.3997/2214-4609.201801219

/content/papers/10.3997/2214-4609.201801219

Data & Media loading...

Most Cited This Month Most Cited RSS feed

- The natural combination of full and image‐based waveform inversion
  
  Authors Tariq Alkhalifah and Zedong Wu
- Poststack diffraction imaging using reverse‐time migration
  
  Authors Ilya Silvestrov, Reda Baina and Evgeny Landa
- Characterizing the effect of elastic interactions on the effective elastic properties of porous, cracked rocks
  
  Authors Luanxiao Zhao, Qiuliang Yao, De‐hua Han, Fuyong Yan and Mosab Nasser
- Fracture detection by Gaussian beam imaging of seismic data and image spectrum analysis
  
  Authors M.I. Protasov, G.V. Reshetova and V.A. Tcheverda
- Laboratory measurements of guided‐wave propagation within a fluid‐saturated fracture
  
  Authors Seiji Nakagawa, Shinichiro Nakashima and Valeri A. Korneev
More Less

Machine Learning to Support Technical Document Indexing, a Case Study on Seismic Acquisition Reports

Abstract

From This Site

Most Read This Month

Most Cited This Month Most Cited RSS feed

The natural combination of full and image‐based waveform inversion

Poststack diffraction imaging using reverse‐time migration

Characterizing the effect of elastic interactions on the effective elastic properties of porous, cracked rocks

Fracture detection by Gaussian beam imaging of seismic data and image spectrum analysis

Laboratory measurements of guided‐wave propagation within a fluid‐saturated fracture