Learning tree patterns for syntactic parsing

  • András Hócza

Abstract

This paper presents a method for parsing Hungarian texts using a machine learning approach. The method collects the initial grammar for a learner from an annotated corpus with the help of tree shapes. The PGS algorithm, an improved version of the RGLearn algorithm, was developed and applied to learning tree patterns with various phrase types described by regular expressions. The method also calculates the probability values of the learned tree patterns. The syntactic parser of learned grammar using the Viterbi algorithm performs a quick search for finding the most probable derivation of a sentence. The results were built into an information extraction pipeline.

Downloads

Download data is not yet available.
Published
2006-01-01
How to Cite
Hócza, A. (2006). Learning tree patterns for syntactic parsing. Acta Cybernetica, 17(3), 647-659. Retrieved from https://cyber.bibl.u-szeged.hu/index.php/actcybern/article/view/3689
Section
Regular articles

Most read articles by the same author(s)