Germanic Lexicon Project
Message Board


Author: Keith Briggs
Date: 2004-11-09 09:43:19
Subject: Re: Probabilistic correction

More tests, to show the effect of increasing the size of the training corpus:

Train the latin category on Caesar's Gallic Wars:
> dbacl -0 -q4 -l latin ../Latin/gall*

> db="dbacl -c c -c latin -vN"
> echo "ge-helmian ; p. ode, ede; pp. od, ed" | ${db}
c 100.00% latin 0.00%
> echo "To cover with a helmet, crown;" | ${db}
c 100.00% latin 0.00%
> echo "galeSre, coronare" | ${db}
c 32.41% latin 67.59%
> echo ":-- ÐÚ gehelmodest us" | ${db}
c 100.00% latin 0.00%
> echo "corSnasti nos," | ${db}
c 100.00% latin 0.00%
> echo "Ps. Spl. 5, 15. Of wuldre and weorþmynt ðú" | ${db}
c 100.00% latin 0.00%
> echo "gehelmedest hine" | ${db}
c 100.00% latin 0.00%
> echo "de gloria et hondre coronasti eum," | ${db}
c 99.03% latin 0.97%
> echo "de gloria et honore coronasti eum," | ${db}
c 38.45% latin 61.55%
> echo "Ps. Spl. T. 8, 6. Gehelmod " | ${db}
c 100.00% latin 0.00%
> echo "gáleátus,
Ælfc. Gr. 43; Som. 45, II." | ${db}
c 100.00% latin 0.00%
> echo "[Laym, i-helmed : O. H. Ger. gehelmot.]" | ${db}
c 100.00% latin 0.00%
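
The repeated echo-and-pipe steps above could be wrapped in a small loop. This is only a sketch: the function name `classify_lines` and the file `fragments.txt` are hypothetical, and it assumes one fragment per input line (so a multi-line fragment would have to be joined first). The classifier command is passed in as arguments rather than hard-coded.

```shell
# Hypothetical helper (not part of dbacl): run a classifier command on
# each line of a file and print "fragment<TAB>classifier output".
# $1 is the input file; the remaining arguments are the command to run.
classify_lines() {
  file=$1
  shift
  while IFS= read -r line; do
    # Feed the single line to the command on stdin, as the echo calls do.
    result=$(printf '%s\n' "$line" | "$@")
    printf '%s\t%s\n' "$line" "$result"
  done < "$file"
}
```

With the categories trained as above, a call might look like `classify_lines fragments.txt dbacl -c c -c latin -vN`.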

Add the Annals of Tacitus to the Latin training corpus:
> dbacl -h18 -0 -q4 -l latin ../Latin/gall* ../Latin/tac*

> db="dbacl -c c -c latin -vN"
> echo "ge-helmian ; p. ode, ede; pp. od, ed" | ${db}
c 100.00% latin 0.00%
> echo "To cover with a helmet, crown;" | ${db}
c 100.00% latin 0.00%
> echo "galeSre, coronare" | ${db}
c 58.05% latin 41.95%
> echo ":-- ÐÚ gehelmodest us" | ${db}
c 100.00% latin 0.00%
> echo "corSnasti nos," | ${db}
c 100.00% latin 0.00%
> echo "Ps. Spl. 5, 15. Of wuldre and weorþmynt ðú" | ${db}
c 100.00% latin 0.00%
> echo "gehelmedest hine" | ${db}
c 100.00% latin 0.00%
> echo "de gloria et hondre coronasti eum," | ${db}
c 98.23% latin 1.77%
> echo "de gloria et honore coronasti eum," | ${db}
c 17.58% latin 82.42%
> echo "Ps. Spl. T. 8, 6. Gehelmod " | ${db}
c 100.00% latin 0.00%
> echo "gáleátus,
Ælfc. Gr. 43; Som. 45, II." | ${db}
c 100.00% latin 0.00%
> echo "[Laym, i-helmed : O. H. Ger. gehelmot.]" | ${db}
c 100.00% latin 0.00%
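
To turn output lines like the ones above into a single decision, the category with the highest percentage can be picked with awk. This is a rough sketch: the name `best_category` is hypothetical, and it assumes each line has exactly the "name percent% name percent%" shape shown in these transcripts.

```shell
# Hypothetical helper: read lines such as "c 32.41% latin 67.59%" on stdin
# and print the category with the highest percentage for each line.
best_category() {
  awk '{
    best = ""; max = -1
    # Fields come in pairs: category name, then its percentage.
    for (i = 1; i < NF; i += 2) {
      p = $(i + 1)
      sub(/%$/, "", p)          # strip the trailing percent sign
      if (p + 0 > max) { max = p + 0; best = $i }
    }
    printf "%s %.2f%%\n", best, max
  }'
}
```

For example, `echo "c 17.58% latin 82.42%" | best_category` prints `latin 82.42%`.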

Messages in this thread (Subject, Name, College/University, Date):
Probabilistic correction Keith Briggs 2004-11-04 05:41:11
Re: Probabilistic correction Keith Briggs 2004-11-04 07:49:10
Re: Probabilistic correction Sean Crist Swarthmore College 2004-11-04 22:42:53
Re: Probabilistic correction Keith Briggs 2004-11-05 05:31:16
Re: Probabilistic correction Keith Briggs 2004-11-05 06:59:54
Re: Probabilistic correction Keith Briggs 2004-11-05 07:29:53
Re: Probabilistic correction Sean Crist Swarthmore College 2004-11-05 09:32:30
Re: Probabilistic correction Sean Crist Swarthmore College 2004-11-05 09:48:16
Re: Probabilistic correction Keith Briggs 2004-11-08 05:07:19
Re: Probabilistic correction Sean Crist Swarthmore College 2004-11-08 09:12:45
Re: Probabilistic correction Keith Briggs 2004-11-08 09:46:59
Re: Probabilistic correction Keith Briggs 2004-11-08 10:02:13
Re: Probabilistic correction Keith Briggs 2004-11-08 12:10:56
Re: Probabilistic correction Sean Crist Swarthmore College 2004-11-08 15:26:04
Re: Probabilistic correction Keith Briggs 2004-11-09 06:47:45
Re: Probabilistic correction Keith Briggs 2004-11-09 08:50:46
Re: Probabilistic correction Keith Briggs 2004-11-09 09:43:19
Re: Probabilistic correction Keith Briggs 2004-11-09 10:59:49
Italics (was: Probabilistic correction) Sean Crist Swarthmore College 2004-11-09 13:39:13
Re: Probabilistic correction Keith Briggs 2004-11-11 06:57:20