Glemt passord?
Registrer deg


Produktkategorier

Vis alle (992)

Kategorier

Vis alle(992)

Tidsskrifter

Bestill abonnement

Proceedings

A semantic proofreading tool for all languages based on a text repository

ForfattereKai A. Olsen, Bård Indredavik
InstitusjonMolde University College, University of Bergen, Oshaug Metall AS
PublikasjonNorsk informatikkonferanse (NIK)
Publiseringsdato2010-11-22
Sidetall intervall66-76
Generell lenkehttp://www.nik.no
ISBN/ISBN29788251927024/
ISSN/ISSN21892-0713 (trykk) / 1892-0721 (online)/
KategoriInformasjonsteknologi
RedaktørErik Hjelmås
UtgiverTapir Akademisk Forlag
Adresse utgiverBesøksadresse: Tapir Akademisk Forlag Nardoveien 12, Trondheim Postadresse: Tapir Akademisk Forlag Postboks 2461 Sluppen 7005 Trondheim
SpråkEnglish


Last ned (Gratis)



Abstrakt

A method for finding lexical, syntactic or semantic errors in text in any language is
introduced. It is based on a large text repository. The idea is to “follow everybody else”,
that is, to compare the sentence offered by the user to the similar sentences in the text
repository and suggesting alternative words when appropriate. This concept offers the
possibility of taking proofreading further than what is possible with standard spell- and
grammar checkers.
A prototype for such a system has been developed. This includes a spider that traverses
the Web and stores the retrieved text; a builder that creates a convenient index structure
for the text repository and the part that analyses the user’s sentence, offering
suggestions for improvement.

Referanser



Experiments of the Stanford Heuristic Programming Project, Addison-Wesley.


Dean, J. and Ghemawat, S. (2004) MapReduce: Simplified data processing on large
clusters. Proc. Sixth Symposium on Operating System Design and Implementation, San


Francisco, CA, Dec 6-8 http://labs.google.com/papers/mapreduce.html. Also in
Communications of the ACM, 2008, Vol. 51, No 1.


Dean, J. and Ghemawat, S. (2010) MapReduce: A Flexible Data Processing Tool,
Communications of the ACM, Vol. 53, No 1.


Dreyfus, Hubert (1972), What Computers Can\'t Do, New York: MIT Press


Dreyfus, Hubert (1992), What Computers Still Can\'t Do: A Critique of Artificial
Reason, New York: MIT Press


Hirschberg, S. (1975) A linear space algorithm for computing maximal common
subsequences. Communication of the ACM. 18(6) p 341-343.


Olsen, K.A., Williams, J. G. (2004). Spelling and Grammar Checking Using the Web as
a Text Repository, Journal of the American Society for Information Science and
Technology (JASIST), vol. 55, no 11.


Segaran, T. (2007) Programming Collective Intelligence, O´Reilly.


Powell, D. R., Allison, L., Dix, T. I. (1999). A versatile divide and conquer technique
for optimal string alignment. Inf. Proc. Lett. 70 p 127-139.







Forrige artikkel      Neste artikkel

Handlevogn

Handlevognen er tom



Tidsskrift: