summaryrefslogtreecommitdiffstats
path: root/languagetool/TODO
blob: 65dfdaee16637c29c84860170673edb360d7ada5 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
-move all tests from "tests/*" to real test cases
-whitespace errors are reported after all other errors are reported

German:
	-False alarms (i.e. sentences that are correct but that trigger an error message):
		-...
	-checks for upper/lowercase spelling would be nice (e.g. "Hin und Her")
	-neue  Fehlerregeln: zwei propositionen direkt hintereinander, z.B. in im
	-Wish: Python-Regel, die Pr�ft, ob ein Wort mit Bindestrich irgendwo im Text
	 auch ohne Bindestrich vorkomt. z.B. Kern-Kompotenz <-> Kernkompetenz: dann Warnung

English:
	-whitespace test fails to detect whitespace in fron of punctuation
	-make "testsuite.py all" run
	-"You will lost all changes." ->  this used to be "will[VM0] lost[VVD]",
		now it's "will[VM0] lost[AJ0]" -> error not found anymore
		(this is the missing error: <error from="73" to="77"><message>Modal verbs 
		like 'will'/'might'/... require base form of verb.</message></error>,
		see tests/entest7.txt)
		==> TaggerTest currently fails, is that related?!

Hungarian:
	-...


Issue #1

	Bei einem neuen Regel erkennt schlechte kombination nicht:

Diese sind schlecht nach rule fen2 und werden nicht erkannt:
S�t�l fen. Megy fen.

--------------------------------------------------------------------------------------
Die Regel:

��������<rule id="fen" name="fen es fenn keverese">
����������������<pattern lang="hu">"fen" (V1|V2|V3|IKV1|IKV2|IKV3|SI1|SI2|SI3|IKSI1|IKSI2|
IKSI3|V4|V5|V6|IKV4|IKV5|IKV6|SI4|SI5|SI6|IKSI4|IKSI5|IKSI6)</pattern>
����������������<message>fen es fenn keverese</message>
����������������<example type="correct">fen kest</example>
����������������<example type="incorrect">fen megy</example>
��������</rule>
��������<rule id="fen2" name="fen es fenn keverese">
����������������<pattern lang="hu"> (V1|V2|V3|IKV1|IKV2|IKV3|SI1|SI2|SI3|IKSI1|IKSI2|IKSI3|
V4|V5|V6|IKV4|IKV5|IKV6|SI4|SI5|SI6|IKSI4|IKSI5|IKSI6) "fen"</pattern>
����������������<message>fen es fenn keverese</message>
����������������<example type="correct">kest fen</example>
����������������<example type="incorrect">megy fen</example>
��������</rule>
--------------------------------------------------------------------------------------

Der test:
Fen megy �s fen s�t�l. S�t�l fen. Megy fen.

--------------------------------------------------------------------------------------

Das Ergebnis mit debug:

[en@Vigor13 languagetool]$ python TextChecker.py -l hu -d tests/hutest5.txt
Checking 'tests/hutest5.txt', file encoding latin1, language Hungarian:
Tw:[(u'Fen', u'Fen', u'V3'), (u' ', None, []), (u'megy', u'megy', u'V3'), (u' 
', None, []), (u'\xe9s', u'\xe9s', u'CNJ'), (u' ', None, []), (u'fen', 
u'fen', u'V3'), (u' ', None, []), (u's\xe9t\xe1l', u's\xe9t\xe1l', u'V3'), 
(u'', None, []), (u'.', None, None), (u' ', None, []), (u'', None, [])]
Tw:[(u'S\xe9t\xe1l', u'S\xe9t\xe1l', u'V3'), (u' ', None, []), (u'fen', 
u'fen', u'V3'), (u'', None, []), (u'.', None, None), (u' ', None, []), (u'', 
None, [])]
Tw:[(u'Megy', u'Megy', u'V3'), (u' ', None, []), (u'fen', u'fen', u'V3'), 
(u'', None, []), (u'.', None, None), (u' ', None, []), (u'', None, [])]
Tw:[('', None, [])]
Line 1, Column 0: fen es fenn keverese
� � � � ...Fen megy �s fen s�t�l. ...
Line 1, Column 12: fen es fenn keverese
� � � � ...Fen megy �s fen s�t�l. S�t�l fen. Me...
--------------------------------------------------------------------------------------

Issue #2:
Bei zu langen S�tzen ist die l�nge nicht der ganze Satz, nur der satzanfang 
(oder Ende). Zum anzeigen sollte aber der ganze Satz gef�rbt werden.

Line 68, Column 372: This sentence is 77 words long, which exceeds the 
configured limit of 60 words.
��������... szemben �s az i. el�tti MAHGAR...
<error from="8388" to="8389">This sentence is 77 words long, which exceeds the 
configured limit of 60 words.</error>
<error from="21979" to="21981">This sentence is 75 words long, which exceeds 
the configured limit of 60 words.</error>
-------------------------------------