漢字テクスト検索システムKR
広島平和科学 Volume 14
Page 129-186
published_at 1991
アクセス数 : 1008 件
ダウンロード数 : 82 件
今月のアクセス数 : 0 件
今月のダウンロード数 : 0 件
この文献の参照には次のURLをご利用ください : https://doi.org/10.15027/15208
File |
hps_14_131.pdf
1.67 MB
種類 :
fulltext
|
Title ( jpn ) |
漢字テクスト検索システムKR
|
Title ( eng ) |
KR : A retrieval system for Japanese texts
|
Creator |
Matsuo Masatsugu
|
Source Title |
広島平和科学
Hiroshima Peace Science
|
Volume | 14 |
Start Page | 129 |
End Page | 186 |
Journal Identifire |
[PISSN] 0386-3565
[EISSN] 2434-9135
[NCID] AN00213938
|
Abstract |
The present paper is an interim report of KR, a simple concording and word counting microcomputer program for full texts of Japanese, that is, texts represented in kanji, kana and other 2 byte symbols. The program was developed as part of a research project on 'Full Text Data Base of Documents of Atomic Bomb Damages', a report of which is also in this issue. The program, written in the C language for portability, is intended, first of all, as an easy and quick tool of searching and retrieving parts of a text where a given word or string apprears. In view of this, the usually time-consuming procedure of preparing Japanese texts,, which requires the delimitation of every word in the texts and the manual lemmmatization or explicit specification of the rules of lemmatization, are all drastically simplified. Users are expected only to prepare an MS-DOS text (or ASCII) files. The process of seraching conducted by a menu is also intended to be simple and quick. But, it cannot be so simple and easy because the program must satisfy many different reseach needs of users. Therefore, the program offers the following facilities as options. searching of a pair of terms, reordering, merging, and pairing of results of a search limiting of the range of search to part(s) or subtext(s) simultaneous counting and/or searching of more than one terms.
|
NDC |
Information science [ 007 ]
|
Language |
jpn
|
Resource Type | departmental bulletin paper |
Publisher |
広島大学平和科学研究センター
|
Date of Issued | 1991 |
Publish Type | Version of Record |
Access Rights | open access |
Source Identifier |
[ISSN] 0386-3565
[NCID] AN00213938
|