Proposed rule based syllable segmentation method is shown in figure 5. Myanmar characters of Web pages are detected and the relevance judgment of the web pages is determined by the proposed rule based syllable percentage threshold. Precision of the proposed crawler and n-gram based crawler Proposed crawler N-gram Based Crawler First run Second run First run Second run Correctly download as Myanmar pages Incorrectly download 11 6 32 as Myanmar pages No of pages Accuracy Table 7 shows the average percentage of precision for proposed crawler and ngram-based crawler were To collect the set Myanmar Web pages for search engine, crawlers, which traverses Web by following the hyperlinks and stored the download pages in a repository and used then by indexer component to index the web pages, are needed. Gathering the web pages manually for language specific search engine is not possible and realistic. Some of the general open source Web crawlers. It the web pages are relevant, store them in the pages repository in order to ready for indexer to extract the keywords of web pages.
© 2020 laboratoriosnatura.com - All rights reserved. All Models are over 21 y.o.