Wondering what’s next for npm?Check out our public roadmap! »

    crawler-html-3t

    1.0.10 • Public • Published

    Crawler Html 3T

    Đây là thư viện dùng để bóc tách dữ liệu html

    Installation

    npm install crawler-html-3t --save

    Usage

    Class ModelMongoose

    1. mod_sources
    • name_index
    • SourcesNews
    • Articles
    1. mod_baogom
    • name_index
    • mod_acticles
    • mod_links
    • mod_categories

    Class HtmlParser

    1. GetHtmlDoc
    • body: html
    • $: jquery
      GetHtmlDoc(url,function(error, body, $));
     

    Class HtmlExtract

    1. getTitle
     
     var title =  getTitle($);
     
    1. getDesc
     
     var description =  getDesc($);
     
    1. getImage
     
     var url_image =  getImage($);
     

    Class ReadRss

    1. getListFeed
     
    getListFeed(url_rss,function(error,list_feed));
     
    1. getListFeedByBodyXml
     
    getListFeedByBodyXml(bodyXml,function(error,list_feed));
     

    Install

    npm i crawler-html-3t

    DownloadsWeekly Downloads

    27

    Version

    1.0.10

    License

    none

    Last publish

    Collaborators

    • avatar