Class | WWW::Mechanize::PluggableParser |
In: | lib/www/mechanize/pluggable_parsers.rb |
Parent: | Object |
This class is used to register and maintain pluggable parsers for Mechanize to use.
A Pluggable Parser is a parser that Mechanize uses for a particular content type. Mechanize asks PluggableParser which class it should instantiate for a given content type. This class allows users to register their own pluggable parsers or replace existing ones.
PluggableParser returns a WWW::Mechanize::File object for content types that it does not know how to handle. WWW::Mechanize::File provides basic functionality for any content type, so it is a good class to extend when building your own parsers.
To create your own parser, create a class whose constructor takes four parameters: uri, response, body, and code. Here is an example of registering a pluggable parser that handles CSV files:
require 'csv'

class CSVParser < WWW::Mechanize::File
  attr_reader :csv

  def initialize(uri=nil, response=nil, body=nil, code=nil)
    super(uri, response, body, code)
    # Parse the response body into an array of rows.
    @csv = CSV.parse(body)
  end
end

agent = WWW::Mechanize.new
agent.pluggable_parser.csv = CSVParser
agent.get('http://example.com/test.csv')  # => CSVParser
Now any response with the content type ‘text/csv’ will be used to initialize a CSVParser, and that object is returned to the caller.
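To see what such a parser holds after construction, here is a minimal sketch that runs without Mechanize installed. StubFile is a hypothetical stand-in for WWW::Mechanize::File that mirrors only the four-argument constructor, and the CSV body is made-up sample data. The `body.to_s` guard is an addition for the sketch, not part of the example above.

```ruby
require 'csv'

# Hypothetical stand-in for WWW::Mechanize::File (assumption: only the
# four-argument constructor matters for this illustration).
class StubFile
  attr_reader :uri, :response, :body, :code

  def initialize(uri = nil, response = nil, body = nil, code = nil)
    @uri = uri
    @response = response
    @body = body
    @code = code
  end
end

class CSVParser < StubFile
  attr_reader :csv

  def initialize(uri = nil, response = nil, body = nil, code = nil)
    super(uri, response, body, code)
    # to_s turns a nil body into an empty string, which parses to no rows.
    @csv = CSV.parse(body.to_s)
  end
end

# Made-up sample body standing in for a fetched text/csv response.
page = CSVParser.new(nil, nil, "name,qty\nwidget,3\n", '200')
rows = page.csv
empty = CSVParser.new.csv
```

Each row comes back as an array of strings, so `rows.first` is the header row and numeric fields still need converting by the caller.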
To register a pluggable parser for a content type that PluggableParser does not know about, use the hash syntax:
agent.pluggable_parser['text/something'] = SomeClass
To set the default parser, assign to the ‘default’ attribute:
agent.pluggable_parser.default = SomeClass
Now responses with unknown content types will be returned as instances of SomeClass.
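The lookup and fallback behavior described above can be sketched in isolation. The following is a simplified re-creation of the registry logic from this page's source listings, not the library class itself. ParserRegistry, PageStub, and FileStub are hypothetical names used so the sketch runs without Mechanize.

```ruby
# Hypothetical stand-ins for Page, File, and a user-supplied parser.
class PageStub; end
class FileStub; end
class SomeClass; end

# Simplified re-creation of the registry (assumption: same lookup rule as
# the parser method shown in the source listing on this page).
class ParserRegistry
  attr_accessor :default

  def initialize
    @parsers = { 'text/html' => PageStub,
                 'application/xhtml+xml' => PageStub }
    @default = FileStub
  end

  # Registered class for the content type, else the default.
  def parser(content_type)
    content_type.nil? ? default : @parsers[content_type] || default
  end

  def []=(content_type, klass)
    @parsers[content_type] = klass
  end
end

registry = ParserRegistry.new
before = registry.parser('text/something')   # not registered yet

registry['text/something'] = SomeClass       # hash syntax
after = registry.parser('text/something')

registry.default = SomeClass                 # change the fallback
unknown = registry.parser('image/png')
missing = registry.parser(nil)               # no content type at all
```

Changing the default affects every unregistered content type, including responses that carry no content type at all, while explicit registrations such as ‘text/html’ are left untouched.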
CONTENT_TYPES = {
  :html  => 'text/html',
  :xhtml => 'application/xhtml+xml',
  :pdf   => 'application/pdf',
  :csv   => 'text/csv',
  :xml   => 'text/xml',
}
default | [RW] |
# File lib/www/mechanize/pluggable_parsers.rb, line 56
def initialize
  @parsers = { CONTENT_TYPES[:html] => Page,
               CONTENT_TYPES[:xhtml] => Page }
  @default = File
end
# File lib/www/mechanize/pluggable_parsers.rb, line 91
def [](content_type)
  @parsers[content_type]
end
# File lib/www/mechanize/pluggable_parsers.rb, line 95
def []=(content_type, klass)
  @parsers[content_type] = klass
end
# File lib/www/mechanize/pluggable_parsers.rb, line 83
def csv=(klass)
  register_parser(CONTENT_TYPES[:csv], klass)
end
# File lib/www/mechanize/pluggable_parsers.rb, line 70
def html=(klass)
  register_parser(CONTENT_TYPES[:html], klass)
  register_parser(CONTENT_TYPES[:xhtml], klass)
end
# File lib/www/mechanize/pluggable_parsers.rb, line 62
def parser(content_type)
  content_type.nil? ? default : @parsers[content_type] || default
end
# File lib/www/mechanize/pluggable_parsers.rb, line 79
def pdf=(klass)
  register_parser(CONTENT_TYPES[:pdf], klass)
end
# File lib/www/mechanize/pluggable_parsers.rb, line 66
def register_parser(content_type, klass)
  @parsers[content_type] = klass
end
# File lib/www/mechanize/pluggable_parsers.rb, line 75
def xhtml=(klass)
  register_parser(CONTENT_TYPES[:xhtml], klass)
end
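One detail in the listings is easy to miss: html= registers the class for both ‘text/html’ and ‘application/xhtml+xml’, while xhtml= touches only the latter, so the order of assignment matters. The sketch below re-creates just those writers to illustrate this. ShortcutRegistry and the stub classes are hypothetical names, not part of Mechanize.

```ruby
# Hypothetical stand-ins for parser classes.
class PageStub; end
class CustomHtml; end
class CustomXhtml; end

# Simplified re-creation of the shortcut writers (assumption: same bodies
# as html= and xhtml= in the source listings on this page).
class ShortcutRegistry
  CONTENT_TYPES = { :html  => 'text/html',
                    :xhtml => 'application/xhtml+xml' }

  def initialize
    @parsers = { CONTENT_TYPES[:html]  => PageStub,
                 CONTENT_TYPES[:xhtml] => PageStub }
  end

  def [](content_type)
    @parsers[content_type]
  end

  def register_parser(content_type, klass)
    @parsers[content_type] = klass
  end

  # Registers klass for both HTML and XHTML.
  def html=(klass)
    register_parser(CONTENT_TYPES[:html], klass)
    register_parser(CONTENT_TYPES[:xhtml], klass)
  end

  # Registers klass for XHTML only.
  def xhtml=(klass)
    register_parser(CONTENT_TYPES[:xhtml], klass)
  end
end

shortcuts = ShortcutRegistry.new
shortcuts.html = CustomHtml     # covers text/html and application/xhtml+xml
shortcuts.xhtml = CustomXhtml   # overrides the XHTML entry only

reversed = ShortcutRegistry.new
reversed.xhtml = CustomXhtml
reversed.html = CustomHtml      # overwrites the XHTML entry again
```

To use different parsers for HTML and XHTML, assign html= first and xhtml= second. In the reverse order, the later html= assignment replaces the XHTML registration.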