Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PoC for the event-based plugin system with YARD parsing plugin #1321

Draft
wants to merge 2 commits into
base: master
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
2 changes: 2 additions & 0 deletions lib/rdoc.rb
Original file line number Diff line number Diff line change
Expand Up @@ -208,4 +208,6 @@ def self.home
autoload :Extend, "#{__dir__}/rdoc/code_object/extend"
autoload :Require, "#{__dir__}/rdoc/code_object/require"

autoload :BasePlugin, "#{__dir__}/rdoc/base_plugin"
autoload :EventRegistry, "#{__dir__}/rdoc/event_registry"
end
20 changes: 20 additions & 0 deletions lib/rdoc/base_plugin.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,20 @@
module RDoc
class BasePlugin
# Register a literner for the given event

def self.listens_to(event_name, &block)
rdoc.event_registry.register(event_name, block)
end

# Activate the plugin with the given RDoc instance
# Without calling this, plugins won't work

def self.activate_with(rdoc = ::RDoc::RDoc.current)
@@rdoc = rdoc
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Using class variable for this means any later plugin calls activate_with with its own RDoc will apply that to all other plugins. I don't think this is what we want?

class Base
  @@name = "Base"

  def name
    @@name
  end

  def self.update_name(name)
    @@name = name
  end
end

class Foo < Base
end

class Bar < Base
end

Foo.update_name("Foo")
f = Foo.new

Bar.update_name("Bar")
b = Bar.new

puts f.name # => "Bar"
puts b.name # => "Bar"

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought it's safe to use a class variable here since we don't have to update rdoc object.
However, some information such as a name of the plugin, should be stored as a class instance variables.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

  • If we don't expect plugins to call this method, then I don't think it should be in the base class as it is.
  • It doesn't look like the yard plugin needs this attribute yet? If that's the case, let's not introduce this until we have a real use case to help us define the API better.
  • If we want to pass RDoc's states down to plugins, let's avoid passing the RDoc object unless we have a solid use case for it. There's a lot of unnecessary coupling between RDoc's major components and I have been untangling for a while (example). Exposing RDoc directly will potentially make it harder for such work as all of its attributes & methods will be considered public APIs.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You are right, I didn't notice that it's not used at all now. I used it at first, but store has enough information for the plugin so I removed it.
Then we can remove this class variable. And I agree, we should avoid RDoc::RDoc object (thank you for untangling it!)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah I'd expect we expose Options and Store objects first if we need to share states with plugins.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

https://github.com/ruby/rdoc/pull/1321/files#diff-2458b7fcdf31ccd44fa0681ba7e381360b3e4623c9f5409358222e4826c3c60dR6

Sorry it turned out that we do use rdoc object here because we need some global data store for event handling.
I still agree that we should avoid RDoc::RDoc instance, but then where should we put these event registration logic? Maybe we can introduce something like PluginManager and pass it to plugins?

end

def self.rdoc
@@rdoc
end
end
end
26 changes: 26 additions & 0 deletions lib/rdoc/event_registry.rb
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We already have a plugin system in RDoc::Markup::PreProcess.

Could you explain what is the merit of "the event-based plugin system" you suggested (and problems in the existing system)? Should existing RDoc::Markup::PreProcess be deprecated by this?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I believe that RDoc::Markup::PreProcess requires adding directives to existing code, and that is a problem. One of the advantages of a new plugin system is that just enabling plugins works without modifying existing code.

Should existing RDoc::Markup::PreProcess be deprecated by this?

Using directives has its own advantages, so I'd like to keep it.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems that RDoc::Markup::PreProcess has the "post_process" hook too:

##
# Adds a post-process handler for directives. The handler will be called
# with the result RDoc::Comment (or text String) and the code object for the
# comment (if any).
def self.post_process &block
@post_processors << block
end

Should we deprecate it by suggested plugin system?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

As far as I can see, this PreProcess works with directives, and this time it's not I want to work with. The goal of this PR is to introduce a way to make RDoc understand YARD without modifying existing code.
That said, we don't have to (or should not) deprecate post_process here since it's about directives and removing it will be backward-incompatible, and cannot be replaced by the plugin system in this PR.

Copy link
Member

@kou kou Mar 24, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm. It seems that the "post_process" hook doesn't depend on directives.

Could you try the following?

RDoc::Markup::PreProcess.post_process do |comment, code_object|
  if code_object.is_a?(RDoc::AnyMethod)
    puts "Parsing #{code_object.name}"
  end
end

It seems that the hook is called for all methods.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kou Yes it seems it calls hooks for every method. For YARD parsing purpose, this might work as well.

There are three issues though.

  1. The naming, PreProcess.post_process looks so weird
  2. It needs to modify code_object directly, resulting unexpected behaviors
  3. Low extensibility, it can work only with code_object and not with other things

As the name suggests, this API is limited to add other markups. So, I believe we can eventually deprecate this in favor our new plugin APIs.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I expect the plugin system to solve some maintainability problem in RDoc.
RDoc::Markup::PreProcess.post_process seems to be only used internally for TomDoc support. We don't need to deprecate it, but just remove and migrate TomDoc to use plugin.
We don't have enough practical plugin examples right now. It will help improving the plugin design and reveal what feature will be needed.

Supporting more YARD directives will will also helps it.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you update the PR description and clarifies the current plugin system problems and how to resolve them by the suggested plugin system in the PR description?

(I thought that I wrote this comment a few days ago but it was not written...)

Original file line number Diff line number Diff line change
@@ -0,0 +1,26 @@
module RDoc
class EventRegistry
EVENT_TYPES = %i[
rdoc_start
sample
rdoc_store_complete
]

attr_reader :environment

def initialize
@registry = EVENT_TYPES.map { |event_name| [event_name, []] }.to_h
@environment = {}
end

def register(event_name, handler)
@registry[event_name] << handler
end

def trigger(event_name, *args)
@registry[event_name].each do |handler|
handler.call(@environment, *args)
end
end
end
end
24 changes: 24 additions & 0 deletions lib/rdoc/options.rb
Original file line number Diff line number Diff line change
Expand Up @@ -268,6 +268,11 @@ class RDoc::Options

attr_accessor :pipe

##
# Currently enabled plugins

attr_reader :plugins

##
# Array of directories to search for files to satisfy an :include:

Expand Down Expand Up @@ -395,6 +400,7 @@ def init_ivars # :nodoc:
@coverage_report = false
@op_dir = nil
@page_dir = nil
@plugins = []
@pipe = false
@output_decoration = true
@rdoc_include = []
Expand Down Expand Up @@ -436,6 +442,7 @@ def init_with map # :nodoc:
@main_page = map['main_page']
@markup = map['markup']
@op_dir = map['op_dir']
@plugins = map['plugins']
@show_hash = map['show_hash']
@tab_width = map['tab_width']
@template_dir = map['template_dir']
Expand Down Expand Up @@ -503,6 +510,7 @@ def == other # :nodoc:
@main_page == other.main_page and
@markup == other.markup and
@op_dir == other.op_dir and
@plugins == other.plugins and
@rdoc_include == other.rdoc_include and
@show_hash == other.show_hash and
@static_path == other.static_path and
Expand Down Expand Up @@ -868,6 +876,12 @@ def parse argv

opt.separator nil

opt.on("--plugins=PLUGINS", "-P", Array, "Use plugins") do |value|
@plugins.concat value
end
Comment on lines +879 to +881
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We can use multiple --plugin to add multiple plugins.

Suggested change
opt.on("--plugins=PLUGINS", "-P", Array, "Use plugins") do |value|
@plugins.concat value
end
opt.on("--plugin=PLUGIN", "-P", "Use plugin") do |plugin|
@plugins << plugin
end

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm not sure which interface is better.

# This
rdoc --plugins yard_plugin,another_plugin

# Or that
rdoc --plugin yard_plugin --plugin another_plugin

Correct me if I'm wrong, but I think the later way is kind of rare.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What does "rare" mean here?
The latter is used in many cases. For example, ruby's -I is the latter interface. curl's --data/--header/... are also the latter interface.

The latter style is easyer to use from a script:

args=()
if [ "$USE_X_PLUGIN" = "yes" ]; then
  args+=(--plugin X_plugin)
fi
if [ "$USE_Y_PLUGIN" = "yes" ]; then
  args+=(--plugin Y_plugin)
fi
rdoc "${args[@]}" ...

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@kou Thank you for correcting me, after some investigation, I noticed that we have both styles in Ruby and other tools.
Personally I don't have a strong opinion here. For the future version we can consider YAML configuration as a primary way to enable plugins (like rubocop).
I'm not against using multiple --plugin option here.


opt.separator nil

opt.on("--tab-width=WIDTH", "-w", Integer,
"Set the width of tab characters.") do |value|
raise OptionParser::InvalidArgument,
Expand Down Expand Up @@ -1344,6 +1358,16 @@ def visibility= visibility
end
end

# Load plugins specified with options
# Currently plugin search logic is very simple, but it's not practical.
# TODO: We will improve this later.

def load_plugins
@plugins.each do |plugin_name|
require_relative "./#{plugin_name}.rb"
end
end

##
# Displays a warning using Kernel#warn if we're being verbose

Expand Down
10 changes: 10 additions & 0 deletions lib/rdoc/rdoc.rb
Original file line number Diff line number Diff line change
Expand Up @@ -71,6 +71,11 @@ class RDoc::RDoc

attr_accessor :store

##
# Event registry for RDoc plugins

attr_accessor :event_registry

##
# Add +klass+ that can generate output after parsing

Expand Down Expand Up @@ -105,6 +110,7 @@ def initialize
@options = nil
@stats = nil
@store = nil
@event_registry = ::RDoc::EventRegistry.new
end

##
Expand Down Expand Up @@ -449,6 +455,9 @@ def document options
end
@options.finish

::RDoc::BasePlugin.activate_with(self)
@options.load_plugins
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we load plugins dynamically and use require, plugins will not be loaded to multiple RDoc instance.

Do we need to load plugins for each RDoc::RDoc#document?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Agree, initialize is a better place?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

#initialize is better than #document but do we need to load plugins for each RDoc instance?
Can we load plugins only once in one process?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I cannot see that using multiple RDoc::RDoc instance is common. From CLI or Rake task, I think we have only one instance of RDoc::RDoc.
However, we might have multiple instances in the future, probably for concurrency.
So then, where is the ideal place to activate plugins? I through it's RDoc::Rdoc since it's in the execution path anyway.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

gem install A B C creates a RDoc::RDoc for each package.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I see, that's pretty common...
Would we want separate plugins and their configurations for each RDoc::RDoc instance? That said, if we want to isolate each environment, loading plugins for each instance is fine. If not, then we can load plugins all at once at somewhere else (with RDoc module maybe?)

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Who does configure RDoc plugins?

If each package have RDoc plugins configuration, we may need per package plugins configuration. If users configure RDoc plugins, we may be able to share the same plugins configuration.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I thought only users will configure RDoc plugins, but noticed that each gem can configure RDoc along with its plugins with rdoc_options.

args = @spec.rdoc_options

And I guess that's why we instantiate an RDoc instance for each package in the first place. So loading plugins for each package might be necessary.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We may not need to require for registering a plugin. We may just need to instantiate a plugin for it. For example:

module RDoc
  class YardPlugin < BasePlugin
    def on_rdoc_store_complete(store)
      # ...
    end
  end

  class RDoc
    def document(options)
      # ...
      plugins = options.plugins.collect do |name|
        instantiate_plugin(name) # name.capitalize.constantize.new or associating name with class or something...
      end
      # ...
      plugins.each do |plugin|
        plugin.on_rdoc_store_complete(@store) if plugin.respond_to?(:on_rdoc_store_complete)
      end
      # ...
    end
  end
end


@store = RDoc::Store.new(@options)

if @options.pipe then
Expand All @@ -469,6 +478,7 @@ def document options
@options.default_title = "RDoc Documentation"

@store.complete @options.visibility
@event_registry.trigger :rdoc_store_complete, @store

@stats.coverage_level = @options.coverage_report

Expand Down
228 changes: 228 additions & 0 deletions lib/rdoc/yard_plugin.rb
Original file line number Diff line number Diff line change
@@ -0,0 +1,228 @@
# Yard type parser is inspired by the following code:
# https://github.com/lsegal/yard-types-parser/blob/master/lib/yard_types_parser.rb

require_relative 'base_plugin'
require 'strscan'

module RDoc
class YardPlugin < BasePlugin
listens_to :rdoc_store_complete do |env, store|
store.all_classes_and_modules.each do |cm|
cm.each_method do |meth|
puts "Parsing #{meth.name}"
parsed_comment = Parser.new(meth.comment.text).parse
# meth.params = parsed_comment.param.map(&:to_s).join("\n")
meth.comment.text = parsed_comment.plain.join("\n")
Comment on lines +14 to +15
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Plugins should update the given objects destructively?
How to extend RDoc by a plugin when existing RDoc objects don't have a place to store additional data? In this case, how to use the parsed type information?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes I consider this and don't have a confident conclusion now.
I think there are two ways, one is using env object and another is modifying existing objects (meth in his case).
Using env would be safer, and storing type information would look like:

# In a plugin
env[meth] = DataFromYardPlugin.new(type: type)

# In a generator
type = env[meth].type

Here, DataFromYardPlugin contains various information along with type.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If there are multiple plugins that want to attach additional information to meth, the approach may be fragile. (One plugin may overwrite env[meth] set by another plugin.)

Can we provide more safer mechanism?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think there are two ways of overriding: intentional and unintentional.

If the overriding is unintentional, it should be prevented. It can be done with something like below:

require 'logger'

module RDoc
  class Environment
    def initialize
      @data = {}
      @logger = Logger.new($stderr)
      @logger.level = Logger::INFO
      @hooks = {}
    end

    def [](key)
      @data[key]
    end

    def []=(key, value)
      old_value = @data[key]
      @data[key] = value
      @logger.debug("Set env[#{key.inspect}] = #{value.inspect}")
      trigger_hooks(key, old_value, value)
    end

    def delete(key)
      @logger.debug("Delete env[#{key.inspect}]")
      old_value = @data.delete(key)
      trigger_hooks(key, old_value, nil)
    end

    def keys
      @data.keys
    end

    def fetch(key, default = nil)
      val = @data.fetch(key, default)
      @logger.debug("Fetch env[#{key.inspect}] => #{val.inspect}")
      val
    end

    def to_h
      @data.dup
    end

    def logger
      @logger
    end

    def on_key_set(key, &block)
      (@hooks[key] ||= []) << block
    end

    private

    def trigger_hooks(key, old_value, new_value)
      return unless @hooks[key]
      @hooks[key].each do |hook|
        begin
          hook.call(key, old_value, new_value, self)
        rescue => e
          @logger.error("Error in hook for key #{key.inspect}: #{e.message}")
          raise
        end
      end
    end
  end
end

env = RDoc::Environment.new
env[:foo] = 'foo'
env.on_key_set(:foo) { raise 'Do not override :foo }
env[:foo] = 'bar' # => Error

In this approach, preventing override is a responsibility of those who want to do so. In other words, this is not automatic. This is good because if it allows other plugins to override it, it just leaves it overridable.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I feel that attaching information to a code object is a common pattern for RDoc plugins.
(In your example, attaching type information to each method object.)

If the conflict is a rare situation, your approach may work. But if it's a common pattern, your approach will be inconvenient.

Do you think whether attaching information to a code object is a common pattern or not?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ah, sorry. "attaching information to a code object" means both of object.ADDTIONAL_INFORMATION = XXX and env[object] = XXX cases. It's not related to "how to implement".

For example, if we use object.metadata[:types] = types (object.metadata returns a Hash) approach, we can attach multiple information to the object such as object.metadata[:types] and object.metadata[:my_property]. If we use env[object] = XXX approach, we can't attach multiple information to the object because it has only one space. But they are "how to implement". For example, we can use env[[object, :types]] with the env approach.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you for clarification, then yes, extracting information from somewhere like comment string and attaching it to code objects would be typical for plugins.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

OK. Then env[object] = XXX and env.on_key_set(object) { raise "Do not override #{object.inspect}" } will be inconvenient.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, preventing override by default is an option.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If the prevention happen, what should we (users? plugin developers) do as the next action?
Do we change key for env from env[object] = ... to something? What key should be used for the case?

end
end
end

class Parser
ParamData = Struct.new(:type, :name, :desc, keyword_init: true) do
def append_desc(line)
self[:desc] += line
end

def to_s
"Name: #{self[:name]}, Type: #{self[:type].map(&:to_s).join(' or ')}, Desc: #{self[:desc]}"
end
end
ReturnData = Struct.new(:type, :desc, keyword_init: true)
RaiseData = Struct.new(:type, :desc, keyword_init: true)
ParsedComment = Struct.new(:param, :return, :raise, :plain)

TAG_PARSING_REGEXES = {
param: /
@param\s+
(?: # Match either of the following:
\[(?<type1>[^\]]+)\]\s+(?<name1>\S+)\s*(?<desc1>.*)? | # [Type] name desc
(?<name2>\S+)\s+\[(?<type2>[^\]]+)\]\s*(?<desc2>.*)? # name [Type] desc
)
/x,
return: /@return\s+\[(?<type>[^\]]+)\]\s*(?<desc>.*)?/,
raise: /@raise\s+\[(?<type>[^\]]+)\]\s*(?<desc>.*)?/
}
def initialize(comment)
@comment = comment
@parsed_comment = ParsedComment.new([], nil, [], [])
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It seems that we can use a local variable not an instance variable for this.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, thank you for pointing out.

@mode = :initial
@base_indentation_level = 0 # @comment.lines.first[/^#\s*/].size
end

def parse
@comment.each_line do |line|
current_indentation_level = line[/^#\s*/]&.size || 0
if current_indentation_level >= @base_indentation_level + 2
Comment on lines +54 to +55
Copy link
Member

@kou kou Mar 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this work when the first line is # ...?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Honestly this indentation-related implementation is not perfect and will fail for many edge cases. I'll definitely add test cases for this.

# Append to the previous tag
data = @mode == :param ? @parsed_comment[@mode].last : @parsed_comment[@mode]
data.append_desc(line)
else
if (tag, matchdata = matching_any_tag(line))
if tag == :param
type = matchdata[:type1] || matchdata[:type2]
name = matchdata[:name1] || matchdata[:name2]
desc = matchdata[:desc1] || matchdata[:desc2]
parsed_type = TypeParser.parse(type)
@parsed_comment[:param] << ParamData.new(type: parsed_type, name: name, desc: desc)
@mode = :param
Copy link
Member

@tompng tompng Mar 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Storing previous_tag here instead of storing mode = :param, appending part will be simple

tag = ParamData.new(type: parsed_type, name: name, desc: desc)
@parsed_comment[:param] << tag
previous_tag = tag
# Append to the previous tag
- data = @mode == :param || @mode == :raise ? @parsed_comment[@mode].last : @parsed_comment[@mode]
- data.append_desc(line)
+ previous_tag.append_desc(line)

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Awesome, I'll take this!

elsif tag == :return
type = matchdata[:type]
desc = matchdata[:desc]
parsed_type = TypeParser.parse(type)
@parsed_comment[:return] = ReturnData.new(type: parsed_type, desc: desc)
@mode = :return
elsif tag == :raise
type = matchdata[:type]
desc = matchdata[:desc]
parsed_type = TypeParser.parse(type)
@parsed_comment[:raise] << RaiseData.new(type: parsed_type, desc: desc)
@mode = :raise
end
else
@parsed_comment[:plain] << line
end
end
@base_indentation_level = current_indentation_level
end

@parsed_comment
end

private

def matching_any_tag(line)
TAG_PARSING_REGEXES.each do |tag, regex|
matchdata = line.match(regex)
return [tag, matchdata] if matchdata
end
nil
end
end

class Type
attr_reader :name

def initialize(name)
@name = name
end

def to_s
@name
end
end

class CollectionType < Type
attr_reader :type

def initialize(name, type)
super(name)
@type = type
end

def to_s
"#{@name}<#{@type}>"
end
end

class FixedCollectionType < Type
attr_reader :type

def initialize(name, type)
super(name)
@type = type
end

def to_s
"#{@name}(#{@type})"
end
end

class HashCollectionType < Type
attr_reader :key_type, :value_type

def initialize(name, key_type, value_type)
super(name)
@key_type = key_type
@value_type = value_type
end

def to_s
"#{@name}<#{@key_type} => #{@value_type}>"
end
end

class TypeParser
TOKENS = {
collection_start: /</,
collection_end: />/,
fixed_collection_start: /\(/,
fixed_collection_end: /\)/,
type_name: /#\w+|((::)?\w+)+/,
literal: /(?:
'(?:\\'|[^'])*' |
"(?:\\"|[^"])*" |
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think some regexps are wrong.
"\" (escaped, quote is not closed) matches to this string literal regexp.
Integer and Float literal(integer part) matches to type_name: /#\w+|((::)?\w+)+/

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh I didn't notice this. I tested this code with some variations of type and it seems working. Tests are missing and we need to add them to ensure this regex working correctly since it's quite complex.

:[a-zA-Z_][a-zA-Z0-9_]*|
\b(?:true|false|nil)\b |
\b\d+(?:\.\d+)?\b
)/x,
type_next: /[,;]/,
whitespace: /\s+/,
hash_collection_start: /\{/,
hash_collection_next: /=>/,
hash_collection_end: /\}/,
parse_end: nil
}

def self.parse(string)
new(string).parse
end

def initialize(string)
@scanner = StringScanner.new(string)
end

def parse
types = []
type = nil
fixed = false
name = nil
loop do
found = false
TOKENS.each do |token_type, match|
if (match.nil? && @scanner.eos?) || (match && token = @scanner.scan(match))
found = true
case token_type
when :type_name, :literal
raise SyntaxError, "expecting END, got name '#{token}'" if name
name = token
when :type_next
raise SyntaxError, "expecting name, got '#{token}' at #{@scanner.pos}" if name.nil?
unless type
type = Type.new(name)
end
types << type
type = nil
name = nil
when :fixed_collection_start, :collection_start
name ||= "Array"
klass = token_type == :collection_start ? CollectionType : FixedCollectionType
type = klass.new(name, parse)
when :hash_collection_start
name ||= "Hash"
type = HashCollectionType.new(name, parse, parse)
when :hash_collection_next, :hash_collection_end, :fixed_collection_end, :collection_end, :parse_end
raise SyntaxError, "expecting name, got '#{token}'" if name.nil?
unless type
type = Type.new(name)
end
types << type
return types
end
end
end
raise SyntaxError, "invalid character at #{@scanner.peek(1)}" unless found
end
end
end
end
end
1 change: 1 addition & 0 deletions test/rdoc/test_rdoc_options.rb
Original file line number Diff line number Diff line change
Expand Up @@ -75,6 +75,7 @@ def test_to_yaml
'markup' => 'rdoc',
'output_decoration' => true,
'page_dir' => nil,
'plugins' => [],
'rdoc_include' => [],
'show_hash' => false,
'static_path' => [],
Expand Down
Loading