Class: TTFunk::Subset::Unicode8Bit

Inherits:

Base

Object
Base
TTFunk::Subset::Unicode8Bit

show all

Defined in:: lib/ttfunk/subset/unicode_8bit.rb

Overview

An 8-bit Unicode-based subset. It can include any Unicode character but limits number of characters so that the could be encoded by a single byte.

Constant Summary

Constants inherited from Base

Base::MICROSOFT_PLATFORM_ID, Base::MS_SYMBOL_ENCODING_ID

Instance Attribute Summary

Attributes inherited from Base

#original

Instance Method Summary collapse

#covers?(character) ⇒ Boolean
Can this subset include the character?.
#from_unicode(character) ⇒ Integer
Get character code for Unicode codepoint.
#includes?(character) ⇒ Boolean
Does this subset actually has the character?.
#initialize(original) ⇒ Unicode8Bit constructor
A new instance of Unicode8Bit.
#new_cmap_table ⇒ TTFunk::Table::Cmap
Get cmap table for this subset.
#original_glyph_ids ⇒ Array<Integer>
Get the list of Glyph IDs from the original font that are in this subset.
#to_unicode_map ⇒ Hash{Integer => Integer}
Get a mapping from this subset to Unicode.
#unicode? ⇒ true
Is this a Unicode-based subset?.
#use(character) ⇒ void
Add a character to subset.

Methods inherited from Base

#collect_glyphs, #encode, #encoder_klass, #glyphs, #microsoft_symbol?, #new_to_old_glyph, #old_to_new_glyph, #unicode_cmap

Constructor Details

#initialize(original) ⇒ `Unicode8Bit`

Returns a new instance of Unicode8Bit.

Parameters:

original (TTFunk::File)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 12
def initialize(original)
  @subset = { 0x20 => 0x20 }
  @unicodes = { 0x20 => 0x20 }
  @next = 0x21 # apparently, PDF's don't like to use chars between 0-31

Instance Method Details

#covers?(character) ⇒ `Boolean`

Can this subset include the character?

Parameters:

character (Integer) —
Unicode codepoint

Returns:

(Boolean)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 49
def covers?(character)
  @unicodes.key?(character) || @next < 256

#from_unicode(character) ⇒ `Integer`

Get character code for Unicode codepoint.

Parameters:

character (Integer) —
Unicode codepoint

Returns:

(Integer)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 65
def from_unicode(character)
  @unicodes[character]

#includes?(character) ⇒ `Boolean`

Does this subset actually has the character?

Parameters:

character (Integer) —
Unicode codepoint

Returns:

(Boolean)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 57
def includes?(character)
  @unicodes.key?(character)

#new_cmap_table ⇒ `TTFunk::Table::Cmap`

Get cmap table for this subset.

Returns:

(TTFunk::Table::Cmap)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 72
def new_cmap_table
  @new_cmap_table ||=
    begin
      mapping =
        @subset.each_with_object({}) do |(code, unicode), map|
          map[code] = unicode_cmap[unicode]
        end
      # since we're mapping a subset of the unicode glyphs into an
      # arbitrary 256-character space, the actual encoding we're
      # using is irrelevant. We choose MacRoman because it's a 256-character
      # encoding that happens to be well-supported in both TTF and
      # PDF formats.
      TTFunk::Table::Cmap.encode(mapping, :mac_roman)

#original_glyph_ids ⇒ `Array<Integer>`

Get the list of Glyph IDs from the original font that are in this subset.

Returns:

(Array<Integer>)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 94
def original_glyph_ids
  ([0] + @unicodes.keys.map { |unicode| unicode_cmap[unicode] }).uniq.sort

#to_unicode_map ⇒ `Hash{Integer => Integer}`

Get a mapping from this subset to Unicode.

Returns:

(Hash{Integer => Integer})

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 29
def to_unicode_map
  @subset.dup

#unicode? ⇒ `true`

Is this a Unicode-based subset?

Returns:

(true)

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 22
def unicode?

#use(character) ⇒ `void`

This method returns an undefined value.

Add a character to subset.

Parameters:

character (Integer) —
Unicode codepoint

Source Codelib/ttfunk/subset/unicode_8bit.rb, line 37
def use(character)
  unless @unicodes.key?(character)
    @subset[@next] = character
    @unicodes[character] = @next
    @next += 1

12	def initialize(original)
13	super
14	@subset = { 0x20 => 0x20 }
15	@unicodes = { 0x20 => 0x20 }
16	@next = 0x21 # apparently, PDF's don't like to use chars between 0-31
17	end

49	def covers?(character)
50	@unicodes.key?(character) \|\| @next < 256
51	end

72	def new_cmap_table
73	@new_cmap_table \|\|=
74	begin
75	mapping =
76	@subset.each_with_object({}) do \|(code, unicode), map\|
77	map[code] = unicode_cmap[unicode]
78	map
79	end
80
81	# since we're mapping a subset of the unicode glyphs into an
82	# arbitrary 256-character space, the actual encoding we're
83	# using is irrelevant. We choose MacRoman because it's a 256-character
84	# encoding that happens to be well-supported in both TTF and
85	# PDF formats.
86	TTFunk::Table::Cmap.encode(mapping, :mac_roman)
87	end
88	end

94	def original_glyph_ids
95	([0] + @unicodes.keys.map { \|unicode\| unicode_cmap[unicode] }).uniq.sort
96	end

37	def use(character)
38	unless @unicodes.key?(character)
39	@subset[@next] = character
40	@unicodes[character] = @next
41	@next += 1
42	end
43	end

Class: TTFunk::Subset::Unicode8Bit

Overview

Constant Summary

Constants inherited from Base

Instance Attribute Summary

Attributes inherited from Base

Instance Method Summary collapse

Methods inherited from Base

Constructor Details

#initialize(original) ⇒ Unicode8Bit

Instance Method Details

#covers?(character) ⇒ Boolean

#from_unicode(character) ⇒ Integer

#includes?(character) ⇒ Boolean

#new_cmap_table ⇒ TTFunk::Table::Cmap

#original_glyph_ids ⇒ Array<Integer>

#to_unicode_map ⇒ Hash{Integer => Integer}

#unicode? ⇒ true

#use(character) ⇒ void

#initialize(original) ⇒ `Unicode8Bit`

#covers?(character) ⇒ `Boolean`

#from_unicode(character) ⇒ `Integer`

#includes?(character) ⇒ `Boolean`

#new_cmap_table ⇒ `TTFunk::Table::Cmap`

#original_glyph_ids ⇒ `Array<Integer>`

#to_unicode_map ⇒ `Hash{Integer => Integer}`

#unicode? ⇒ `true`

#use(character) ⇒ `void`