The NetBSD Project

CVS log for pkgsrc/converters/orcus/

[BACK] Up to [] / pkgsrc / converters / orcus

Request diff between arbitrary revisions

Default branch: MAIN

Revision 1.12 / (download) - annotate - [select for diffs], Sun Oct 15 07:09:50 2023 UTC (4 months, 2 weeks ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2023Q4-base, pkgsrc-2023Q4, HEAD
Changes since 1.11: +2 -3 lines
Diff to previous 1.11 (colored)

orcus: Update to 0.18.1

sax parser
 * added support for optionally skipping multiple BOM's in the beginning of
   XML stream. This affects all XML-based file format filters such as
   xls-xml (aka Excel 2003 XML).
 * fixed a bug where XML documents consisting of simple single-column
   records were not properly converted to sheet data.
 * fixed a bug where the filter would always pass border color even when
   it was not set.
 * added new configure switches --without-benchmark and --without-doc-example
   to optinally skip building of these two directories.

 * fixed the flat output mode to properly calculate the lengths of UTF-8
   encoded strings.
 * replaced all uses of std::strtol() to parse_integer() to properly
   parse strings that are not necessarily null-terminated.
 * added a new output format type 'debug-state' which dumps the internal
   state of the populated document model in detail. This can be useful
   during debugging.
 * separated the import_shared_string interface implementation from the
   backend shared strings store per separation of responsibility.
 * merged the foo_t and foo_active_t struct pair, such as font_t and
   font_active_t, in the styles store into a single type using std::optional.
 * revised the documentation and public API and cleaned things up where
 * re-implemented the number format styles import to correctly keep
   track of element stacks and correctly perform structure checks to
   detect malformed documents.
   * added new interface to import named styles applied to columns.
   * added new interface to import attributes for asian and complex
     scripts for the folloiwng font attributes:
     * font name
     * font size
     * font style
     * font weight
   * re-designed the styles import interface to make it multi-level.
   * re-worked the import of the style:text-underline-width attribute
     to make its handling more in line with the specifications.
   * added support for importing wrap-text and shrink-to-fit cell format
   * added support for importing cell-hidden and locked attributes.
   * added support for importing direct and named cell formats applied
     to columns and rows.
   * added support for importing wrap-text and shrink-to-fit cell format
   * added support for importing direct and named cell formats applied to
     columns and rows.
   * added a new interface to pass the encoding information to the document
     model so that it can correctly decode non-UTF-8-encoded string values.

Revision 1.11 / (download) - annotate - [select for diffs], Mon Aug 14 05:23:58 2023 UTC (6 months, 2 weeks ago) by wiz
Branch: MAIN
CVS Tags: pkgsrc-2023Q3-base, pkgsrc-2023Q3
Changes since 1.10: +2 -1 lines
Diff to previous 1.10 (colored)

*: recursive bump for Python 3.11 as new default

Revision 1.10 / (download) - annotate - [select for diffs], Sat Feb 5 04:14:31 2022 UTC (2 years ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2023Q2-base, pkgsrc-2023Q2, pkgsrc-2023Q1-base, pkgsrc-2023Q1, pkgsrc-2022Q4-base, pkgsrc-2022Q4, pkgsrc-2022Q3-base, pkgsrc-2022Q3, pkgsrc-2022Q2-base, pkgsrc-2022Q2, pkgsrc-2022Q1-base, pkgsrc-2022Q1
Changes since 1.9: +2 -2 lines
Diff to previous 1.9 (colored)

orcus: Update to 0.17.2

orcus 0.17.2

* ods

  * fixed a bug where the state of style:cell-protect="none" was not
    explicitly pushed, thereby having had the same effect as not having this
    attribute.  After the fix, style:cell-protect="none" will explicitly push
    the hidden state to false, locked state to false, and the formula-hidden
    state to false.

orcus 0.17.1

* general

  * addressed a number of coverity issues.

  * removed a variety of compiler warnings.

* ods

  * re-generated sax parser tokens from ODF v1.3.

  * revised the style import code to only push style attributes that are
    actually specified in the XML.

* xls-xml

  * revised the XML structure validation strategy to ignore any mis-placed
    elements and their sub structures rather than aborting the import.

orcus 0.17.0

* general

  * set the baseline C++ version to 17.

  * cleaned up the public API to replace pstring with std::string_view, union
    with std::variant, and boost::optional with std::optional.  With this
    change, the public API no longer has dependency on boost.

* spreadsheet document

  * switched to using ixion::model_iterator for horizontal iteration of cells
    instead of using mdds::mtv::collection.

  * fixed a bug where exporting a spreadsheet document containing adjacent
    merged cells regions to html incorrectly exported the merged cell areas.

* xlsx

  * cached cell values are now correctly loaded from the file.

* sax parser

  * utf-8 names are now allowed as element and attribute names.

* css parser

  * unquoted utf-8 property values are now allowed.

* orcus-json

  * fixed segmentation fault when using --mode structure with the Windows

  * added yaml output option.

* xml-map

  * fixed a bug where mapping of an XML document with namespace aliases
    sometimes corrupts the alias values.

* python

  * added orcus.FormulaTokenOp enum type which describes type formula token
    operator types in a more finer grained manner.

* documentation

  * added notes to how to use orcus-xml and orcus-json to map XML and JSON
    documents to spreadsheet documents.

Revision 1.9 / (download) - annotate - [select for diffs], Sat Feb 6 06:29:04 2021 UTC (3 years ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2021Q4-base, pkgsrc-2021Q4, pkgsrc-2021Q3-base, pkgsrc-2021Q3, pkgsrc-2021Q2-base, pkgsrc-2021Q2, pkgsrc-2021Q1-base, pkgsrc-2021Q1
Changes since 1.8: +2 -2 lines
Diff to previous 1.8 (colored)

orcus: Update to 0.16.1

    fixed a build issue on 32-bit linux platforms, which was
    indirectly caused by ixion.

    fixed json parsing bug caused by an uninitialized variable,
    which manifested itself on debian 32-bit platform.

    removed compiler warnings on unused variables from the base
    parser handlers.

	full formula recalculations are now optional when loading
	documents. It makes more effective use of cached formula

	added the option of failing on the first faulty cell, or
	skipping them.

	fixed a bug that caused the threaded_sax_token_parser to

	added base parser handler classes in the public headers so
	that they can be sub-classed to overwrite necessary handler

	parsing of numeric values are now more strict for better
	conformance to the specs.

	added support for loading named expressions from ods

	fixed an infinite loop when loading one of the attached
	ods documents from

	fixed a segfault when loading the xlsx document from

	fixed a bug that prevented formulas from referencing cells
	located in later sheets.

	adjusted the xml path expressions to be more like XPath.
	Previously, an attribute was expressed as '@' in the old
	expression, but XPath uses '/@'. The new expression uses
	'/@' for an attribute.

	added the ability to identify and import ranges from XML
	documents without map file.

	added the ability to generate map file from XML documents
	for user customization.

	added support to specify default namespace in the map file.

	added orcus.Cell class to represent individual cell values
	and attributes.

	fixed several memory leaks in the python binding layer.

	modified function to take string input,
	instead of bytes.

	added __version__ attribute to the orcus module.

	cleaned up orcus.detect_format function to only take the
	stream parameter.

	added named_expressions properties to Document and Sheet
	class objects.

	added Python API to bulk-process a number of spreadsheet
	documents (

	added Python API to download attachments from bugzilla
	services via REST API (

Revision 1.8 / (download) - annotate - [select for diffs], Wed Feb 5 13:54:29 2020 UTC (4 years ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2020Q4-base, pkgsrc-2020Q4, pkgsrc-2020Q3-base, pkgsrc-2020Q3, pkgsrc-2020Q2-base, pkgsrc-2020Q2, pkgsrc-2020Q1-base, pkgsrc-2020Q1
Changes since 1.7: +2 -2 lines
Diff to previous 1.7 (colored)

orcus: Update to 0.15.3

orcus 0.15.3

* xml-map

  * fixed another bug related to filling of cells down the column in a linked
    range with nested repeat elements.  The bug would occur when the field in
    a linked range is more than one level deeper than the nearest row group

* xls-xml

  * fixed a bug where TopCell and LeftCell attributes of the Table element
    were not properly honored.

orcus 0.15.2

* xml-map

  * fixed a bug that prevented filling of cells down the column in a linked
    range with nested repeat elements.  The bug would occur when the field in
    a linked range is associated with an element content rather than an

* xls-xml

  * added code to properly pick up and pass the number format codes, including
    named number format values such as 'General Date', 'Long Time, 'Currency'

* fixed a build issue on older macOS environment, related to passing an rvalue
  to a tuple expecting a const reference.  The root cause was a bug in libc++
  of LLVM < 7.

* fixed a build issue with gcc5.

orcus 0.15.1

* switched xml_map_tree to using boost::object_pool to manage the life
  cycles of the objects within xml_map_tree, to avoid memory

* fixed incorrect handling of newly created elements in xml_map_tree.

* fixed segfault caused by double deletion of allocated memory for
  xml_map_tree::element, which seemed to happen only on 32-bit gcc builds.

* fixed weird test failures related to equality check of two double-precision
  values, caused probably by aggressive compiler optimization which only seems
  to get triggered in 32-bit gcc builds.

orcus 0.15.0

* spreadsheet interface

  * import_sheet::fill_down_cells() has been added as a required method, to
    allow the import filter code to duplicate cell value downward in one step.

* json parser

  * added test cases from JSONTestSuite.

  * fixed a bug on parsing an empty array containing one or more blank
    characters between the brackets.

* sax parser

  * fixed a bug on parsing an attribute value with encoded character
    immediately followed by a ';', such as '&amp;;'.

  * fixed a bug on parsing an assignment character '=' that either preceded or
    followed by whitespaces in attribute definition.

  * optionally use SSE4.2 intrinsics to speed up element name parsing.

* orcus-xml

  * revised its cli interface to make use of boost's program_options.

  * orcus-xml-dump's functionality has been combined into orcus-xml.

  * map mode now supports nested repeat elements to be mapped as range fields.

* orcus-json

  * map mode has been added to allow mapping of JSON documents to spreadsheet
    document model.  This mode either takes explicit mapping rule via map
    file, or performs automatic mapping by auto-identifying mappable ranges by
    analyzing the structure of the JSON document.

  * structure mode has been added to display the logical structures of JSON

  * significantly improved performance of json document tree by utilizing
    object pool to manage the life cycles of json value instances.

* xls-xml

  * added support for importing named color values in the ss:Color attributes.

  * added support for handling UTF-16 streams that contains byte order marks.

* spreadsheet document

  * significantly improved performance of flat format output generation.

* internal

  * string_pool now uses boost's object_pool to manage the instances of stored

  * file_content class has been added to memory-map file contents instead of
    loading them in-memory.

  * memory_content class has been added to map in-memory buffer with the
    optional ability to perform unicode conversion.

  * dom_tree has been renamed to dom::document_tree, and its interface has
    been cleaned up to hide its implementation details.

Revision 1.7 / (download) - annotate - [select for diffs], Fri Feb 22 06:21:26 2019 UTC (5 years ago) by gutteridge
Branch: MAIN
CVS Tags: pkgsrc-2019Q4-base, pkgsrc-2019Q4, pkgsrc-2019Q3-base, pkgsrc-2019Q3, pkgsrc-2019Q2-base, pkgsrc-2019Q2, pkgsrc-2019Q1-base, pkgsrc-2019Q1
Changes since 1.6: +2 -2 lines
Diff to previous 1.6 (colored)

orcus: bump dependency version in

libreoffice, the sole package that presently depends on orcus, requires a
minimum version of 0.14 now. (And, for that matter, the orcus change log
indicates that there have been incompatible API changes.) Bump
accordingly here. (If an older version of orcus was present, the
current version of libreoffice would simply have failed to build.)

Revision 1.6 / (download) - annotate - [select for diffs], Sat Feb 3 00:04:10 2018 UTC (6 years ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2018Q4-base, pkgsrc-2018Q4, pkgsrc-2018Q3-base, pkgsrc-2018Q3, pkgsrc-2018Q2-base, pkgsrc-2018Q2, pkgsrc-2018Q1-base, pkgsrc-2018Q1
Changes since 1.5: +2 -4 lines
Diff to previous 1.5 (colored)

converters/orcus: import orcus-0.13.2

Standalone file import filter library for spreadsheet documents.

This package contains the 0.13 branch of the library.

Revision 1.5, Mon Feb 6 07:10:00 2017 UTC (7 years ago) by wiz
Branch: MAIN
Changes since 1.4: +1 -1 lines

Remove orcus, unused.

Revision 1.4 / (download) - annotate - [select for diffs], Tue Dec 6 15:13:20 2016 UTC (7 years, 2 months ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2016Q4-base, pkgsrc-2016Q4
Changes since 1.3: +2 -2 lines
Diff to previous 1.3 (colored)

Update to 0.11.2

2016-05-11  Kohei Yoshida  <>  [d6084fe1771052e516ecfb270cb24dd9917a1895]

        Up the version to 0.11.2.

2016-05-11  Kohei Yoshida  <>  [70fd8327c94b27a99e2c7800e91c13e5099cceda]

        Make it buildable with mdds-1.2.

Revision 1.3 / (download) - annotate - [select for diffs], Tue Sep 8 19:40:59 2015 UTC (8 years, 5 months ago) by wiz
Branch: MAIN
CVS Tags: pkgsrc-2016Q3-base, pkgsrc-2016Q3, pkgsrc-2016Q2-base, pkgsrc-2016Q2, pkgsrc-2016Q1-base, pkgsrc-2016Q1, pkgsrc-2015Q4-base, pkgsrc-2015Q4, pkgsrc-2015Q3-base, pkgsrc-2015Q3
Changes since 1.2: +2 -2 lines
Diff to previous 1.2 (colored)

Update to 0.9.2:

No Changelog found. Major API change -- 0.10 in directory names instead
of 0.8 before.

Revision 1.2 / (download) - annotate - [select for diffs], Mon Oct 13 21:43:20 2014 UTC (9 years, 4 months ago) by ryoon
Branch: MAIN
CVS Tags: pkgsrc-2015Q2-base, pkgsrc-2015Q2, pkgsrc-2015Q1-base, pkgsrc-2015Q1, pkgsrc-2014Q4-base, pkgsrc-2014Q4
Changes since 1.1: +2 -2 lines
Diff to previous 1.1 (colored)

Update to 0.7.0

* Change to 0.8.0 branch
* Change license to mpl-2.0 from mit

Add some more formats.

Revision 1.1 / (download) - annotate - [select for diffs], Sat Aug 24 16:12:10 2013 UTC (10 years, 6 months ago) by jaapb
Branch: MAIN
CVS Tags: pkgsrc-2014Q3-base, pkgsrc-2014Q3, pkgsrc-2014Q2-base, pkgsrc-2014Q2, pkgsrc-2014Q1-base, pkgsrc-2014Q1, pkgsrc-2013Q4-base, pkgsrc-2013Q4, pkgsrc-2013Q3-base, pkgsrc-2013Q3

Packaged converters/orcus, a library that deals with spreadsheet documents
(libreoffice dependency).

This form allows you to request diff's between any two revisions of a file. You may select a symbolic revision name using the selection box or you may type in a numeric name using the type-in text box.

CVSweb <>