Age | Commit message | Author
2016-07-14 | tests: Add tests for latin1 detection | Arseny Kapoulkine
2016-07-14 | Implement encoding detection by name. | Arseny Kapoulkine
This adds about 40 cycles for parsing <?xml version='1.0'?> declaration and about 70 cycles for parsing <?xml version='1.0' encoding='utf-8'?>, as measured on a Core i7, which should be negligible for all documents. Fixes #16.
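As a rough illustration of what detection by name involves, the sketch below reads the encoding attribute from the XML declaration and maps it to a known encoding. It is not the pugixml implementation: the names encoding_guess, iequals and guess_encoding_by_name are hypothetical, and the set of recognized names (utf-8, iso-8859-1, latin1) is an assumption made for this example.

```cpp
#include <cctype>
#include <cstddef>
#include <string>

// Hypothetical result type; not part of the pugixml API.
enum class encoding_guess { unknown, utf8, latin1 };

// Case-insensitive comparison of two ASCII strings.
static bool iequals(const std::string& a, const std::string& b)
{
    if (a.size() != b.size()) return false;

    for (std::size_t i = 0; i < a.size(); ++i)
        if (std::tolower(static_cast<unsigned char>(a[i])) != std::tolower(static_cast<unsigned char>(b[i])))
            return false;

    return true;
}

// Hypothetical helper: inspects only the <?xml ... ?> declaration at the
// start of the buffer and maps the declared encoding name to a guess.
encoding_guess guess_encoding_by_name(const std::string& doc)
{
    if (doc.compare(0, 5, "<?xml") != 0) return encoding_guess::unknown;

    std::size_t end = doc.find("?>");
    if (end == std::string::npos) return encoding_guess::unknown;

    // The attribute must appear inside the declaration itself.
    std::size_t key = doc.find("encoding", 5);
    if (key == std::string::npos || key > end) return encoding_guess::unknown;

    // The value is delimited by matching single or double quotes.
    std::size_t open = doc.find_first_of("'\"", key);
    if (open == std::string::npos || open > end) return encoding_guess::unknown;

    std::size_t close = doc.find(doc[open], open + 1);
    if (close == std::string::npos || close > end) return encoding_guess::unknown;

    std::string name = doc.substr(open + 1, close - open - 1);

    if (iequals(name, "utf-8")) return encoding_guess::utf8;
    if (iequals(name, "iso-8859-1") || iequals(name, "latin1")) return encoding_guess::latin1;

    return encoding_guess::unknown;
}
```

A declaration without an encoding attribute yields unknown, so a caller would fall back to whatever detection it used before.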
2016-07-10 | Makefile: Add -Wcast-qual | Arseny Kapoulkine
It is probably redundant given that we have -Wold-style-cast, but it's better to warn about casts like this in case we ever need to remove the latter flag.
2016-06-18 | Only enable cxx_long_long_type if it is in the list of features | Arseny Kapoulkine
Fixes #99.
2016-06-10 | Switch to regular AppVeyor image | Arseny Kapoulkine
2016-06-10 | Add CMakeLists to 'make release' now that it's top-level | Arseny Kapoulkine
2016-06-10 | Merge pull request #90 from jippeholwerda/master | Arseny Kapoulkine
Put CMakeLists.txt in the project root.
2016-05-15 | Merge pull request #97 from HumMan/master | Arseny Kapoulkine
Add vs2013 projects
2016-04-24 | Add vs2013 projects | Alexsey Zazvonnykh
2016-04-14 | Adjust XML allocation pages to have the exact specified size | Arseny Kapoulkine
Previously the page size was defining the data size, and due to additional headers (+ recently removed allocation padding) the actual allocation was a bit bigger. The problem is that some allocators round 2^N+k allocations to 2^N+M, which can result in noticeable waste of space. Specifically, on 64-bit OSX allocating the previous page size (32k+40) resulted in 32k+512 allocation, thereby wasting 472 bytes, or 1.4%. Now we have the allocation size specified exactly and just recompute the available data size, which can result in small space savings depending on the allocator.
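The arithmetic behind this change can be sketched as follows. The page_header struct and the function names are hypothetical stand-ins, and the header layout is an assumption; only the 32k page size and the 32k+40 / 32k+512 figures come from the commit message.

```cpp
#include <cstddef>

// Hypothetical page header; the real pugixml layout differs.
struct page_header
{
    void* allocator;
    std::size_t busy_size;
    std::size_t freed_size;
    page_header* prev;
    page_header* next;
};

// The configured page size (32k in the commit message).
const std::size_t page_size = 32768;

// Old scheme: the page size is the data size, so the request sent to the
// allocator is slightly larger than a power of two.
inline std::size_t old_allocation_size(std::size_t data_size)
{
    return sizeof(page_header) + data_size;
}

// New scheme: the page size is the exact allocation size, and the space
// left for data is whatever remains after the header.
inline std::size_t new_data_size(std::size_t allocation_size)
{
    return allocation_size - sizeof(page_header);
}

// Bytes lost when the allocator rounds a request up to a larger block.
inline std::size_t wasted_bytes(std::size_t requested, std::size_t rounded)
{
    return rounded - requested;
}
```

With the figures quoted above, wasted_bytes(32768 + 40, 32768 + 512) is 472, which is about 1.4% of the rounded allocation.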
2016-04-14 | Remove extra space in an empty tag for format_raw | Arseny Kapoulkine
When using format_raw the space in the empty tag (<node />) is the only character that does not have to be there; so format_raw almost results in a minimal XML but not quite. It's pretty unlikely that this is crucial for any users - the formatting change should be benign, and it's better to improve format_raw than to add yet another flag. Fixes #87.
2016-04-03 | Refactor auto_deleter now that we only need to support one signature | Arseny Kapoulkine
Also rename auto_deleter_fclose to close_file.
2016-04-03 | Merge pull request #92 from QUSpilPrgm/master | Arseny Kapoulkine
Do not assume that fclose can be converted to int(*)(FILE*)
2016-03-27 | Fix AppVeyor build | Arseny Kapoulkine
2016-03-24 | Do not assume that fclose can be converted to int(*)(FILE*) because some compilers use a special calling convention for stdlib functions like fclose | QUSpilPrgm
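One way to avoid the conversion is to wrap fclose in an ordinary function with a known signature and pass that wrapper to a scope guard. The sketch below illustrates the idea only; the close_file and auto_deleter names follow the commit messages above, but the bodies are assumptions and not a copy of the pugixml source.

```cpp
#include <cstdio>

// Ordinary function with the default calling convention. Calling fclose
// directly works on any compiler; taking its address as int(*)(FILE*) may not.
inline int close_file(std::FILE* file)
{
    return std::fclose(file);
}

// Minimal scope guard supporting a single deleter signature (a sketch).
template <typename T> struct auto_deleter
{
    typedef int (*deleter_type)(T*);

    T* data;
    deleter_type deleter;

    auto_deleter(T* data_, deleter_type deleter_): data(data_), deleter(deleter_)
    {
    }

    // Runs the deleter unless ownership was given up via release().
    ~auto_deleter()
    {
        if (data) deleter(data);
    }

    // Hands the pointer back to the caller and disarms the guard.
    T* release()
    {
        T* result = data;
        data = 0;
        return result;
    }

private:
    // Non-copyable: two guards must not own the same pointer.
    auto_deleter(const auto_deleter&);
    auto_deleter& operator=(const auto_deleter&);
};
```

A caller would write something like auto_deleter<std::FILE> guard(file, close_file); so that the file is closed on every exit path.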
2016-02-25 | Change paths according to new CMakeLists.txt location. | Jippe Holwerda
2016-02-25 | Put CMakeLists.txt in the project root. | Jippe Holwerda
Having CMakeLists.txt in the project root makes it so much easier to use pugixml as an external dependency in another CMake project.
2016-02-13 | scripts: Update license year | Arseny Kapoulkine
2016-02-13 | scripts: Rename CocoaPods script to match NuGet naming | Arseny Kapoulkine
Also remove top-level LICENSE file since .podspec already has it.
2016-02-13 | Merge pull request #85 from igagis/master | Arseny Kapoulkine
scripts: Add CocoaPods package
2016-02-03 | embed license to podspec, as LICENSE file is not in 1.7 tag, cocoapods errors otherwise. Get sources from zeux github | Ivan Gagis
2016-02-03 | LICENSE, podspec | Ivan Gagis
2016-02-02 | Refactor conversion from integer to string | Arseny Kapoulkine
Unify the implementations by automatically deducing the unsigned type from its signed counterpart. That allows us to use a templated function instead of duplicating code.
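The idea of deducing the unsigned type can be sketched with a single template. This example assumes C++11 and uses std::make_unsigned for the deduction; pugixml itself also targets older compilers, so its own mechanism is likely different. The function name integer_to_string is hypothetical.

```cpp
#include <string>
#include <type_traits>

// Converts any built-in integer type to its decimal representation.
template <typename T> std::string integer_to_string(T value)
{
    // Deduce the unsigned counterpart of T (T itself if already unsigned).
    typedef typename std::make_unsigned<T>::type U;

    const bool negative = value < 0;

    // Negate in unsigned arithmetic so the most negative value does not overflow.
    U rest = negative ? static_cast<U>(U(0) - static_cast<U>(value)) : static_cast<U>(value);

    char buf[64];
    char* end = buf + sizeof(buf);
    char* result = end;

    // Emit digits from least to most significant, filling the buffer backwards.
    do
    {
        *--result = static_cast<char>('0' + rest % 10);
        rest /= 10;
    }
    while (rest);

    if (negative) *--result = '-';

    return std::string(result, end);
}
```

Because the digit loop only ever sees an unsigned value, the same code path serves int, long, long long and their unsigned variants.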
2016-02-02 | docs: Add documentation for long setters | Arseny Kapoulkine
2016-02-02 | tests: Add tests for long setters | Arseny Kapoulkine
2016-02-02 | Implement set/set_value/operator= for long types | Arseny Kapoulkine
This makes the coverage for basic numeric types complete (sans long double). Fixes #78.
2016-01-26 | Specify std=c++11 explicitly in Travis configurations | Arseny Kapoulkine
That way the defaults in the Makefile only matter for local runs.
2016-01-26 | Merge pull request #83 from ogdf/use-real-standards-only | Arseny Kapoulkine
Add cxxstd Makefile argument for testing C++ standards
2016-01-26 | Let Travis CI perform C++98-based tests | Stephan Beyer
2016-01-26 | Make cxxstd an argument to Makefile | Stephan Beyer
This determines the used C++ standard. If you do not want to use a specific C++ standard, use cxxstd=any. The default is set to c++11. The "define" PUGIXML_NO_CXX11 is removed from the Makefile since it is not used in the code anyways.
2016-01-26 | Let Travis CI use Ubuntu trusty | Stephan Beyer
This allows C++11-based tests to be run on Linux.
2016-01-24 | test_parse.cpp: add newline to last line | Stephan Beyer
This is necessary in order to comply with the C++03 standard.
2016-01-24 | Merge pull request #81 from ogdf/whitespace-issues | Arseny Kapoulkine
Fix whitespace issues
2016-01-24 | Fix whitespace issues | Stephan Beyer
Git warns when it finds "whitespace errors". This commit gets rid of these whitespace errors for code and adoc files.
2016-01-20 | Change header format in non-compact mode to store page offset | Arseny Kapoulkine
This utilizes the fact that pages are of limited size so we can store offset from the object to the page in a few bits - we currently use 24 although that's excessive given that pages are limited to ~512k. This has several benefits:
- Pages do not have to be 64b aligned any more - this simplifies allocation flow and frees up 40-50 bytes from xml_document::_memory.
- Header now has 8 bits available for metadata for both compact and default mode which makes it possible to store type as-is (allowing easy type extension and removing one add/sub operation from type checks).
- One extra bit is easily available for future metadata extension (in addition to the bit for type encoding that could be reclaimed if necessary).
- Allocators that return 4b-aligned memory on 64-bit platforms work fine if misaligned reads are supported.
The downside is that there is one or two extra instructions on the allocation path. This does not seem to hurt parsing performance.
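The header layout described here can be sketched as a word that keeps the object's byte offset from its page in the upper bits and 8 bits of metadata in the lower bits. Everything in the example is hypothetical (the page and node types, the helper names, the exact bit positions); it only illustrates how a page pointer can be recovered from an offset instead of from page alignment.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical node: a single header word is all this sketch needs.
struct node
{
    std::uintptr_t header;
};

// Hypothetical page: a contiguous block of nodes with no alignment requirement.
struct page
{
    node slots[64];
};

const unsigned header_flag_bits = 8;    // low bits: metadata such as the node type
const unsigned header_offset_bits = 24; // next bits: byte offset from the page start

// Packs the distance from the page start to the object together with the flags.
inline std::uintptr_t make_header(const void* object, const page* pg, unsigned flags)
{
    std::ptrdiff_t offset = static_cast<const char*>(object) - reinterpret_cast<const char*>(pg);

    assert(offset >= 0 && offset < (std::ptrdiff_t(1) << header_offset_bits));
    assert(flags < (1u << header_flag_bits));

    return (static_cast<std::uintptr_t>(offset) << header_flag_bits) | flags;
}

// Recovers the owning page by subtracting the stored offset from the object address.
inline page* get_page(node* n)
{
    std::uintptr_t offset = n->header >> header_flag_bits;

    return reinterpret_cast<page*>(reinterpret_cast<char*>(n) - offset);
}

// Extracts the metadata bits.
inline unsigned get_flags(const node* n)
{
    return static_cast<unsigned>(n->header & ((1u << header_flag_bits) - 1));
}
```

The shift and subtraction in get_page are the kind of extra work the commit message accepts in exchange for dropping the alignment requirement.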
2016-01-19 | docs: Adjust as_int documentation to match implementation | Arseny Kapoulkine
Also remove the description of behavior for trailing non-numeric characters. It's likely this will become a parse error in the future so better leave it as unspecified for now. Fixes #80.
2016-01-14 | Merge pull request #79 from zeux/embed-pcdata | Arseny Kapoulkine
Add parse_embed_pcdata flag
This flag determines whether plain character data is stored in the parent element's value. This significantly changes the structure of the document; this flag is only recommended for parsing documents with a lot of PCDATA nodes in a very memory-constrained environment. Most high-level APIs continue to work; code that inspects DOM using first_child()/value() will have to be adapted.
2016-01-12 | Preserve order semantics for child_value/text when using parse_embed_pcdata | Arseny Kapoulkine
The performance cost is probably negligible and this means we treat embedded value as the first child consistently.
2016-01-12 | docs: Add parse_embed_pcdata documentation | Arseny Kapoulkine
2016-01-12 | Adjust parse_embed_pcdata documentation | Arseny Kapoulkine
Since round-tripping should not be a problem any more don't mention it.
2016-01-12 | tests: Add test for parse_embed_pcdata | Arseny Kapoulkine
2016-01-12 | Only save first PCDATA contents in the element | Arseny Kapoulkine
This change fixes an important ordering issue - if element node has a PCDATA child *after* other elements, it's impossible to tell which order the children were in. Since the goal of PCDATA embedding is to save memory when it's the only child, only apply the optimization to the first child. This seems to fix all roundtripping issues so the only caveat is that the DOM structure is different.
2016-01-09 | Implement output support for embedded PCDATA values | Arseny Kapoulkine
This is a bit awkward since preserving correct indentation structure requires a bit of extra work, and the closing tag has to be written by _start function to correctly process the rest of the tree.
2016-01-08 | Support XPath string value for parse_embed_pcdata | Arseny Kapoulkine
2016-01-08 | Support xml_node::child_value/text for parse_embed_pcdata | Arseny Kapoulkine
2016-01-08 | Add initial support for parse_embed_pcdata | Arseny Kapoulkine
When this flag is true, PCDATA value is saved to the parent element instead of allocating a new node. This prevents some documents from round-tripping since it loses information, but can provide a significant memory reduction and parsing speedup for some documents.
2015-12-31 | docs: Regenerate HTML documentation | Arseny Kapoulkine
2015-12-31 | Update all license/copyright texts to 2016 | Arseny Kapoulkine
2015-12-31 | docs: Use ISO-8601 for changelog dates | Arseny Kapoulkine
2015-12-30 | scripts: Fix default value for STATIC_CRT option | Arseny Kapoulkine
Also refactor to use the same case and run after common options.