| Andrew Cooke | Contents | Latest | RSS | Twitter | Previous | Next

C[omp]ute

Welcome to my blog, which was once a mailing list of the same name and is still generated by mail. Please reply via the "comment" links.

Always interested in offers/projects/new ideas. Eclectic experience in fields like: numerical computing; Python web; Java enterprise; functional languages; GPGPU; SQL databases; etc. Based in Santiago, Chile; telecommute worldwide. CV; email.

Personal Projects

Lepl parser for Python.

Colorless Green.

Photography around Santiago.

SVG experiment.

Professional Portfolio

Calibration of seismometers.

Data access via web services.

Cache rewrite.

Extending OpenSSH.

C-ORM: docs, API.

Last 100 entries

Good Example Of Good Police Work (And Anonymity Being Hard); Best Santiago Burgers; Also; Michael Emmerich (Vibrator Translator) Interview (Japanese Books); Clarice Lispector (Brazillian Writer); Books On Evolution; Looks like Ara (Modular Phone) is dead; Index - Translations From Chile; More Emotion in Chilean Wines; Week 7; Aeon Magazine (Science-ish); QM, Deutsch, Constructor Theory; Interesting Talk Transcripts; Interesting Suggestion Of Election Fraud; "Hard" Books; Articles or Papers on depolarizing the US; Textbook for "QM as complex probabilities"; SFO Get Libor Trader (14 years); Why Are There Still So Many Jobs?; Navier Stokes Incomplete; More on Benford; FBI Claimed Vandalism; Architectural Tessellation; Also: Go, Blake's 7; Delusions of Gender (book); Crypto AG DID work with NSA / GCHQ; UNUMS (Universal Number Format); MOOCs (Massive Open Online Courses); Interesting Looking Game; Euler's Theorem for Polynomials; Weeks 3-6; Reddit Comment; Differential Cryptanalysis For Dummies; Japanese Graphic Design; Books To Be Re-Read; And Today I Learned Bugs Need Clear Examples; Factoring a 67 bit prime in your head; Islamic Geometric Art; Useful Julia Backtraces from Tasks; Nothing, however, is lost with less discomfort than that which, when lost, cannot be missed; Article on Didion; Cost of Living by City; British Slavery; Derrida on Metaphor; African SciFi; Traits in Julia; Alternative Japanese Lit; Pulic Key as Address (Snow); Why Information Grows; The Blindness Of The Chilean Elite; Some Victoriagate Links; This Is Why I Left StackOverflow; New TLS Implementation; Maths for Physicists; How I Am 8; 1000 Word Philosophy; Cyberpunk Reading List; Detailed Discussion of Message Dispatch in ParserCombinator Library for Julia; FizzBuzz in Julia w Dependent Types; kokko - Design Shop in Osaka; Summary of Greece, Currently; LLVM and GPUs; See Also; Schoolgirl Groyps (Maths); Japanese Lit; Another Example - Modular Arithmetic; Music from United; Python 2 and 3 compatible alternative.; Read Agatha Christie for the Plot; A Constructive Look at TempleOS; Music Thread w Many Recommendations; Fixed Version; A Useful Julia Macro To Define Equality And Hash; k3b cdrom access, OpenSuse 13.1; Week 2; From outside, the UK looks less than stellar; Huge Fonts in VirtualBox; Keen - Complex Emergencies; The Fallen of World War II; Some Spanish Fiction; Calling C From Fortran 95; Bjork DJ Set; Z3 Example With Python; Week 1; Useful Guide To Starting With IJulia; UK Election + Media; Review: Reinventing Organizations; Inline Assembly With Julia / LLVM; Against the definition of types; Dumb Crypto Paper; The Search For Quasi-Periodicity...; Is There An Alternative To Processing?; CARDIAC (CARDboard Illustrative Aid to Computation); The Bolivian Case Against Chile At The Hague; Clear, Cogent Economic Arguments For Immigration; A Program To Say If I Am Working; Decent Cards For Ill People; New Photo; Luksic And Barrick Gold; President Bachelet's Speech; Baltimore Primer

© 2006-2015 Andrew Cooke (site) / post authors (content).

libxml2: Creating XML and Validating with Schema

From: andrew cooke <andrew@...>

Date: Thu, 14 Jul 2011 16:57:48 -0400

I need to create a document for a horribly complicated schema in XML, from C,
using libxml2.  Since the document has a schema I am bootstrapping myself by
creating an empty document, testing it, and then adding elements etc as needed
until it validates.  At that point, hopefully, I will (1) have a good starting
point and (2) understand libxml2 much more than I do right now.

But anyway, just to get there has taken fair amount of effort, so here's the
code, in case it will help anyone else (some parts removed to protect the
innocent - mainly logging).  Also, logging and handling of failures is
incomplete.

Andrew

PS This is on CentOS 5.4, although I had to install libxml from
https://github.com/projecthydra/libxml2 to handle my particular schema.


#include <stdio.h>

// do not simply append libxml2 here - it will not work
// see http://bugs.centos.org/view.php?id=1340 for solution
#include <libxml/parser.h>
#include <libxml/xmlschemas.h>

// elided imports

// first, two callbacks that display errors - you could probably use
// fprintf instead.

static void xmlSchemaValidityErrorFunc_impl(void __attribute__((unused)) *ctx,
const char *msg, ...) {
    static char buffer[5000];
    va_list argp;
    va_start(argp, msg);
    vsprintf(buffer, msg, argp);
    va_end(argp);
    // elided logging of errors
}


static void xmlSchemaValidityWarningFunc_impl(void __attribute__((unused))
*ctx, const char *msg, ...) {
    static char buffer[5000];
    va_list argp;
    va_start(argp, msg);
    vsprintf(buffer, msg, argp);
    va_end(argp);
    // elided logging of warnings
}


/**
 * See http://wiki.njh.eu/XML-Schema_validation_with_libxml2
 */
static int validate(xmlDocPtr doc, const char *schema_path) {

    int status = ...;
    const char *schema_path;
    xmlDocPtr schema_doc = NULL;
    xmlSchemaParserCtxtPtr parser_ctxt = NULL;
    xmlSchemaPtr schema = NULL;
    xmlSchemaValidCtxtPtr valid_ctxt = NULL;

    if (! (schema_doc = xmlReadFile(schema_path, NULL, XML_PARSE_NONET))) {
       status = ...;
       goto exit;
    }

    if (! (parser_ctxt = xmlSchemaNewDocParserCtxt(schema_doc))) {
       status = ...;
       goto exit;
    }

    if (! (schema = xmlSchemaParse(parser_ctxt))) {
       status = ...;
       goto exit;
    }

    if (! (valid_ctxt = xmlSchemaNewValidCtxt(schema))) {
       status = ...;
       goto exit;
    }

    xmlSchemaSetValidErrors(valid_ctxt, &xmlSchemaValidityErrorFunc_impl,
       &xmlSchemaValidityWarningFunc_impl, NULL);

    if (xmlSchemaValidateDoc(valid_ctxt, doc)) {
       ssi_chan_log("error", "Document is invalid");
       status = ...;
       goto exit;
    }

exit:
    if (valid_ctxt) xmlSchemaFreeValidCtxt(valid_ctxt);
    if (schema) xmlSchemaFree(schema);
    if (parser_ctxt) xmlSchemaFreeParserCtxt(parser_ctxt);
    if (schema_doc) xmlFreeDoc(schema_doc);

    return status;
}


int new_document(const char* path, configuration *conf) {

    int status = ..;
    xmlDocPtr doc = NULL;
    xmlNodePtr root_node = NULL;
    xmlNsPtr ns = NULL;

    char *namespace = "http://...";

    LIBXML_TEST_VERSION; // checks link against compilation

    // probably need more tests/error handling here
    doc = xmlNewDoc(BAD_CAST "1.0"); // XML version
    ns = xmlNewNs(NULL, BAD_CAST namespace, BAD_CAST "foo");
    root_node = xmlNewNode(ns, BAD_CAST "MyRoot");
    xmlDocSetRootElement(doc, root_node);

    if ((status = validate(doc, conf->schema_path))) goto exit;

    if (-1 == xmlSaveFormatFileEnc(path, doc, "UTF-8", 1)) {
        status = ...;
        goto exit;
    }

exit:
    if (ns) xmlFreeNs(ns);
    if (doc) xmlFreeDoc(doc);
    xmlCleanupParser();
    xmlMemoryDump();

    return status;
}

Comment on this post