Changes between Version 3 and Version 4 of SophieFormat


Ignore:
Timestamp:
11/07/09 17:33:57 (15 years ago)
Author:
mira
Comment:

--

Legend:

Unmodified
Added
Removed
Modified
  • SophieFormat

    v3 v4  
    1 The R3 format is not just a Book format but a Resource format. It represents a given Resource with its sub-resources, binary data, cached data, external Resource references, etc. [[BR]] 
    2 Since Resources form a hierarchy (for instance Book -> Page -> Frame), the R3 format structure follows the same hierarchy. [[BR]] 
    3 The R3 format is a zipped file incorporating some directories and some files: 
    4  * each Resource described inside (either the root -resource or any of its sub-resources) has a corresponding directory with some helper directories and files. 
    5 The exact structure description of the format is described via an example. Here is a description of the file format for a Book, though any kind of Resource can be saved in similar structure: 
    6 * The name of the zip file is something like "ExampleBook.s2b". This name without the file extension (namely "ExampleBook") is the name of the Book Resource. The extension depends on the kind of the resource that is persisted. 
    7 * The root directory of the file contains: 
    8  * '''_cache''' - a directory for caching remote Resources. It is used to load a resource even if you can’t connect to the server.   
    9  * '''_data''' - a directory for storing binary and text data. Instead of embedding unreadable binary data in the -resource.xml file, it is stored here and just referred from other places (like the .xml files)  
    10  * '''_resource.xml''' - an .xml file describing the structure and properties of the root-resource. 
    11  * '''sub-elements''' directories – All child resources of the current resource are persisted in similar manner in inner directories named with their names in the resource tree. Those resources are kept in the described above manner.   
    12 Resource names cannot start with an underscore so they won't conflict with _cache,  _data and _resource.xml. 
    13 Here is a sample  _resource.xml  for a book: 
    14 === _resource.xml === 
     1SOPHIE 2.0 RESOURCE FORMAT  
     2The R3 format is used for persistence of a sophie 2.0 resources. It is simply the archive of the root directory of the resource. It contains:  
     3•       _resource.xml file - xml description of all properties of the persisted resource  
     4•       _data folder - incorporates all embedded files like images, videos, large texts, etc. 
     5•       _cache folder- containing all cached remote resources for offline usage  
     6•       children directories – There is sub-directory for all child resources of the persisted one that have the same structure as the parent one. The names of those directories do not start with underscore and match the names of the child resources (not their titles!).  
     7At the end of this page you can find a sample book. You can download it and open it with any file compression software (WinZip, WinRar, 7-zip, File Roller, etc) to gain a better understanding of the described structure.  
     8Basically every resource could be persisted on its own so there are different file extensions for the different resources: ‘.page’, ‘.book’, ‘.frame’, etc. All of those are further extended by the generalized extension ‘.s2’. So if you were saving a page and you decided to name the file “Page A” the complete name of the file would be “Page A.page.s2”. 
     9The key part of every resource directory is its _resource.xml. It is different for the different kinds of resources but there are some common things:  
     10* It begins with a standard <xml> tag. 
     11 * The root tag is named ‘resource’.  
     12  * It has a mandatory attribute ‘kind’ that specifies the kind of the persisted resource (book, page…). 
     13  * There is also mandatory attribute ‘format-version’ whose value is the id of the persistence format. For now this it is always "r3" but in future there could be different formats. ) 
     14 * There is a common child ‘title’. This is the name of the resource that is used in all user controls – the resource palette, links\timelines hud, bottom flap, etc. Its value is string:  
    1515{{{ 
    16 <?xml version="1.0" encoding="UTF-8" standalone="no"?> 
    17  
    18 <resource format-version="r3" kind="book"> 
    19   <title>New Book</title> 
    20   <pages size="4"> 
    21     <page index="0"><ref location="./Page Qs5k1U"/></page> 
    22     <page index="1"><ref location="./PageCs37p5"/></page> 
    23     <page index="2"><ref location="./PageYIgmpZ"/></page> 
    24     <page index="3"><ref location="./Pageh3QLuC"/></page> 
    25   </pages> 
    26   <page-templates size="1"> 
    27     <template index="0"><ref location="./PageJd88Us"/></template> 
    28   </page-templates> 
    29   <page-size> 
    30     <height>480.0</height> 
    31     <width>640.0</width> 
    32   </page-size> 
    33 </resource> 
    34  
     16   <title> 
     17     Cool Book 
     18  </title> 
    3519}}} 
    36  
    37  * It begins with a standard <xml> tag. 
    38  * All other content is wrapped in a <resource> tag which contains the kind of the resource, the file format id (for now it is always "r3") and other attributes that are not threated as regular properties. 
    39  * All properties of the resource are saved in appropriate format depending on the type of the value. 
     20 * There is a child ‘template’ that contains a reference to another resource. It is saved like this: 
     21{{{ 
     22<template> 
     23  <ref location="../Resource Jd88Us"/> 
     24</template> 
     25}}}  
     26This means that the referenced resource is located in the parent directory of the current one and is called “Resource Jd88Us”. Notice that this is the actual name of the resource, not its title.  
     27 * There is a ‘comment’ child that has a string value.  
     28 * There is an ‘import-origin’child that has a string value. It represents the URI of the import origin of this resource.  
     29Those children nodes are common for all resources. They are not mandatory, though. So the ‘resource’ node could have some or all of them. When a property`s value is null there is no point in having empty tags for it.  
     30There are many other children nodes that could be found in a _resource.xml depending on the kind of the persisted resource. Here we are presented most of them specifying where you could see them. Basically all properties of a resource are saved in this xml file in appropriate form depending on the type of the kept value.  
     31When the persisted resource is a book its _resource.xml could have ‘page-size’ node that holds the size of the page- width and height:  
     32{{{ 
     33<page-size> 
     34  <height>480.0</height> 
     35  <width>640.0</width> 
     36</page-size> 
     37}}} 
    4038  
    41  * ListProperties are saved in a similar way. There is a xml child for each element of the list and the series of these tags are wrapped in a one bigger tag for the whole ListProperty. All list element tags have an attribute 'index'. This attribute is added because in the xml representation order of the children with the same name on the same level of the xml is not regarded. That means that in the list of pages for example the page elements could change their place in the xml representation but we won`t lose the information for the order in which they were persisted.   
     39 Book resources could also have 3 additional child nodes - ‘pages’, ‘page-templates’ and ‘frame-templates’. The value of these properties is a list of references to resources just like the template one. These child nodes have attribute ‘size’ that is integer number indicating how many elements this list holds. Each element of the list is persisted separately in a child node called accordingly ‘page’ or ‘template’. Those elements have attribute ‘index’ since the order of those elements is important to the way the book is presented. In xml files order of the children with the same name on the same level of the xml is not regarded. That way the list of pages for example page elements could change their place in the xml representation but we won`t lose the information for the order in which they were arranged in the book.