Author Topic: updated xowa.trid.xml for XOWA Wiki data  (Read 747 times)

jenderek

  • Sr. Member
  • ****
  • Posts: 375
updated xowa.trid.xml for XOWA Wiki data
« on: April 25, 2023, 12:48:49 AM »
Hello trid users,

some days ago i updated definition for some SQLite 3.x databases.
Some has 4 byte suffix XOWA.

So i run trid utility on such XOWA examples. All my samples are
described as correctly with low priority generic as "SQLite 3.x
database" by xml.trid.xml. With high priority many samples are
described as "XOWA Wiki data" with suffix XOWA and mime type
application/octet-stream (See appended output/trid-v-old.txt).

For comparison reason i also run file command (version 5.44) on such
samples. Here these XOWA samples are described generic as "SQLite 3.x
database" with additional information (see appended
output/file-5.44.txt) and mime type application/vnd.sqlite3 (see
appended output/file-i-5.44.txt) and wrong suffix (see appended
output/file-ext-5.44.txt).

For comparison reason i also run the file format identification
utility DROID ( See https://sourceforge.net/projects/droid/). Here the
examples are also recognized generic . These are described as "SQLite
Database File Format" with version "3" and with mime type
application/x-sqlite3 by PUID fmt/729. But suffix xowa is considered
as "bad" (EXTENSION_MISMATCH true).

I found no mime type for xowa. But the samples are just SQLite
Database. So these should get that associated official mime type as it
is also shown by file command.  So that is now expressed inside TrID
definition by line like:
   <Mime>application/vnd.sqlite3</Mime>

So i run tridscan on my undetected XOWA samples to update
xowa.trid.xml.
Afterwards i looked what has changed. In Global Strings many lines with
XOWA vanished like:
   <String>XOWA.BLDR.DBSCHEMA_VERSION28</String>
   <String>XOWA.BLDR.SESSIONWIKI_DOMAIN</String>
   <String>XOWA.BLDR.DBCORE_FILE_NAME</String>
   <String>XOWA.BLDR.DBORIG_FILE_NAME</String>
   <String>XOWA.BLDR.SESSIONDUMP_NAME</String>
   <String>+XOWA.BLDR.SESSIONTIME201</String>
   <String>XOWA.BLDR.SESSIONVERSION</String>
   <String>UXOWA.BLDR.SESSIONGUID</String>
   <String>XOWA.BLDR.SESSIONUSER</String>
   <String>XOWA.BLDR.DBPART_ID</String>
   <String>XOWA.BLDR.DBNS_IDS</String>
   <String>UXOWA.BLDR.DBGUID</String>
   <String>XOWA.BLDR.DBTYPE</String>
   <String>-PAGES-ARTICLES</String>
   <String>XOWA.BLDR.DBID</String>
   <String>-CORE.XOWA</String>
Some of such lines still survived like:
   <String>CREATE UNIQUE INDEX XOWA_CFG__MAIN ON XOWA_CFG (CFG_GRP</String>
   <String>INDEXXOWA_CFG__MAINXOWA_CFG</String>
   <String>}TABLEXOWA_CFGXOWA_CFG</String>
   <String>CREATE TABLE XOWA_CFG</String>

Some obviously garbage line vanished like:
   <String>-PAGES-ARTICLES</String>
   <String>ANON</String>
   <String>MOUS</String>
   <String>RANO</String>
   <String>SERA</String>

From the first lines some shorted part survived. This are expressed by
lines like:
   <String>FILE</String>
   <String>SESS</String>
   <String>WIKI</String>

With the updated trid definition now all my XOWA examples are described
like before but now a specific mime type is also shown (see appended
output/trid-v-new.txt). TrID definitions and output are
stored in archive xowa_.zip. I hope that my updated definition can be used in
future version of triddefs.

With best wishes
Jörg Jenderek

Mark0

  • Administrator
  • Hero Member
  • *****
  • Posts: 2743
    • Mark0's Home Page
Re: updated xowa.trid.xml for XOWA Wiki data
« Reply #1 on: April 25, 2023, 02:32:32 PM »
Thanks!