• Backing up the repository directory

  • We tried to make OpenKM as intuitive as possible, but an advice is always welcome.
We tried to make OpenKM as intuitive as possible, but an advice is always welcome.
Forum rules: Please, before asking something see the documentation wiki or use the search feature of the forum. And remember we don't have a crystal ball or mental readers, so if you post about an issue tell us which OpenKM are you using and also the browser and operating system version. For more info read How to Report Bugs Effectively.
 #5522  by Flogeras
 
Hello

I use GNU tar to backup the repository directory. I have been trying to use the --listed-incremental option, using weekly snapshots (we have fairly low traffic on our openkm install), but something seems to be changing the mtime and ctime of the files each night at around midnight. I can understand that the system may index periodically, but why is it changing the mtime/ctime. This makes incremental backups impossible for me. Is there something in my configuration that might be wrong?

Thanks,
Dave
 #5528  by jllort
 
if lucene indexing files are a problem to you, simply do not include it. After restoring, lucene indexing are automatically created .... but please if you decide it ... try restoring this backup type to ensure all is right.
 #5533  by Flogeras
 
I guess it is more a question of why is it modifying the files when it indexes them? Should I contact the lucene people to investigate if this is a bug?
 #5565  by pavila
 
Which files are modified? IT also depends on your repository.xml configuration, please post it.
 #5570  by Flogeras
 
It seems that each and every file within the repository/repository/datastore directory is modified at (or just after) midnight every day.

Here is my repository.xml (from OpenKM 4.1), it should be pretty standard with the exception of using Postgres for the metadata.
Code: Select all
<?xml version="1.0"?>                                                           
<!DOCTYPE Repository PUBLIC "-//The Apache Software Foundation//DTD Jackrabbit 1.4//EN"                                                                         
                            "http://jackrabbit.apache.org/dtd/repository-1.4.dtd">                                                                              
<Repository>                                                                    
    <FileSystem class="org.apache.jackrabbit.core.fs.local.LocalFileSystem">    
        <param name="path" value="${rep.home}/repository"/>                     
    </FileSystem>                                                               
    <Security appName="OpenKM">                                                 
      <AccessManager class="es.git.openkm.core.OKMAccessManager"/>              
      <!-- <AccessManager class="org.apache.jackrabbit.core.security.SimpleAccessManager"/> -->                                                                 
    </Security>                                                                 
    <Workspaces rootPath="${rep.home}/workspaces" defaultWorkspace="default" /> 
    <Workspace name="${wsp.name}">                                              
        <FileSystem class="org.apache.jackrabbit.core.fs.local.LocalFileSystem">
            <param name="path" value="${wsp.home}"/>                            
        </FileSystem>                                                           
        <PersistenceManager class="org.apache.jackrabbit.core.persistence.bundle.PostgreSQLPersistenceManager">                                                 
            <param name="driver" value="org.postgresql.Driver"/>                
            <param name="url" value="jdbc:postgresql://localhost:5432/openkm?autoReconnect=true"/>                                                              
            <param name="schema" value="postgresql"/>                           
            <param name="user" value="openkm"/>                                 
            <param name="password" value=""/>                            
            <param name="schemaObjectPrefix" value="${wsp.name}_"/>             
            <param name="externalBLOBs" value="false"/>                         
        </PersistenceManager>                                                   
        <SearchIndex class="org.apache.jackrabbit.core.query.lucene.SearchIndex">                                                                               
            <param name="path" value="${wsp.home}/index"/>                      
            <param name="resultFetchSize" value="100"/>                         
            <param name="useCompoundFile" value="true"/>                        
            <param name="minMergeDocs" value="100"/>                            
            <param name="volatileIdleTime" value="3"/>                          
            <param name="maxMergeDocs" value="100000"/>                         
            <param name="mergeFactor" value="10"/>                              
            <param name="maxFieldLength" value="50000"/>                        
            <!-- Disable extractor pool -->                                     
            <!-- <param name="extractorPoolSize" value="0"/> -->                
            <param name="extractorTimeout" value="1000"/>                       
            <param name="bufferSize" value="10"/>                               
            <param name="cacheSize" value="1000"/>                              
            <param name="forceConsistencyCheck" value="false"/>                 
            <!-- <param name="consistencyCheck" value="true"/> -->              
            <!-- <param name="consistencyFix" value="true"/> -->                
            <param name="autoRepair" value="true"/>                             
            <!-- <param name="analyzer" value="es.git.openkm.analysis.SpanishAnalyzer"/> -->                                                                    
            <param name="respectDocumentOrder" value="false"/>                  
            <param name="indexingConfiguration" value="${wsp.home}/../../../indexing_configuration.xml"/>                                                       
            <param name="textFilterClasses" value="
org.apache.jackrabbit.extractor.PlainTextExtractor,                             
org.apache.jackrabbit.extractor.PdfTextExtractor,                               
org.apache.jackrabbit.extractor.HTMLTextExtractor,                              
org.apache.jackrabbit.extractor.XMLTextExtractor,                               
org.apache.jackrabbit.extractor.RTFTextExtractor,                               
org.apache.jackrabbit.extractor.OpenOfficeTextExtractor,                        
es.git.openkm.extractor.MsExcelTextExtractor,                                   
es.git.openkm.extractor.MsPowerPointTextExtractor,                              
es.git.openkm.extractor.MsWordTextExtractor,                                    
es.git.openkm.extractor.MsOffice2007TextExtractor,                              
es.git.openkm.extractor.ExifTextExtractor,                                      
es.git.openkm.extractor.TiffTextExtractor,                                      
es.git.openkm.extractor.AudioTextExtractor" />                                  
        </SearchIndex>                                                          
    </Workspace>                                                                
    <Versioning rootPath="${rep.home}/version">                                 
        <FileSystem class="org.apache.jackrabbit.core.fs.local.LocalFileSystem">
            <param name="path" value="${rep.home}/version"/>                    
        </FileSystem>                                                           
        <PersistenceManager class="org.apache.jackrabbit.core.persistence.bundle.PostgreSQLPersistenceManager">                                                 
            <param name="driver" value="org.postgresql.Driver"/>                
            <param name="url" value="jdbc:postgresql://localhost:5432/openkm?autoReconnect=true"/>                                                              
            <param name="schema" value="postgresql"/>                           
            <param name="user" value="openkm"/>                                 
            <param name="password" value=""/>                            
            <param name="schemaObjectPrefix" value="version_"/>                 
            <param name="externalBLOBs" value="false"/>                         
        </PersistenceManager>                                                   
    </Versioning>                                                               
    <!-- Also see DatabaseDataStore-->                                          
    <DataStore class="org.apache.jackrabbit.core.data.FileDataStore"/>          
</Repository>
 #10937  by snowman
 
Hello,

is there any update on that issue? I observe the same behavior. Even if I have no external traffic for several days, all my repository files have a mtime stamp of midnight. Inremental backup is ruled out with that.

Best regards,
Snowman
 #10945  by pavila
 
Please, try with a more recent OpenKM version because 4.1 is only supported for our costumers. For example OpenKM 5.0.4 or OpenKM 5.1.3

About Us

OpenKM is part of the management software. A management software is a program that facilitates the accomplishment of administrative tasks. OpenKM is a document management system that allows you to manage business content and workflow in a more efficient way. Document managers guarantee data protection by establishing information security for business content.