Difference between revisions of "IBM CMOD Cache Filesystems"

Many minor updates, refreshed links.
Line 15:
As you load data into IBM Content Manager OnDemand, data is placed into the caches for fast access by end users.  The only exception to this rule is if you choose not to load Application Group data to the cache at all, and instead send it directly to secondary storage - an option generally reserved for situations where documents need to be archived for legal or regulatory reasons, but are rarely, if ever, accessed.


If you have more than one cache filesystem, IBM CMOD chooses the cache with the most free space, and puts the compressed objects generated by the load there.
If you have more than one cache filesystem, OnDemand chooses the cache with the most free space, and puts the compressed objects generated by the load there.
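
The selection rule described above can be illustrated with a short Python sketch.  This is not OnDemand's actual implementation; it only shows the idea of picking the filesystem with the most free blocks.  The function names and the example paths are hypothetical, and the sketch assumes a Unix-like system where <code>os.statvfs</code> is available.

```python
import os


def free_blocks(path):
    """Return the free blocks available to unprivileged users on the
    filesystem that holds `path` (Unix only)."""
    return os.statvfs(path).f_bavail


def pick_cache(cache_paths, free_fn=free_blocks):
    """Return the cache path with the most free blocks.

    `free_fn` is injectable so the rule can be tried with made-up numbers.
    Comparing raw block counts assumes all filesystems use the same block size.
    """
    if not cache_paths:
        raise ValueError("at least one cache filesystem is required")
    return max(cache_paths, key=free_fn)


if __name__ == "__main__":
    # Hypothetical cache filesystems and free-block counts.
    sample = {"/arscache1": 1_200_000, "/arscache2": 4_800_000}
    print(pick_cache(list(sample), sample.get))
```

With the made-up numbers above, the second filesystem is chosen because it reports more free blocks.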


==How does the cache get emptied?==
Line 21:


== What is cache expiration? ==
CMOD Cache expiration is the process by which OnDemand reduces the amount of data inside the caches, in order to make room for new data that is loaded into your Content Manager OnDemand server.  It does this by removing data that has already been migrated to secondary storage or is eligible for expiration, starting with the oldest documents first, until the filesystems are approximately 80% full by default.  As mentioned above, on large IBM CMOD servers that have multiple terabytes of cache, adjusting the minimum and maximum parameters makes optimal use of your expensive cache filesystems.  The only reason to maintain a terabyte or more of free space in your cache is to ensure there is room to load incoming files if data can't be written to secondary storage for some reason.  You can visit the [[arsmaint]] page to find out more about how you can create free space in the IBM CMOD cache filesystem.
CMOD Cache expiration is the process by which OnDemand reduces the amount of data inside the caches, in order to make room for new data that is loaded into your Content Manager OnDemand server.  It does this by removing data that has already been migrated to secondary storage or is otherwise eligible for expiration - starting with the oldest documents first - until the filesystems are approximately 80% full by default.  As mentioned above, on large IBM CMOD servers that have multiple terabytes of cache, adjusting the minimum and maximum parameters makes optimal use of the expensive storage that the CMOD cache is built with.  The only reason to maintain an excessive amount of free space in your cache is to ensure there is room to load incoming files if data can't be written to secondary storage for some reason.  You can visit the [[arsmaint]] page to find out more about how you can create free space in the IBM CMOD cache filesystem.
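
To make the expiration rule concrete, here is a minimal Python sketch of "remove the oldest eligible data until the filesystem is about 80% full".  It is a simplified model, not how [[arsmaint]] is implemented: the <code>CachedObject</code> class, its fields, and the sample values are all hypothetical.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class CachedObject:
    name: str
    loaded: date
    size_gb: float
    eligible: bool  # already migrated, or otherwise eligible for expiration


def expire(objects, capacity_gb, target_pct=80.0):
    """Remove the oldest eligible objects until usage is at or below
    target_pct of capacity.  Returns (kept, removed)."""
    used = sum(o.size_gb for o in objects)
    target = capacity_gb * target_pct / 100.0
    removed = []
    for obj in sorted(objects, key=lambda o: o.loaded):  # oldest first
        if used <= target:
            break
        if obj.eligible:
            removed.append(obj)
            used -= obj.size_gb
    removed_names = {o.name for o in removed}
    kept = [o for o in objects if o.name not in removed_names]
    return kept, removed


if __name__ == "__main__":
    cache = [
        CachedObject("2FAA", date(2020, 1, 1), 30, True),
        CachedObject("3FAA", date(2021, 1, 1), 30, False),
        CachedObject("4FAA", date(2022, 1, 1), 30, True),
    ]
    kept, removed = expire(cache, capacity_gb=100)
    print([o.name for o in removed])
```

Note that data which is not yet eligible is never removed in this model, so the target may not be reached if too little of the cache is eligible.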


==What is cache optimization?==
Storage is expensive and finite.  In order to make the most out of the storage space you have in your IBM CMOD Cache filesystems, you should try to store more of the data that users are actually accessing, for the period of time that they’re interested in it.  You can optimize the cache contents so that you’re using precisely enough disk to perform the required retrievals and to have enough space to load new data.  An optimized cache contains only the data people are accessing.  Unfortunately, there's no built-in tool for optimizing the Content Manager OnDemand cache, but [https://cmod.cloud/ibm-cmod-cache-optimization/ CMOD Cache optimization] is offered as a service by IBM Business Partners.
Fast and reliable storage is expensive and finite.  In order to make the most out of the space you have in your IBM CMOD Cache filesystems, you must store more of the data that users are actually accessing, but only for the period of time that they are actually accessing it.  It's possible to optimize the cache contents so that you’re using precisely the amount of disk necessary to quickly perform the required retrievals, but still have enough space to load new data.  An optimized cache contains only the data people are accessing.  Unfortunately, there's no built-in tool for optimizing the Content Manager OnDemand cache, but [https://cmod.cloud/ibm-cmod-cache-optimization/ CMOD Cache optimization] is offered as a service by companies that specialize in [https://TenaciousConsulting.com IBM CMOD Professional Services].


==How can I optimize the IBM CMOD Cache?==
If your IBM CMOD Cache Filesystems are large (over 1 terabyte) then the default setting for <code>arsmaint -c</code> will leave 20% (200 gigabytes or more) of each cache empty.  If you have multiple 1TB cache filesystems, and you only load 20GB per day, you could have a huge amount of wasted space in your caches.  You can adjust this number with the <code>-m</code> and <code>-n</code> options for [[arsmaint]] to set new maximum and minimum cache sizes during cache expiration.
If your IBM CMOD Cache Filesystems are large (for example, 1 terabyte / 1000 gigabytes) then the default setting for <code>arsmaint -c</code> will leave 20% (200 gigabytes) of each cache empty.  If you have multiple 1TB cache filesystems, and you only load 20GB per day, you could have a huge amount of wasted space in your caches.  You can adjust this number with the <code>-n</code> and <code>-x</code> options for [[arsmaint]] to set new minimum and maximum cache sizes during cache expiration.
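
The arithmetic behind this can be checked with a few lines of Python.  The 1000GB filesystem size, the 80% default, and the 20GB daily load come from the text above; the count of three filesystems and the 95% alternative are only illustrative assumptions, not recommendations.

```python
def reserved_free_gb(filesystem_gb, count, full_pct):
    """Free space left across `count` equally sized cache filesystems
    when each is allowed to fill to `full_pct` percent."""
    return filesystem_gb * count * (100 - full_pct) / 100


def days_of_headroom(reserved_gb, daily_load_gb):
    """How many days of loads fit into the reserved free space."""
    return reserved_gb / daily_load_gb


if __name__ == "__main__":
    # One 1000GB filesystem at the default 80%: 200GB stays empty.
    print(reserved_free_gb(1000, 1, 80))
    # Three such filesystems (assumed count): 600GB, or 30 days of 20GB loads.
    print(days_of_headroom(reserved_free_gb(1000, 3, 80), 20))
    # Same filesystems filled to a hypothetical 95%: 150GB, or 7.5 days.
    print(days_of_headroom(reserved_free_gb(1000, 3, 95), 20))
```

Comparing the last two results shows how much cache the default leaves idle when daily loads are small relative to the filesystem size.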


The other ways to tune the cache filesystems are to analyse the patterns of retrievals of Application Group data, and change the cache retention to match actual end user usage, and to use data from your load and retrieval patterns to determine your optimal cache size.
The other ways to tune the cache filesystems are to analyze the patterns of retrievals of Application Group data, and change the cache retention to match actual end user usage, and to use data from your load and retrieval patterns to determine your optimal cache size.


== Can I delete files in the cache manually? ==
NO!  The cache is only to be managed with the [[arsmaint]] utility!  The structure of the Cache Filesystem is complex, and deleting files or directories inside the Content Manager OnDemand cache could lead to documents becoming unavailable to end users, and potentially irrecoverable data loss.
NO!  The cache is only to be managed with the [[arsmaint]] utility!  The structure of the OnDemand Cache Filesystem is complex, and deleting files or directories inside the Content Manager OnDemand cache could lead to documents becoming unavailable to end users, and potentially irrecoverable data loss.


== Too late, I already deleted files from my CMOD cache filesystem.  Help!==
If you have a backup of your cache data in your enterprise backup system, you can probably recover the missing data.  You can contact [https://Tenacious.Consulting Tenacious Consulting] for assistance.  They have tools to help you restore and verify the contents of your cache, or to pull data from secondary storage back into the CMOD cache.
If you have a backup of your cache data in your enterprise backup system, you can probably recover the missing data.  Contact a company that provides [https://cmod.cloud/professionalservices/ Professional Services for CMOD] for assistance in recovering data.  They have tools to help you restore and verify the contents of your cache, or to pull data from secondary storage back into the CMOD cache.


== What are the retr and migr directories? ==
Line 47:


== Can I balance the data in the cache filesystem? ==
Content Manager OnDemand automatically balances the amount of data in caches by placing new CMOD object files in the cache with the most free space, measured in blocks of free space in the filesystem.  If you have a large number of cache filesystems with dramatically different utilization, you may have a configuration issue that needs to be corrected.
Content Manager OnDemand automatically balances the amount of data in caches by placing new CMOD object files in the cache with the most free space, measured in blocks of free space in the filesystem.  If you have a large number of cache filesystems with dramatically different utilization, you probably have a configuration issue that needs to be corrected.


== Related Articles ==
