- Jul 28, 2017
-
-
Vladimir Bahyl authored
XRD_TIMEOUTRESOLUTION=600 # increased from 15s
-
Julien Leduc authored
replaced eosh script with ls -y except after retrieval as archived and retrieved are the same status regarding eos... Looks like sometime ls -y determined archived files is not a growing function...
-
- Jul 27, 2017
-
-
Julien Leduc authored
client_ar.sh can now write to /eos/ctaeos/preprod with -d option, just complains: Could not remove disk replica for /eos/ctaeos/preprod/ as drop is already done in the wfe script. Should test for disk replica before trying to drop with ls -y on directory.
-
Eric Cano authored
-
Eric Cano authored
In the MemQueue, the promise for the next batch was set after the queue was committed, but before the lock was released (by the last user of the queue, through a shared pointer). This would lead to an uselessly early start of the next queue batch for writing an avoidable contention on the object store lock. This would no lead to a pile up though as only 2 thread would be contended (previous and early starting next).
-
Eric Cano authored
-
Eric Cano authored
-
Eric Cano authored
... in preparation for replacement of RetrieveMount::getNextJob().
-
Julien Leduc authored
-
Julien Leduc authored
-
Eric Cano authored
The retrieve request now gets properly queued in case of retrieve error. The errors are counted and the request gets deleted eventually. A new field was added to the retrive request in object store. This commit will fail on upgrade if there are retrieve requests still queued at update time. Cleaned up some unused structures in cta.proto Minor modifications to ArchiveJobs.
-
Victor Kotlyar authored
Converted all bytes to Mbytes. Removed extra space in the output. Reordered fields.
-
Eric Cano authored
This is a stop gap solution while we wait for efficient archive/retrieve reporting.
-
Eric Cano authored
-
- Jul 26, 2017
-
-
Eric Cano authored
This affects only unit tests as taped already relied on getNextJobBatch().
-
Vladimir Bahyl authored
-
Eric Cano authored
As rados re-creates an object when trying to lock it, we tested for presence before locking. This is racy as object could be deleted in the mean time. Instead, we now lock blindly and delete the object if we find it having a zero-size. As we own the lock, this is safe. This problem led to issues in garbage colector, where agent gets polled while it could disappear.
-
- Jul 25, 2017
-
-
Eric Cano authored
-
Eric Cano authored
-
Eric Cano authored
-
Eric Cano authored
This fixes frontend failing to start on file permission error.
-
Julien Leduc authored
-
Julien Leduc authored
- Jul 24, 2017
-
-
Eric Cano authored
Set default to 8k as seen in this documentation: http://www.rsyslog.com/doc/v8-stable/configuration/global/index.html.
-
Eric Cano authored
Added the vid in the name of the retrieve queue object.
-
Eric Cano authored
-
Julien Leduc authored
-
Eric Cano authored
-
- Jul 21, 2017
-
-
Eric Cano authored
Also fixed flaws in the cache update logic.
-
Eric Cano authored
Some further adaptation was needed on the templates.
-
Julien Leduc authored
-
- Jul 20, 2017
-
-
Eric Cano authored
It is now known as ostoredb::MemQueue<ArchiveRequest, ArchiveQueue>. This is in preparation of the reuse of the code for ostoredb::MemQueueu<RetrieveRequest, RetrieveQueue>.
-
Julien Leduc authored
their namespace. According to https://kubernetes.io/docs/concepts/configuration/secret/#creating-a-secret-using-kubectl-create-secret: ``` Secret API objects reside in a namespace. They can only be referenced by pods in that same namespace. ```
-
Julien Leduc authored
This allows to solve the recurrent docker login issue by passing the credentials to kubernetes as a secret. Indeed docker login credentials are valid for 3 months then it is over. We just need to pass the `ctareg` service account credentials in the `ctaregsecrets` and we are done. In addition `imagePullSecrets` works at the full cluster level as each kubelet needs the registry credentials to pull its own image. If `ctaregsecret` secret does not exists, kubernetes is perfectly fine with that and uses the default authentication. This means that there is no need to define this secret (which is useless in the buildtree environment).
-
Eric Cano authored
Changed the default behavior of DummyCatalogue so unit tests keep passing. Adapted SchedulerDatabase API.
-
Victor Kotlyar authored
Added tapepool information from the catalog in Scheduler::getQueuesAndMountSummaries.
-
- Jul 19, 2017
-
-
Eric Cano authored
Created a unit test for RetrieveRequest garbage collection. Created a RetrieveQueue::addJobIfNecessary function. Fixed various bugs in RetrieveRequest::garbageCollect. Added basic functionnality to DummyCatalogue for GC unit tests. Cleaned up interace of RetrieveRequest for functions that are never used. Implemented missing function in RootEntry. Homogenized capitalisation of object names. Added support for RetrieveRequest in cta-objectstore-collect-orphaned-object.
-