summaryrefslogtreecommitdiff
path: root/scheduler
AgeCommit message (Expand)AuthorFilesLines
2009-06-18Ensure one-time-hosts aren't in the Everyone ACL, and make the scheduler igno...showard2-0/+21
2009-06-18temporary fix for bug in scheduling when at capacity. if no drone has capaci...showard2-5/+63
2009-06-17add a log message to the scheduler thats useful for debugging atomic groupsshoward1-0/+4
2009-06-16Fix _atomic_and_has_started() to check *only* for states that are ashoward2-8/+13
2009-06-15Do not call .set_host if the host is already set.showard1-1/+4
2009-06-10Ensure all entry points get the import-time logging logic executed before oth...showard2-2/+3
2009-06-10Make scheduler and babysitter use the new logging_manager system.showard6-115/+57
2009-06-10Redesign the reverify hosts feature. Host status is no longer changedshoward1-3/+33
2009-06-10Have the scheduler wait a configurable amount of time before startingshoward3-48/+468
2009-06-10convert a few straggling print statements in the scheduler code to logging callsshoward3-7/+6
2009-06-08Add an option to global config to disable to the scheduler isn't accidentally...showard1-1/+9
2009-06-08Add logging info for drones so we know what drone drone_utility is running on...showard1-1/+2
2009-06-08Change behavior of Force Reverify: no longer executes cleanup before.showard1-7/+9
2009-05-29Have the scheduler check for and sometimes cleanup various DB inconsistencies.showard1-11/+109
2009-05-29make AgentTasksTest inherit from BaseSchedulerTest. it didn't used to, since...showard1-1/+3
2009-05-29make the readonly connection fallback to the regular Django connection when r...showard1-1/+3
2009-05-26fix JobManager.get_status_counts, which was returning incorrect counts in som...showard1-138/+9
2009-05-26Make recovered tasks correctly handle being aborted before being started. Un...showard2-26/+68
2009-05-26Explcitly catch SystemExit so we don't stack trace when we exit with sys.exitshoward1-0/+2
2009-05-20Sort hosts when choosing them for use in an atomic group and whenshoward2-12/+97
2009-05-15Add ability to reverify a host from the Host List.showard1-0/+5
2009-05-15Update debug_scheduler logging config to use INFO instead of debug.showard1-1/+1
2009-05-13Fix recurring run code to reflect recent changes to rpc_utils.create_new_job().showard1-13/+3
2009-05-13Add an extra check for existence of Autoserv results in GatherLogsTask -- in ...showard2-2/+4
2009-05-13Add job maximum runtime, a new per-job timeout that counts time since the job...showard3-4/+26
2009-05-13Fix scheduler unittest for parser's new -P flagmbligh1-1/+1
2009-05-13Add post-parse site hooks (parse -P to trigger, default = off)mbligh1-1/+1
2009-05-12Add job option for whether or not to parse failed repair results as part of a...showard2-9/+26
2009-05-11Represent a group of machines with either the atomic group label name,showard2-23/+191
2009-05-08Only run crashinfo collection when Autoserv exited due to some signal -- not ...showard2-12/+24
2009-05-08Pick hosts out of an atomic group in order rather than randomly so thatshoward1-3/+5
2009-05-05Change Agent.abort() again. This time, it runs through its queue of AgentTas...showard2-40/+50
2009-05-01Call out to site_monitor_db: site_init_monitor_dbmbligh1-0/+9
2009-04-29Here is a patch, which extends the autotest system with recurring jobshoward1-1/+53
2009-04-29Make autoserv --collect_crashinfo only run when Autoserv actually failed (exi...showard2-3/+23
2009-04-29fix Agent.abort() when it's called before the agent has started (in that case...showard2-3/+15
2009-04-28Make a couple of errant files executablembligh1-0/+0
2009-04-27Make RepairTask write job_queued and job_finished keyvals so they can be pars...showard2-17/+51
2009-04-27Fix two bugs introduced in previous change to add collect_crashinfo support.showard2-32/+126
2009-04-24Change the client, babysitter, scheduler logging configs to append tojadmanski2-2/+2
2009-04-22Add support to the scheduler to run autoserv --collect_crashinfo after a job ...showard4-328/+417
2009-04-22Fix monitor_db_unittest, broken by previous change to refactor cleanup code. ...showard3-4/+12
2009-04-20extract code for generated autoserv command lines to a common place, includin...showard2-11/+34
2009-04-16Add information collecting method so we can see what state the system was in ...showard1-1/+39
2009-04-16Change connect_timeout default from 30 seconds to 5 minutesshoward1-1/+1
2009-04-15Fix monitor_db.DBObject.save() to handle None values as NULL properly.showard2-3/+35
2009-04-09This gives us fixed width, and saves some space.mbligh2-4/+4
2009-04-09New prefix for file loggingmbligh2-2/+4
2009-04-08Move clean up functions into seperate file/classesmbligh2-70/+147
2009-04-07Ensure exception information from monitor_db goes to logs.showard2-2/+10