summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorAlex,Shi <alex.shi@intel.com>2010-06-17 14:08:13 +0800
committerGreg Kroah-Hartman <gregkh@suse.de>2010-08-02 10:26:38 -0700
commit56715eac6f5c8b45214f0a39fca6692fe326575a (patch)
treecfb2e85a28fa4791971bacd371c18a843fd4e970
parented8cfec535b4e4b03a01b31e3b0434dd8ff078cc (diff)
downloadlwn-56715eac6f5c8b45214f0a39fca6692fe326575a.tar.gz
lwn-56715eac6f5c8b45214f0a39fca6692fe326575a.zip
sched: Fix over-scheduling bug
commit 3c93717cfa51316e4dbb471e7c0f9d243359d5f8 upstream. Commit e70971591 ("sched: Optimize unused cgroup configuration") introduced an imbalanced scheduling bug. If we do not use CGROUP, function update_h_load won't update h_load. When the system has a large number of tasks far more than logical CPU number, the incorrect cfs_rq[cpu]->h_load value will cause load_balance() to pull too many tasks to the local CPU from the busiest CPU. So the busiest CPU keeps going in a round robin. That will hurt performance. The issue was found originally by a scientific calculation workload that developed by Yanmin. With that commit, the workload performance drops about 40%. CPU before after 00 : 2 : 7 01 : 1 : 7 02 : 11 : 6 03 : 12 : 7 04 : 6 : 6 05 : 11 : 7 06 : 10 : 6 07 : 12 : 7 08 : 11 : 6 09 : 12 : 6 10 : 1 : 6 11 : 1 : 6 12 : 6 : 6 13 : 2 : 6 14 : 2 : 6 15 : 1 : 6 Reviewed-by: Yanmin zhang <yanmin.zhang@intel.com> Signed-off-by: Alex Shi <alex.shi@intel.com> Signed-off-by: Peter Zijlstra <a.p.zijlstra@chello.nl> LKML-Reference: <1276754893.9452.5442.camel@debian> Signed-off-by: Ingo Molnar <mingo@elte.hu> Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de>
-rw-r--r--kernel/sched.c3
1 files changed, 0 insertions, 3 deletions
diff --git a/kernel/sched.c b/kernel/sched.c
index d7aa8fdea1ff..12b9e7344d90 100644
--- a/kernel/sched.c
+++ b/kernel/sched.c
@@ -1719,9 +1719,6 @@ static void update_shares_locked(struct rq *rq, struct sched_domain *sd)
static void update_h_load(long cpu)
{
- if (root_task_group_empty())
- return;
-
walk_tg_tree(tg_load_down, tg_nop, (void *)cpu);
}