Fix: remove break in epoll loop of apps. thread
author David Goulet <dgoulet@efficios.com>
Wed, 18 Dec 2013 23:34:44 +0000 (18:34 -0500)
committer David Goulet <dgoulet@efficios.com>
Thu, 19 Dec 2013 14:39:51 +0000 (09:39 -0500)
In a *heavy* stress test with a large number of applications (> 7000 per
second), the manage application thread could starve the delete process
by breaking out of the loop just after adding an application to the
poll set.
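
The effect of that break can be sketched outside the session daemon. The
following is a minimal, Linux-only illustration that uses raw epoll rather
than the session daemon's own poll wrapper; handled_in_one_wakeup() and
MAX_SOCKS are hypothetical names chosen for this example and the event
handling is reduced to removing the socket from the poll set. It registers
several sockets whose remote end is already closed, waits once, and counts
how many hang-ups are handled before going back to the wait call.

```c
#include <stddef.h>
#include <sys/epoll.h>
#include <sys/socket.h>
#include <unistd.h>

#define MAX_SOCKS 16	/* Hypothetical upper bound for this sketch. */

/*
 * Register nb_socks sockets whose peer is already closed, wait once and
 * return how many hang-ups were handled in that single wakeup, or -1 on
 * error. If break_after_first is non-zero, mimic the old behavior and
 * leave the loop after the first handled event.
 */
int handled_in_one_wakeup(int nb_socks, int break_after_first)
{
	int local[MAX_SOCKS], sv[2], epfd, i, nb_fd, opened = 0, handled = -1;
	struct epoll_event ev = { 0 }, events[MAX_SOCKS];

	if (nb_socks < 1 || nb_socks > MAX_SOCKS) {
		return -1;
	}
	epfd = epoll_create1(0);
	if (epfd < 0) {
		return -1;
	}

	for (i = 0; i < nb_socks; i++) {
		if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0) {
			goto end;
		}
		local[opened++] = sv[0];
		close(sv[1]);		/* The "application" goes away. */
		ev.events = EPOLLIN;	/* No EPOLLET: level-triggered. */
		ev.data.fd = sv[0];
		if (epoll_ctl(epfd, EPOLL_CTL_ADD, sv[0], &ev) < 0) {
			goto end;
		}
	}

	nb_fd = epoll_wait(epfd, events, MAX_SOCKS, 0);
	if (nb_fd < 0) {
		goto end;
	}

	handled = 0;
	for (i = 0; i < nb_fd; i++) {
		if (events[i].events & (EPOLLHUP | EPOLLERR)) {
			/* Stand-in for unregistering the application. */
			epoll_ctl(epfd, EPOLL_CTL_DEL, events[i].data.fd, NULL);
			handled++;
			if (break_after_first) {
				break;
			}
		}
	}

end:
	for (i = 0; i < opened; i++) {
		close(local[i]);
	}
	close(epfd);
	return handled;
}
```

Calling handled_in_one_wakeup(4, 0) handles all four hang-ups in one pass,
whereas handled_in_one_wakeup(4, 1) handles a single one and would need one
more wait call per remaining socket.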

Also, we've observed that, when the loop breaks at each delete from the
poll set, the application unregister process is somehow not done for
most of the applications. We are still uncertain why, but one theory is
that epoll detects that an I/O operation is ready (here a shutdown) and
another subsystem of the session daemon uses that socket for I/O, which
flags the poll event as "has been taken care of". The loop then never
sees the event because of that break.

The notify socket thread does not use a break between poll operations,
which is what leads us to this conclusion for the manage apps thread.

We don't use epoll in edge-triggered mode, thus a POLLERR/POLLHUP should
always be returned as long as it has not been taken care of.
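
The level-triggered behavior relied upon here can be checked with a small
Linux-only sketch. It uses raw epoll rather than the session daemon's poll
wrapper, and count_hangup_reports() is a hypothetical name made up for this
example. A socket whose remote end is closed is registered without EPOLLET,
then polled several times without ever handling the event.

```c
#include <sys/epoll.h>
#include <sys/socket.h>
#include <unistd.h>

/*
 * Poll a hung-up socket `rounds` times without handling the event and
 * return how many times the hang-up was reported, or -1 on error.
 */
int count_hangup_reports(int rounds)
{
	int sv[2], epfd, i, reported = 0;
	struct epoll_event ev = { 0 }, out;

	if (socketpair(AF_UNIX, SOCK_STREAM, 0, sv) < 0) {
		return -1;
	}
	epfd = epoll_create1(0);
	if (epfd < 0) {
		close(sv[0]);
		close(sv[1]);
		return -1;
	}

	ev.events = EPOLLIN;	/* No EPOLLET: level-triggered. */
	ev.data.fd = sv[0];
	if (epoll_ctl(epfd, EPOLL_CTL_ADD, sv[0], &ev) < 0) {
		reported = -1;
		close(sv[1]);
		goto end;
	}

	close(sv[1]);		/* Socket closed on remote end. */

	for (i = 0; i < rounds; i++) {
		/* The event is deliberately left unhandled between waits. */
		if (epoll_wait(epfd, &out, 1, 0) == 1 &&
				(out.events & EPOLLHUP)) {
			reported++;
		}
	}

end:
	close(sv[0]);
	close(epfd);
	return reported;
}
```

With level-triggered epoll, count_hangup_reports(3) reports the hang-up on
each of the three waits. This sketch does not reproduce the case theorized
above where another subsystem performs I/O on the same socket.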

Acked-by: Mathieu Desnoyers <mathieu.desnoyers@efficios.com>
Signed-off-by: David Goulet <dgoulet@efficios.com>
src/bin/lttng-sessiond/main.c

index e522db01dc174493d76366c825a0f44594e31545..2eefebfe155f7270922fac7d7f0b0c34fd710eb8 100644
@@ -1293,10 +1293,6 @@ static void *thread_manage_apps(void *data)
                                        }
 
                                        DBG("Apps with sock %d added to poll set", sock);
-
-                                       health_code_update();
-
-                                       break;
                                }
                        } else {
                                /*
@@ -1312,7 +1308,6 @@ static void *thread_manage_apps(void *data)
 
                                        /* Socket closed on remote end. */
                                        ust_app_unregister(pollfd);
-                                       break;
                                }
                        }
 