test: Allow all valid AIX rc in test-stdio-closed #10239

sxa · 2016-12-12T17:55:03Z

Checklist

make -j4 test (UNIX), or vcbuild test nosign (Windows) passes
tests and/or benchmarks are included
commit message follows commit guidelines

Affected core subsystem(s)

test

Description of change

Allow either of the two possible return codes on AIX to
be considered successful on test-stdio-closed.js. This is likely
an interim solution until we can be sure when AIX returns
one or the other, but I currently have systems that have both
and I don't want this failing on any of them.

Fixes: nodejs#10234 Allow either of the two possible return codes on AIX to be considered successful on test-stdio-closed.js

sam-github · 2016-12-12T18:27:42Z

LGTM, to get things stable.

The test could use a comment on what it is testing, but I git blamed it. It looks like its attempting to ensure that fd 1 and 2 are always valid, and get opened on /dev/null if they aren't valid at startup: b5f25a9

I wonder if the test would be more robust if instead of calling process.exit(), it let the output data drain, had not try/catches, and asserted an exit code of zero, to guarantee node flushed all the data. Is it possible that sometimes stream.write() isn't touching the fd right away, so isn't noticing whether its valid or not until later, by which time the process has exited already?

/cc @bnoordhuis

Trott

Hopefully only an interim solution, but LGTM as both results comply with spec, as far as I am able to tell.

sam-github · 2016-12-12T19:27:21Z

@Trott 126 means stdout had a EBADF, which means its not working as intended, b5f25a9 is supposed to replace EBADF descriptors with a new descriptor open on /dev/null

I guess there is no way to just mark a test as skipped, so its more clearly something to come back to later?

Trott · 2016-12-12T19:35:23Z

@sam-github Not working as the Node.js code expects, but working in a correct fashion nonetheless. At least, that's my interpretation. Correction welcome. It seems to me that Node.js is trying to take advantage of a behavior that is permitted on other operating systems, but is not the only correct behavior.

Trott · 2016-12-12T19:43:36Z

@sam-github To get a little more specific about my understanding (and hopefully not my misunderstanding) about this issue: Node.js attempts to re-open the EBADF fd on /dev/null. POSIX allows the operating system to permit opening the closed stdio fd on /dev/null, but it does not require the operating system to permit it. So the test should accommodate the unexpected-by-Node.js behavior on AIX, at least as a minimal workaround.

(I'm certainly open to more robust solutions that might involve either addressing the issue on the C++ side or finding a way for the test to query the operating system ahead of time to see what the expected result should be.)

Trott · 2016-12-12T19:47:40Z

@sam-github wrote:

I guess there is no way to just mark a test as skipped, so its more clearly something to come back to later?

I think we could do something like this perhaps:

if (common.isAix and exitCode === 126) {
  common.skip('Skipping this test on AIX because blah blah something something.');
  return;
}

assert.strictEqual(exitCode, 42);

Not sure if there's an issue where that won't work in an exit handler, but I think it should.

sam-github · 2016-12-12T20:47:05Z

POSIX allows the operating system to permit opening the closed stdio fd on /dev/null, but it does not require the operating system to permit it.

Its not a spec corner, its just a normal open of a file, and it works on AIX usually. Whatever is going on here smells more like a race condition. Or possibly a difference in allocation pattern of new descriptors, but it only fails sometimes.

date > /dev/null is an example of something that would be broken if you couldn't redirect stdout to dev/null, and that's POSIX shell syntax.

Trott · 2016-12-12T22:49:16Z

date > /dev/null is an example of something that would be broken if you couldn't redirect stdout to dev/null

I'm not saying you can't redirect stdout to /dev/null. I'm saying that it's not clear that you must be permitted to re-open the stdout fd on /dev/null after that fd has been explicitly closed. Or perhaps a bit more specifically, it's not clear that open("/dev/null", O_RDWR) is required to return fd 1 after fd 1 has been closed. Maybe the OS considers fd 1 unavailable and instead returns 3 or whatever.

Interestingly, the open group man page for open() says it returns the lowest FD not currently open for the process. But AIX's man page for open() says it returns the lowest FD not previously open for the process. The former suggests it should return 1 if stdout is closed, but the latter suggests that it may never return 1 after stdout is closed.

(If I'm betraying profound ignorance and I should just stop already, you can say, "That's not how it works." and I'll stop. Just trying to be helpful.)

mhdawson · 2016-12-12T22:53:46Z

I would be happier with marking the test as flaky until we figure out the right answer.

sxa · 2016-12-13T11:00:54Z

Bear in mind it's generally completely deterministic on any given machine/environment in terms of which of the two results it gives, so marking it flaky on AIX (thus effectively ignoring the test) might be overkill and devalue its execution

sam-github · 2016-12-13T18:26:06Z

@Trott I'll talk about UNIX systems programming as long as you want, probably longer ;-). In the man pages you reference, previously and current mean the same thing, effectively. Previous means before open() was called, but current means at the moment open() was called. Since no change occurs between "before" a function is called and "current" to a function being called, they mean the same thing: that open is required to return the lowest numerically valued fd that it can. The text is confusing when compared to each other as you did, because it seems to imply a difference, but so it goes.

The use-lowest available fd behaviour is ancient UNIX behaviour. A system that didn't do that would break the world, whether POSIX speced it or not (though it looks like they did from what you posted, or at least the opengroup did later, when they cleaned up the stuff POSIX argued about).

Trott · 2016-12-13T20:23:17Z

@sam-github Yeah, it's all coming back to me now that this issue is more subtle than "AIX behaves differently". I mean, it does, but it's something subtle. Probably not going to discover the source of the issue by comparing man page texts. On the upside, this made me write a quick little C++ program to confirm my understanding and everything you're saying. Hooray for 10 more lines of C++ than I've written the entire rest of the year.

mhdawson · 2016-12-13T22:20:25Z

@sxa555 I see your point, my one concern is that its easier to forget once the test has changed. Marked flaky in the status file it is obvious that something needs to be fixed. I'm willing to defer to the others on the thread if they want to chime in on which way is best.

@sam-github the status file in the directory with the tests can be used to mark a test as either:

skip - don't run at all
run - but accept failure as ok. In the the job shows up as yellow instead of read in our CI when a failure occurs.

gibfahn · 2016-12-15T04:00:37Z

Refs: #8375

mhdawson · 2016-12-15T20:59:46Z

@sam-github @Trott if you 2 prefer landing with the option to accept either lets just do that. Can you comment here to confirm that.

Trott · 2016-12-15T21:45:58Z

It looks like @jBarz has determined the problem point and that it appears to be a ksh bug (AIX-specific or otherwise). See #10234 (comment)

If a workaround like I propose in #10234 (comment) is feasible, I think I'd prefer that to the change in this PR or skipping the test. The change I propose there has the benefit of still testing the C++ code that opens the fd on /dev/null, which is the point of the test.

mhdawson · 2016-12-15T22:35:15Z

@Trott, If running under bash allows the test to pass reliably that sounds good to me.

gibfahn

Discussion is ongoing in #10234, not approving to make sure no-one lands this by mistake.

richardlau · 2017-01-24T17:21:32Z

This was superseded and addressed by #10339

test: Allow all valid AIX rc in test-stdio-closed

e133cce

Fixes: nodejs#10234 Allow either of the two possible return codes on AIX to be considered successful on test-stdio-closed.js

nodejs-github-bot added lts-watch-v4.x test Issues and PRs related to the tests. labels Dec 12, 2016

Trott approved these changes Dec 12, 2016

View reviewed changes

mscdex added the aix Issues and PRs related to the AIX platform. label Dec 12, 2016

gibfahn suggested changes Dec 16, 2016

View reviewed changes

italoacasas added the blocked PRs that are blocked by other issues or PRs. label Dec 18, 2016

jasnell added the wip Issues and PRs that are still a work in progress. label Dec 27, 2016

gibfahn closed this Jan 25, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

test: Allow all valid AIX rc in test-stdio-closed #10239

test: Allow all valid AIX rc in test-stdio-closed #10239

sxa commented Dec 12, 2016

sam-github commented Dec 12, 2016 •

edited

Loading

Trott left a comment

sam-github commented Dec 12, 2016

Trott commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading

sam-github commented Dec 12, 2016

Trott commented Dec 12, 2016 •

edited

Loading

mhdawson commented Dec 12, 2016

sxa commented Dec 13, 2016

sam-github commented Dec 13, 2016

Trott commented Dec 13, 2016

mhdawson commented Dec 13, 2016

gibfahn commented Dec 15, 2016

mhdawson commented Dec 15, 2016

Trott commented Dec 15, 2016

mhdawson commented Dec 15, 2016

gibfahn left a comment

richardlau commented Jan 24, 2017

test: Allow all valid AIX rc in test-stdio-closed #10239

test: Allow all valid AIX rc in test-stdio-closed #10239

Conversation

sxa commented Dec 12, 2016

Checklist

Affected core subsystem(s)

Description of change

sam-github commented Dec 12, 2016 • edited Loading

Trott left a comment

Choose a reason for hiding this comment

sam-github commented Dec 12, 2016

Trott commented Dec 12, 2016 • edited Loading

Trott commented Dec 12, 2016 • edited Loading

Trott commented Dec 12, 2016 • edited Loading

sam-github commented Dec 12, 2016

Trott commented Dec 12, 2016 • edited Loading

mhdawson commented Dec 12, 2016

sxa commented Dec 13, 2016

sam-github commented Dec 13, 2016

Trott commented Dec 13, 2016

mhdawson commented Dec 13, 2016

gibfahn commented Dec 15, 2016

mhdawson commented Dec 15, 2016

Trott commented Dec 15, 2016

mhdawson commented Dec 15, 2016

gibfahn left a comment

Choose a reason for hiding this comment

richardlau commented Jan 24, 2017

sam-github commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading

Trott commented Dec 12, 2016 •

edited

Loading