Kernel#system bug?

Charlton · Dec 20, 2006

Running Ruby 1.8.4 on Linux

This problem seems to exist with the system() function but also has the
same problem with IO#popen. Seems the working directory of a subshell
can be affected by the command being executed? In the first call to
popen, I'm just executing "env". In the second, I'm calling "env;"
(with a semicolon. I encountered this because some commands I was
calling with system() were unable to find files that should have been
in the directory that I chdir'd to.

Program and Output below.

Thanks,
Charlton

Program:

Dir.mkdir("bug") if !FileTest.exists?("bug")
File.open("bug/bugfile", "w") { |file|
file << "Bug!"
}
Dir.chdir("bug")

puts "without semi-colon"
IO.popen("env").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

puts "with semi-colon"
IO.popen("env;").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

Output:

without semi-colon
PWD=/user/blah/tests/ruby
with semi-colon
PWD=/user/blah/tests/ruby/bug

Eric Hodel · Dec 20, 2006

Running Ruby 1.8.4 on Linux

This problem seems to exist with the system() function but also has
the
same problem with IO#popen. Seems the working directory of a subshell
can be affected by the command being executed? In the first call to
popen, I'm just executing "env". In the second, I'm calling "env;"
(with a semicolon. I encountered this because some commands I was
calling with system() were unable to find files that should have been
in the directory that I chdir'd to.

See the recent discussion on ruby-core.

Nobuyoshi Nakada · Dec 21, 2006

Hi,

At Thu, 21 Dec 2006 06:45:06 +0900,
Charlton wrote in [ruby-talk:230639]:

Program:

Dir.mkdir("bug") if !FileTest.exists?("bug")
File.open("bug/bugfile", "w") { |file|
file << "Bug!"
}
Dir.chdir("bug")

puts "without semi-colon"
IO.popen("env").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

puts "with semi-colon"
IO.popen("env;").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

Output:

without semi-colon
PWD=/user/blah/tests/ruby
with semi-colon
PWD=/user/blah/tests/ruby/bug

It is natural result.
PWD is not set by OS automatically, but set by sh.

You should use getcwd() in C or Dir.pwd in ruby instead of
relying on $PWD.

Charlton · Dec 21, 2006

Hi Nobu,

Hm, I'm not sure I understand how this is natural. It's true that the
shell sets PWD but if I'm executing the same command from within a
Kernel#system call, I would have expected the directory context to be
consistent. If the directory viewed by the shell isn't coherent with
wherever I've taken ruby to (via Dir.chdir), then I would almost say
it's a busted implementation.

Cherers,
Charlton

Nobuyoshi said:
Hi,

At Thu, 21 Dec 2006 06:45:06 +0900,
Charlton wrote in [ruby-talk:230639]:

Program:

Dir.mkdir("bug") if !FileTest.exists?("bug")
File.open("bug/bugfile", "w") { |file|
file << "Bug!"
}
Dir.chdir("bug")

puts "without semi-colon"
IO.popen("env").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

puts "with semi-colon"
IO.popen("env;").readlines.each do |entry|
puts entry if entry =~ /PWD/
end

Output:

without semi-colon
PWD=/user/blah/tests/ruby
with semi-colon
PWD=/user/blah/tests/ruby/bug

Click to expand...

It is natural result.
PWD is not set by OS automatically, but set by sh.

You should use getcwd() in C or Dir.pwd in ruby instead of
relying on $PWD.

Charlton · Dec 21, 2006

Thanks, Eric. I see the discussion over at ruby-core. Unfortunately, I
don't really understand the resolution (if there is one). I'll keep my
eyes on the mailing list.

Cheers,
Charlton

ara.t.howard · Dec 21, 2006

Hi Nobu,

Hm, I'm not sure I understand how this is natural. It's true that the
shell sets PWD

and this here is the issue. in this command there is no shell involved:

which you can confirm thusly:

harp:~ > cat a.rb
IO.popen 'env'

harp:~ > strace -f -- ruby a.rb 2>&1|grep exec
execve("/home/ahoward/bin/ruby", ["ruby", "a.rb"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e1460, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/env", ["env"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e2780, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0

yet a semi-colon terminated command does indeed invoke /bin/sh:

harp:~ > cat b.rb
IO.popen 'env;'

harp:~ > strace -f -- ruby b.rb 2>&1|grep exec
execve("/home/ahoward/bin/ruby", ["ruby", "b.rb"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e0460, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/sh", ["sh", "-c", "env;"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e0080, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/env", ["env"], [/* 50 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e4780, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0

which you see here

this is because the command 'env;' is, in fact, not valid. in a c program you
will not be able to popen it. ruby, however, is kind, when it sees the special
chars

"*?{}[]<>()~&|\\$;'`\"\n"

in your system call it runs your command via sh. this is doccumented
somewhere, though i forget where attm...

so what's happening is that, in one case, you exec 'env' which simply inherits
the parents env, including current value of PWD. in the second case you
actually exec sh, which sets ENV[PWD], which in turn runs env as a child
process.

in summary, nobu is right - simply use Dir.pwd and do not rely on auto-magical
behaviour of child processes which set, or may not set, the PWD env var.
similarly, if you want to avoid the special handling of cmd strings given to
system/popen, make sure the commands given are valid (in the 'c' sense) so you
bypass ruby filtering them via /bin/sh.

regards.

-a

Charlton · Dec 22, 2006

Thanks, Ara,

That clarifies it beautifully. I wasn't aware that Ruby actually looked
behind the scenes for shell characters in order to determine whether or
not to execute the SHELL. I understand the behaviour now. I guess the
original program that caused me to run into this snag was Perforce
(p4). It's clearly using the PWD environment variable to do its work as
is witnessed by:

tcsh> ( setenv PWD dont_exist ; p4 -v3 info | grep cwd )
RpcSendBuffer cwd = dont_exist

Thanks all for clarifying.

Cheers,
Charlton

Hi Nobu,

Hm, I'm not sure I understand how this is natural. It's true that the
shell sets PWD

Click to expand...

and this here is the issue. in this command there is no shell involved:

which you can confirm thusly:

harp:~ > cat a.rb
IO.popen 'env'

harp:~ > strace -f -- ruby a.rb 2>&1|grep exec
execve("/home/ahoward/bin/ruby", ["ruby", "a.rb"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e1460, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/env", ["env"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e2780, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0

yet a semi-colon terminated command does indeed invoke /bin/sh:

harp:~ > cat b.rb
IO.popen 'env;'

harp:~ > strace -f -- ruby b.rb 2>&1|grep exec
execve("/home/ahoward/bin/ruby", ["ruby", "b.rb"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e0460, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/sh", ["sh", "-c", "env;"], [/* 52 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e0080, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0
execve("/bin/env", ["env"], [/* 50 vars */]) = 0
set_thread_area({entry_number:-1 -> 6, base_addr:0xb75e4780, limit:1048575, seg_32bit:1, contents:0, read_exec_only:0, limit_in_pages:1, seg_not_present:0, useable:1}) = 0

which you see here

this is because the command 'env;' is, in fact, not valid. in a c program you
will not be able to popen it. ruby, however, is kind, when it sees the special
chars

"*?{}[]<>()~&|\\$;'`\"\n"

in your system call it runs your command via sh. this is doccumented
somewhere, though i forget where attm...

so what's happening is that, in one case, you exec 'env' which simply inherits
the parents env, including current value of PWD. in the second case you
actually exec sh, which sets ENV[PWD], which in turn runs env as a child
process.

in summary, nobu is right - simply use Dir.pwd and do not rely on auto-magical
behaviour of child processes which set, or may not set, the PWD env var.
similarly, if you want to avoid the special handling of cmd strings given to
system/popen, make sure the commands given are valid (in the 'c' sense) so you
bypass ruby filtering them via /bin/sh.

regards.

-a

Interactive System calls within Ruby Script	2	Sep 20, 2007
Possible bug in FileUtils::fu_mkdir (Errno::EEXIST)	0	Jul 10, 2010
System with multiple arguments fails on Windows when there areumlauts in the PATH	2	Feb 13, 2011
Bug report involving class variables	0	May 14, 2009
raven, rake, ant and ruby	6	Mar 21, 2008
[ANN] posix-spawn 0.3.0 -- first public release (codename, "tigersblood")	5	Mar 5, 2011
File size vs. Directory size problem	3	Nov 10, 2009
Is this a Ruby bug in Dir on Windows?	1	Oct 25, 2007

Kernel#system bug?

Charlton

Eric Hodel

Nobuyoshi Nakada

Charlton

Charlton

ara.t.howard

Charlton

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads