[gmf-dev] [modeling-dev] Modeling projects fail to publish update site bits due to nfs problems (was Re: teneo not getting published on emf update sites)
I can wait, but it'll certainly need a look-see next week. No point 
ruining Denis' weekend w/ something that could take days (or minutes?) 
to solve.
Copying modeling-dev@ so as to minimize the freakouts in the meantime. :)
N
Webmaster (Karl Matthias) wrote:
I'm seeing some NFS errors on build like "build kernel: nfsacl: RPC call 
returned error 22".  Looks like RPC calls for ACLs are failing on 
occasion and that might be what's going on.  At least it makes some 
sense given what's happening.  I don't recall seeing these before, but I 
need to bounce this off of Denis who has spent more time on this stuff 
than I have to see what he thinks.  They look like they've only been 
happening in the last couple of weeks.  He's in on Monday, so unless you 
think this is urgent I'll wait until then and ping him first.
Cheers,
Karl
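On Linux, errno 22 is EINVAL ("Invalid argument"), the same text cp reported, which suggests (but does not prove) that the two are related. A minimal sketch for counting such kernel messages in a saved log follows; the function name and the log path in the usage comment are illustrative assumptions, not something taken from the build server.

```shell
#!/usr/bin/env bash
# Count log lines that report a failed nfsacl RPC call.
# Hypothetical helper: reads the file(s) given as arguments, or stdin
# when called without arguments.
count_nfsacl_errors() {
    # grep -c prints the number of matching lines. It exits with status 1
    # when that number is 0, so "|| true" keeps the exit status clean.
    grep -c 'nfsacl: RPC call returned error' "$@" || true
}

# Usage (the log location is an assumption; it varies by distribution):
#   count_nfsacl_errors /var/log/messages
#   dmesg | count_nfsacl_errors
```

Comparing the timestamps of these lines with the times of the failed promotes would show whether the two actually line up.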
Martin Taal wrote:
The cp was from my home dir to the update site dir. However, the unzip 
I did was from within this directory to this directory itself. I did 
not even mention that before I tried this unzip action I tried to copy 
the whole interim directory to an interim_orig directory (as backup) 
in /home/data/httpd/download.eclipse.org/modeling/emf/updates/
and I got this same Invalid argument exception.
So my guess is that there is something wrong with 
/home/data/httpd/download.eclipse.org/modeling/emf/updates and not 
with the home directories necessarily.
gr. Martin
Nick Boldt wrote:
That's seriously foo'd up. I've never seen Linux fail on the first 
try and then succeed on the second with the exact same command.
I expect that behaviour from Windows, or p2, but Linux? Yikes. What's 
the world coming to? :P
p2 has this on-again-off-again problem because of out of date or 
hard-to-ping mirrors, so sometimes an install fails, sometimes it 
works (using the same input). I just had this happen to me today 
trying to install something that depends on a feature that was only 
added to the BIRT 2.5 update site less than a day ago.
So... could this filesystem problem be related to network access 
snafus between mtaal's home dir (or mine, we tried building as nickb 
too) and /home/data/httpd/download.eclipse.org, both ostensibly on 
build.eclipse.org but in reality on different disks in different 
physical places?
N
Martin Taal wrote:
Hi,
I tried some things directly on build.eclipse.org in the 
/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim 
directory.
First I tried to copy a zip file (with the update site) to the above 
update site directory. The first try fails with the invalid argument 
exception, when I try it again with the exact same argument it works 
out fine:
mtaal@build:/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim> cp ~/emf-teneo-1.1-M-M200907090216.zip .
cp: cannot create regular file `./emf-teneo-1.1-M-M200907090216.zip': Invalid argument
mtaal@build:/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim> cp ~/emf-teneo-1.1-M-M200907090216.zip .
mtaal@build:/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim>
Then I unzipped this zip file in the same directory and got these 
errors (truncated for this email, see the attachment for the full 
list):
mtaal@build:/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim> unzip emf-teneo-1.1-M-M200907090216.zip
Archive:  emf-teneo-1.1-M-M200907090216.zip
replace header.interim.site.xml? [y]es, [n]o, [A]ll, [N]one, [r]ename: A
error:  cannot create header.interim.site.xml
error:  cannot create buildUpdateSiteXML.sh
error:  cannot create buildUpdateSiteDigest.sh
error:  cannot create buildUpdateSiteMetadata.sh
error:  cannot create jarlist.clean.modeling.eclipse.org.txt
I am not sure what's wrong but it does not feel right.... It is also 
odd that I needed two cp attempts before I could copy the zip file.
gr. Martin
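Since the identical cp succeeded on the second attempt, one stopgap (not a fix for whatever is wrong underneath) would be to retry the operation a few times. A rough sketch follows; `retry` and its arguments are made up for illustration and are not part of the Modeling build scripts.

```shell
#!/usr/bin/env bash
# retry MAX_TRIES DELAY_SECONDS COMMAND [ARGS...]
# Runs COMMAND until it succeeds or MAX_TRIES attempts have been made.
retry() {
    local max_tries=$1 delay=$2 attempt=1
    shift 2
    until "$@"; do
        if [ "$attempt" -ge "$max_tries" ]; then
            echo "retry: giving up after $attempt attempts: $*" >&2
            return 1
        fi
        attempt=$((attempt + 1))
        sleep "$delay"
    done
}

# Usage, with the file name from the transcript above:
#   retry 3 2 cp ~/emf-teneo-1.1-M-M200907090216.zip .
```

A wrapper like this only hides the symptom, so the failures it swallows are still worth logging.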
Webmaster (Karl Matthias) wrote:
Hi Guys,
That POSIX ACL looks fine to me...  are you sure you're 
experiencing a file permissions error?
Cheers,
Karl
Martin Taal wrote:
Hi Nick,
I mean build.eclipse.org.
Yes maybe... but to be honest I don't understand what these facl 
things mean...
Also Patrick has the same problems and promoting under your user 
did not work either.
gr. Martin
Nick Boldt wrote:
The folder 
/home/data/httpd/download.eclipse.org/modeling/emf/updates/interim 
on dev.eclipse.org (or build.eclipse.org, same thing(?)) says 
that the files are mostly owned by group modeling.emf.website; 
you're in that group. The only files I see which aren't in that 
group (and group-writable) are owned by mtaal:common.
So maybe it's an ACL problem?
$ getfacl /home/data/httpd/download.eclipse.org/modeling/emf/updates/interim
getfacl: Removing leading '/' from absolute path names
# file: home/data/httpd/download.eclipse.org/modeling/emf/updates/interim
# owner: nickb
# group: modeling.emf.website
user::rwx
user:khussey:rwx
user:nickb:rwx
group::rwx
mask::rwx
other::r-x
default:user::rwx
default:user:khussey:rwx
default:user:nickb:rwx
default:group::rwx
default:mask::rwx
default:other::r-x
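To read a listing like this at a glance, something along these lines pulls out the named users that have an explicit write entry. It is only a sketch: `acl_named_writers` is a hypothetical helper, and it ignores the group, mask and default entries, so it does not compute effective permissions.

```shell
#!/usr/bin/env bash
# acl_named_writers: read getfacl output on stdin and print the named
# users whose access entry includes write permission.
acl_named_writers() {
    # Access entries look like "user:NAME:rwx". The owner entry has an
    # empty NAME ("user::rwx") and default entries start with "default:",
    # so neither matches the conditions below.
    awk -F: '$1 == "user" && $2 != "" && $3 ~ /^.w/ { print $2 }'
}

# Usage:
#   getfacl /path/to/dir 2>/dev/null | acl_named_writers
```

For the listing above this prints khussey and nickb. mtaal has no named entry, so his write access would have to come from the group entry.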
Martin Taal wrote:
Hi Nick
I did that, see the buildUpdateSite2.sh and buildUpdateSite3.sh
My conclusion is that it fails in this step:
               unzip -uo -qq ~/${siteLabel}-${buildID}.zip;
In the script see the pushd statement just before, afaik that 
causes the unzip to take place in the update site directory. And 
there it fails! Looking at the update site directory the 
permissions look strange also.
I managed to get the zip file to my home directory on 
build.eclipse.org.
Then I have this script (see the modeling/script directory on 
modeling.eclipse.org and then the buildUpdateSite3.sh):
       ssh mtaal@xxxxxxxxxxxxxxxxx "
               echo -e 'Unzipping';
               pushd /home/data/httpd/download.eclipse.org/modeling/emf/updates/interim >/dev/null;
               unzip -uo -qq ~/emf-teneo-1.1-M-M200907090216.zip
       ";
this one fails.
Can you check/see what is wrong with the update site directory 
permissions on build.eclipse.org?
gr. Martin
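Note that in the snippet above the commands are joined with `;`, so unzip runs even if pushd fails, in whatever directory the shell happens to be in. A hedged sketch of a stricter variant follows; `unpack_site` is a hypothetical name, and the `-w` test may not reflect what an NFS server with ACLs will actually allow.

```shell
#!/usr/bin/env bash
# unpack_site ZIP TARGET_DIR
# Hypothetical wrapper around the unzip step: fail loudly instead of
# unzipping into the wrong place. ZIP should be an absolute path,
# because the function changes directory before calling unzip.
unpack_site() {
    local zip=$1 target=$2
    [ -r "$zip" ]    || { echo "cannot read zip: $zip" >&2; return 2; }
    [ -d "$target" ] || { echo "no such directory: $target" >&2; return 3; }
    [ -w "$target" ] || { echo "directory not writable: $target" >&2; return 4; }
    # Subshell so the caller's working directory is left alone; "&&" so
    # unzip only runs if the cd succeeded. Same unzip flags as the script.
    ( cd "$target" && unzip -uo -qq "$zip" )
}

# Usage, with the paths from the script above:
#   unpack_site ~/emf-teneo-1.1-M-M200907090216.zip \
#       /home/data/httpd/download.eclipse.org/modeling/emf/updates/interim
```

The distinct return codes make it easier to tell from a promo log whether the zip, the directory, or unzip itself was the problem.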
Nick Boldt wrote:
check buildUpdateSite.sh - match console output to echo "..." 
statements.
Martin Taal wrote:
We are in the same boat :-), with you as a user it also fails:
http://modeling.eclipse.org/promo_logs/promo_log_teneo_1.1.1.M200907090216_2009-07-10-05.37.52.txt 
As the error shows the complete contents of the zip I assume 
that the zip file is there and correct...
So I would think the error is with the target location. Does 
the unpack use a specific target directory? Or does it unpack 
in the home directory of the user?
Where is this unpack command? I could not find it in 
promoteToEclipse.sh
gr. Martin
Nick Boldt wrote:
sure, you can switch "mtaal" for "nickb" in your _common.php 
file.
Martin Taal wrote:
Hi Nick,
And is it possible to promote under another user (you! for 
example)? Just for this and maybe one other build before 
moving on to the new build system.
gr. Martin
Martin Taal wrote:
damn same result... I am at a total loss. Well going to 
sleep now, tomorrow is another day...
Thanks for your help Nick, of course if you get an idea 
while I am sleeping and you are awake then let me know!
gr. Martin
Nick Boldt wrote:
cd /var/www/html/modeling/emf; sudo su; chown -R apache:www *; chmod -R g+w *
Done. Try again?
Martin Taal wrote:
I tried but there is no time to press ctrl-c between the 
upload and the remove of the temp files....
In addition it seems that ctrl-c does not work (it looks 
like a forked process or something).
One thing I find strange is that in the directory
/var/www/html/modeling/emf/updates/interim
I see files owned by apache www and by mtaal. Everything has 
apache www except the teneo files I have been trying out. Can this 
be a symptom of a problem?
gr. Martin
Nick Boldt wrote:
yeah, you can run promoteToEclipse.sh via commandline and hit CTRL-C
while it's running. :)
On Thu, Jul 9, 2009 at 3:11 PM, Martin Taal<mtaal@xxxxxxxxx> wrote:
 
I saw in the log that there is a remove step of temporary zips etc.
Can this remove be disabled? To see which files are really there.
gr. Martin
Nick Boldt wrote:
 
So... you scp a file to mtaal@xxxxxxxxxxxxxxxxx's home dir, then ssh
in to unpack it and the unpacking fails to create files.
I tried ssh'ing in by hand as you and touching files in your home dir
and the target dir and it worked fine.
Weirder is that this has worked for months untouched... until now.
Has anything changed in your .bashrc on build.eclipse.org? Did the
reboot a couple weeks ago break anything, like umask settings, default
group, or has the filesystem changed so that "~" isn't properly
resolving to your home dir when pushed up as
mtaal@xxxxxxxxxxxxxxxxx:~/emf-teneo-1.1-M-M200907090216.zip ?
Do we know for sure that that temporary zip is even being produced on
modeling.eclipse? Could there be a failure higher up in the build
itself?
N
On Thu, Jul 9, 2009 at 1:14 PM, Martin Taal<mtaal@xxxxxxxxx> wrote:
 
Here it is:
http://modeling.eclipse.org/promo_logs/promo_log_teneo_1.1.1.M200907090216_2009-07-09-13.08.14.txt 
My guess is that moving to Athena will be a few weeks work for me. The
last time I did major things regarding builds was in February (getting
a separate eclipselink feature). That took me about a month throughput
time. So I am not at all optimistic about the time it takes to
accomplish that....
So as these builds are done and solve some issues it would be nice to
get them out of the way and then move to the new build environment.
gr. Martin
Nick Boldt wrote:
 
Try it again.
The line in the log with "Promote zip to build.eclipse.org..."
has been changed to "Promote zip to
$eclipseSSHUser:~/${siteLabel}-${buildID}.zip",
which will give us a clue as to whose account is being used, and where
the zip is created.
I wonder if the problem is nfs write lag? In theory using your home dir
on build.eclipse should be fine, but maybe there's a write delay between
uploading the zip (scp or rsync) and it being available when we ssh in a
few seconds later to unpack it & run the script to generate the metadata.
We used to have this problem when we used download1.eclipse.org because
you never knew which node you'd get with each connection and it could
take up to 30 mins for the uploaded file to be available on all nodes.
But with the increased space on build.eclipse, I thought we were past
this problem...
Of course if you switch to the Athena build then you run on
build.eclipse and publishing is a simple copy (no ssh connection
required)... if problems persist, that'd be the way to go IMHO.
N
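If write lag were the cause, the unpack step could poll for the uploaded zip before touching it. A minimal sketch under that assumption follows; `wait_for_file` is a hypothetical helper, and a non-empty file is not proof that the upload has finished.

```shell
#!/usr/bin/env bash
# wait_for_file PATH [TIMEOUT_SECONDS]
# Polls once per second until PATH exists and is non-empty, or until the
# timeout (default 60) runs out. Returns 0 if the file showed up.
wait_for_file() {
    local path=$1 timeout=${2:-60} waited=0
    until [ -s "$path" ]; do
        if [ "$waited" -ge "$timeout" ]; then
            echo "timed out waiting for $path" >&2
            return 1
        fi
        sleep 1
        waited=$((waited + 1))
    done
}

# Usage, with the zip name used elsewhere in this thread:
#   wait_for_file ~/emf-teneo-1.1-M-M200907090216.zip 120 && echo ready
```

Comparing a checksum taken on the sending side would be a stronger check than file size alone.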
Martin Taal wrote:
 
Hi Nick,
Thanks, the promote gets further but I still see errors around the
update of the interim site:
http://modeling.eclipse.org/promo_logs/promo_log_teneo_1.1.1.M200907090216_2009-07-09-12.11.47.txt 
gr. Martin
Nick Boldt wrote:
 
File and dir permissions were incorrect.
You can now
$ W bash
$ ssh mtaal@xxxxxxxxxxxxxxxxxxxx
without a password prompt; try your promote again.
Martin Taal wrote:
 
Hi Nick,
The build on modeling.eclipse.org went fine. However with promote I
get a permission denied exception:
http://modeling.eclipse.org/promo_logs/promo_log_teneo_1.1.1.M200907090216_2009-07-09-05.06.23.txt 
I can ssh from modeling to build, to dev and to download1 without
logging in. I can also ssh as the webuser to build:
W ssh mtaal@xxxxxxxxxxxxxxxxx
I updated the promote properties from cvs also.
So I am not sure what the remaining permission issue can be.
gr. Martin
Nick Boldt wrote:
 
Fwiw, build.eclipse.org is a real 4-core ppc box with lots of ram and
nfs-mounted disc space.
If you want to move there, Athena does everything the old Modeling
system does except galileo .build file generation (coming soon) and
javadoc generation in the doc plugin (waiting for someone to
rearchitect the crap we have now). You'd also get your own update site
instead of being merged w/ EMF.
Something to consider.
N
On 7/8/09, Nick Boldt <nickboldt@xxxxxxxxx> wrote:
   
It's a virtual server, so I'm not sure. 'cat /proc/meminfo' should
tell you, iirc.
On 7/8/09, Martin Taal <mtaal@xxxxxxxxx> wrote:
     
Hi Nick,
Thanks I am building right now on that server. One question, it seems
that this server has again (only) 512mb. Is that really true?
gr. Martin
Nick Boldt wrote:
       
ssh login for modeling.eclipse: use u: mtaal, 
p: mtaal1 (please
change
it once you're in using `passwd`)
for build page, use u: emf-build, p: emf$YAll
migration steps are here:
https://bugs.eclipse.org/bugs/show_bug.cgi?id=273485#c19 
Martin Taal wrote:
         
Yes I did try:
mtaal@xxxxxxxxxxxxxxxxx
and that worked fine (no password needed)
Can you forward your note again?
what is the login/password for modeling.eclipse.org?
I don't remember the pwd for emft.eclipse.org anymore..... (login was
emft-build)
gr. Martin
Nick Boldt wrote:
           
Close... But you should be connecting as mtaal@build.eclipse, I think.
Have you looked at building on modeling.eclipse instead of
emft.eclipse?
Emft goes off the air next week... Did you see my note about that?
On 7/8/09, Martin Taal <mtaal@xxxxxxxxx> wrote:
             
ssh mtaal@xxxxxxxxxxxxxxxxx works fine
when doing:
W ssh apache@xxxxxxxxxxxxxxxxx or
ssh www-data@xxxxxxxxxxxxxxxxx
then I need to enter a password for both.
Is this what you mean?
gr. Martin
Nick Boldt wrote:
               
Check that you and www-data@ or apache@ can ssh to build.eclipse.org,
and that you have write permission once there.
On 7/8/09, Martin Taal <mtaal@xxxxxxxxx> wrote:
                 
Hi Nick,
It seems that Teneo maintenance builds are not published on EMF update
sites. Can you see what is wrong?
Here is the promolog (which contains several errors):
http://emft.eclipse.org/promo_logs/promo_log_teneo_1.1.1.M200907080536_2009-07-08-06.23.36.txt 
Thanks!
--
With Regards, Martin Taal
Springsite/Elver.org
Office: Hardwareweg 4, 3821 BV Amersfoort
Postal: Nassaulaan 7, 3941 EC Doorn
The Netherlands
Cell: +31 (0)6 288 48 943
Tel: +31 (0)84 420 2397
Fax: +31 (0)84 225 9307
Mail: mtaal@xxxxxxxxxxxxxx - 
mtaal@xxxxxxxxx
Web: www.springsite.com - www.elver.org
                                  
                      
--
Sent from my mobile device
Nick Boldt :: JBoss by Red Hat
Productization Lead :: JBoss Tools & Dev Studio
Release Engineer :: Dash Athena
http://nick.divbyzero.com
                    
                  
    
  
--
Nick Boldt :: http://nick.divbyzero.com
Release Engineer :: Eclipse Modeling & Dash Athena
_______________________________________________
modeling-dev mailing list
modeling-dev@xxxxxxxxxxx
https://dev.eclipse.org/mailman/listinfo/modeling-dev