RedDem0n

Folding for DIY-Street - Team: DIY-Street - Team #45454 (disease)

  

298 members have voted

    • Aliens: 60
    • Disease: 170
    • because Angry has a straight-flush and you only got 2 pair: 59
    • other (please post...I don't know what else to fold for besides aliens/disease): 8



Hey guys, I'm wondering if anyone has had this happen, or can point me in the right direction...

 

Seemingly at random while folding, the client will just stop and both cores will drop to 0% usage; my computer just sits there idle. Temperatures are normal, and there are no errors in the FAH window or in the log.

 

I haven't had problems with any other programs, and the system is stable as well. I wasn't paying attention today, and lost 7 hours of good folding time...

 

Thoughts?


Here is a snippet of my log file:

 

[15:11:22] Completed 390000 out of 500000 steps (78 percent)

[15:37:58] Completed 395000 out of 500000 steps (79 percent)

[16:04:39] Completed 400000 out of 500000 steps (80 percent)

[16:31:07] Completed 405000 out of 500000 steps (81 percent)

[16:57:14] Completed 410000 out of 500000 steps (82 percent)

[17:22:51] Completed 415000 out of 500000 steps (83 percent)

[17:49:09] Completed 420000 out of 500000 steps (84 percent)

[18:15:29] Completed 425000 out of 500000 steps (85 percent)

[18:41:57] Completed 430000 out of 500000 steps (86 percent)

 

Folding@Home Client Shutdown at user request.

 

Folding@Home Client Shutdown.

 

 

--- Opening Log file [May 29 02:24:18]

 

 

# SMP Client ##################################################################

###############################################################################

 

Folding@Home Client Version 5.91beta

 

http://folding.stanford.edu

 

###############################################################################

###############################################################################

 

Launch directory: C:\Program Files\addons\Folding\SMP Client V1.01

Executable: C:\Program Files\addons\Folding\SMP Client V1.01\fah.exe

 

 

[02:24:18] - Ask before connecting: No

[02:24:18] - User name: Jachyra007 (Team 45454)

[02:24:18] - User ID: 2E9D7E0054212E48

[02:24:18] - Machine ID: 1

[02:24:18]

[02:24:18] Loaded queue successfully.

[02:24:18]

[02:24:18] + Processing work unit

[02:24:18] Core required: FahCore_a1.exe

[02:24:18] Core found.

[02:24:18] Working on Unit 03 [May 29 02:24:18]

[02:24:18] + Working ...

[02:24:19]

[02:24:19] *------------------------------*

[02:24:19] Folding@Home Gromacs SMP Core

[02:24:19] Version 1.74 (March 10, 2007)

[02:24:19]

[02:24:19] Preparing to commence simulation

[02:24:19] - Ensuring status. Please wait.

[02:24:36] - Looking at optimizations...

[02:24:36] - Working with standard loops on this execution.

[02:24:36] Examination of work files indicates 8 consecutive improper terminations of core.

[02:24:42] - Expanded 2441606 -> 12854153 (decompressed 526.4 percent)

[02:24:43]

[02:24:43] Project: 2651 (Run 0, Clone 559, Gen 9)

[02:24:43]

[02:24:45] Entering M.D.

[02:24:51] Calling FAH init

[02:24:52] in POPC

[02:24:52] Writing local files

[02:24:52] checkpoint)

[02:24:52] Read checkpoint

[02:24:53] 0 steps (86 percent)

[02:24:53] Extra SSE boost OK.

[02:24:53] es

[02:24:53] Completed 432211 out of 500000 steps (86 percent)

 

At the [18:41:57] timestamp it quit; nothing is logged after that.

Just before the [02:24:18] stamp, I noticed it, closed the client with Ctrl+C, and restarted it.
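
For anyone who wants to catch this kind of silent stall sooner, here is a minimal Python sketch (not part of the FAH client) that reads the "Completed ... steps" lines from a log like the one above, works out the typical gap between them, and flags a stall when the time since the last line is several times that gap. The function names, the factor of 3, and the idea that a negative time difference means exactly one midnight rollover (the log lines carry no dates) are all assumptions made for illustration.

```python
import re
from datetime import datetime, timedelta

# Matches progress lines such as
# "[18:41:57] Completed 430000 out of 500000 steps (86 percent)"
PROGRESS = re.compile(r"^\[(\d{2}:\d{2}:\d{2})\] Completed \d+ out of \d+ steps")


def parse_stamp(stamp):
    """Turn "HH:MM:SS" into a datetime (the date part is a dummy)."""
    return datetime.strptime(stamp, "%H:%M:%S")


def elapsed(earlier, later):
    """Time from earlier to later. The log has no dates, so a negative
    difference is assumed to mean one midnight rollover."""
    delta = parse_stamp(later) - parse_stamp(earlier)
    if delta < timedelta(0):
        delta += timedelta(days=1)
    return delta


def progress_stamps(log_text):
    """Collect the timestamps of all progress lines, in order."""
    stamps = []
    for line in log_text.splitlines():
        match = PROGRESS.match(line.strip())
        if match:
            stamps.append(match.group(1))
    return stamps


def looks_stalled(log_text, now_stamp, factor=3):
    """True if the time since the last progress line exceeds
    factor times the average gap between earlier progress lines."""
    stamps = progress_stamps(log_text)
    if len(stamps) < 2:
        return False  # not enough history to judge
    gaps = [elapsed(a, b) for a, b in zip(stamps, stamps[1:])]
    typical = sum(gaps, timedelta(0)) / len(gaps)
    return elapsed(stamps[-1], now_stamp) > factor * typical


SAMPLE = """\
[17:49:09] Completed 420000 out of 500000 steps (84 percent)
[18:15:29] Completed 425000 out of 500000 steps (85 percent)
[18:41:57] Completed 430000 out of 500000 steps (86 percent)
"""

print(looks_stalled(SAMPLE, "19:00:00"))  # False: about 18 minutes since the last line
print(looks_stalled(SAMPLE, "02:24:18"))  # True: over 7.5 hours since the last line
```

With the timestamps from the log, the gaps are about 26 minutes each, so the silence between [18:41:57] and the [02:24:18] restart would have been flagged after roughly an hour and twenty minutes instead of being noticed by chance.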

 



Is the following problem from the SMP FAQ on the Folding.stanford.edu website still valid?

 

2. When something happens to my network (changing settings or other tweaks), the FAH/SMP client has problems.

One issue with the SMP client is that the client uses MPI to handle multiple processors and MPI uses the network system (albeit on the local loopback device). If the network is tweaked during a run, this can cause problems for the loopback device, causing problems with MPI, causing Gromacs to fail.

 

What should (in principle) then happen is that the client should restart from a check point. Are you finding this to be the case? It could be that the core can't identify this situation well and the client sees a major problem and restarts, not knowing something better to do.

 

We are looking into this and in particular whether we can detect this well enough such that the client restarts from a checkpoint (best case scenario). For now, please don't change the network settings while FAH/SMP is running (you can always stop the FAH client, change the settings, and then restart the client later).

 

I have been having network issues the past coupla days, and the timing almost [+/- 0.5 hours] matches up with my issues today.
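
Since the FAQ says MPI talks over the local loopback device, one quick sanity check after a network hiccup is to see whether a plain TCP round trip over 127.0.0.1 still works. The Python sketch below does only that; it is an assumption-laden illustration, not how the SMP client or MPI actually tests the loopback, and the function name `loopback_ok` is made up for this example.

```python
import socket


def loopback_ok(payload=b"ping", timeout=2.0):
    """Return True if a TCP round trip over 127.0.0.1 succeeds."""
    try:
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as server:
            server.settimeout(timeout)
            server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
            server.listen(1)
            port = server.getsockname()[1]
            # The connection completes in the listen backlog, so a single
            # thread can connect first and accept afterwards.
            with socket.create_connection(("127.0.0.1", port), timeout=timeout) as client:
                conn, _ = server.accept()
                with conn:
                    conn.settimeout(timeout)
                    client.sendall(payload)
                    received = b""
                    while len(received) < len(payload):
                        chunk = conn.recv(len(payload) - len(received))
                        if not chunk:
                            break  # peer closed early
                        received += chunk
                    return received == payload
    except OSError:
        # Covers refused connections, timeouts, and a missing loopback device.
        return False


print("loopback OK" if loopback_ok() else "loopback problem")
```

A passing check only says the loopback works right now; it cannot tell you whether it dropped out briefly during a run, which is the situation the FAQ describes.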


Update! Big Johnson is strong and virile once again! Now with improved 3.7GHz E6600 OC and 773/870MHz OC on the X1900XTX. Man am I happy it works again. It'll be a bit before he gets folding, I've got some gaming I desperately need to do.


For whatever reason, Win SMP acts goofy at times; it could have something to do with your ISP, or it could be something on Stanford's end. I have lost a few WUs for no apparent reason. Beta programs are that way.


Alrighty, I've got an update for you all. Big Johnson is now folding once again. My new folding farm lineup is as follows.

 

Big Johnson - 3.6GHz E6600 Current project: 2610, 10min 15sec per %

Little Willy - 2.9GHz E6300 Current project: 2609, 25min 30sec per %

Red-Headed Step Child - 2.0GHz E4300 Current project: 2652, 19min 20sec per %
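
To put those per-percent times in perspective, a rough Python sketch can scale them up to a whole work unit. It assumes every percent takes the same time as the figure quoted, which is a simplification, and the three rigs are on different projects, so the totals are not a direct speed comparison.

```python
from datetime import timedelta

# Per-percent times quoted in the lineup above.
RIGS = {
    "Big Johnson": timedelta(minutes=10, seconds=15),
    "Little Willy": timedelta(minutes=25, seconds=30),
    "Red-Headed Step Child": timedelta(minutes=19, seconds=20),
}


def hours_per_work_unit(per_percent):
    """Hours for a full work unit, assuming each percent takes the same time."""
    return (per_percent * 100).total_seconds() / 3600


for name, per_percent in RIGS.items():
    print(f"{name}: about {hours_per_work_unit(per_percent):.1f} hours per work unit")
```

On those assumptions the work units come out at roughly 17, 42.5, and 32 hours respectively.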

 

@CPDMF Nice folding there, guy. Outproducing me this week.

@Unk David Get your butt to work. :)


My Yonah setup actually surprised me a little with how much it was kicking out. Just chugging along at 1.8GHz @ 1.0 volts. No match for a 3.4 C2D, but it doesn't use anywhere near the power. Might have to sling another one together and just let them both go 24/7.


Anyone else noticing that their SMP points haven't been updated in a few days now? I've probably gained another 6k or more in the last three days, and only my mom's rig's regular 200-500 point WUs are giving me anything.
