Red Hat GLOBAL FILE SYSTEM 4.7 User Guide, Page 75

File system service information and client communications messages 4–17
If a server in the HP SFS system is shut down or crashes, or if the file system itself is stopped, all client
connections go to the DISCONN state. Typically, the connections go back to alternating between the
CONNECT state and the DISCONN state after about 50 seconds. The REPLAY_WAIT state indicates
that the connection has been established and that the file system is recovering; in this case, the state
changes to FULL within a few minutes.
A convenient way to check the state of all nodes is to use the pdsh command as shown in the following
example:
# pdsh -a sfslstate | dshbak -c
NOTE: There is a known bug in the sfslstate command: during the mount process, the command
sometimes crashes with a backtrace. If this happens, wait for a few seconds and then enter the
sfslstate command again. This problem is normally only seen when large numbers (hundreds) of client
nodes are being mounted at the same time.
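
The wait-and-retry workaround described in the note can be scripted. The following is a minimal sketch, not part of the HP SFS tooling: the function name run_with_retry, the attempt count, and the delay are hypothetical, and it assumes that a crash of sfslstate shows up as a nonzero exit status.

```python
import subprocess
import time


def run_with_retry(cmd, attempts=3, delay_s=5.0,
                   runner=subprocess.run, sleep=time.sleep):
    """Run cmd; if it exits with a nonzero status, wait and try again.

    Assumption: a crash with a backtrace produces a nonzero exit status.
    The runner and sleep parameters exist only so the function can be
    exercised without the real command.
    """
    result = None
    for attempt in range(1, attempts + 1):
        result = runner(cmd, capture_output=True, text=True)
        if result.returncode == 0:
            return result
        if attempt < attempts:
            sleep(delay_s)  # "wait for a few seconds" before retrying
    return result  # last failed result; the caller decides what to do


# Example call on a client node (not executed here):
#   result = run_with_retry(["sfslstate"])
#   print(result.stdout)
```

The function returns the last result even when every attempt fails, so the caller can still inspect the captured output.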
Table 4-3 shows a summary of the connection states displayed by the sfslstate command on a client
node.
4.10.2 Examples of communications messages
On client nodes, the sshd service starts before the SFS service starts; this means that if a client node is
experiencing mount problems, you can still log in to the node and examine the /var/log/messages file.
On compute nodes, the syslog service forwards logs to the consolidated log, so that if the utility nodes
that run the syslog_ng service are operating, the log messages may also be seen in the consolidated logs.
Note that the syslog_ng service starts after the SFS service starts; this means that the consolidated logs
on utility nodes are not updated until the SFS service finishes mounting any file systems that are mounted in
the foreground.
The following are examples and descriptions of some selected log messages associated with Lustre file
system mount operations:
The following message shows that the SFS service has issued a mount request for the data file
system:
server: sfsmount: /usr/sbin/sfsmount http://sfsalias/hptc_cluster /hptc_cluster
-o net=vib,max_cached_mb=128,lconf,repeat,dport=33009
The following message extract shows that a mount request has finished. The file system is mounted
and is operating normally:
server: sfsmount: Done. lconf output: loading module: libcfs srcdir …LOV:
hptc_cluster …OSC_n1044_sfsalias-ost185_MNT_client_vib
.
.
.
The following message shows that the InfiniBand network is not yet ready. The vstat command is
showing a status of PORT_INITIALIZE instead of PORT_READY.
sfsmount: Waiting for IB to be ready.
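
When scanning /var/log/messages for these entries, a small classifier can help. The sketch below matches only the three sfsmount messages quoted above. The labels (mount_requested, mount_done, waiting_for_ib) and the function name classify are hypothetical, and the patterns assume the message text appears exactly as shown in the examples.

```python
import re

# Patterns are taken from the example messages only; real logs may
# contain other sfsmount messages that this sketch does not recognize.
PATTERNS = [
    ("mount_done", re.compile(r"sfsmount: Done\.")),
    ("waiting_for_ib", re.compile(r"sfsmount: Waiting for IB to be ready")),
    ("mount_requested", re.compile(r"sfsmount: /usr/sbin/sfsmount\s")),
]


def classify(line):
    """Return a label for a known sfsmount message, or None otherwise."""
    for label, pattern in PATTERNS:
        if pattern.search(line):
            return label
    return None
```

A line that matches none of the patterns returns None, so unrelated log entries can be skipped without special handling.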
Table 4-3 File system service connection states

State        Description
FULL         The service is fully connected and operating normally.
NEW          The mount request is being processed.
CONNECT      An attempt is being made to connect to the service.
DISCONN      The client node is disconnected from the service.
REPLAY_WAIT  The connection has been established and the file system is
             recovering; the state normally changes to FULL within a few minutes.
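
The states in Table 4-3 can be held in a simple lookup for use in monitoring scripts. This is an illustrative sketch: the names STATES, normalize, describe, and is_operational are hypothetical, and because the exact output format of sfslstate is not shown here, the code works on a bare state name only. It treats a hyphen and an underscore in the state name as equivalent.

```python
# Descriptions are copied from Table 4-3.
STATES = {
    "FULL": "The service is fully connected and operating normally.",
    "NEW": "The mount request is being processed.",
    "CONNECT": "An attempt is being made to connect to the service.",
    "DISCONN": "The client node is disconnected from the service.",
    "REPLAY_WAIT": "The connection has been established and the file "
                   "system is recovering.",
}


def normalize(state):
    """Upper-case the name and treat '-' and '_' as the same character."""
    return state.strip().upper().replace("-", "_")


def describe(state):
    """Return the Table 4-3 description, or a marker for unknown names."""
    return STATES.get(normalize(state), "Unknown state")


def is_operational(state):
    """Only FULL means the connection is operating normally."""
    return normalize(state) == "FULL"
```

For example, describe("replay-wait") and describe("REPLAY_WAIT") return the same description, and is_operational returns True only for FULL.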