What are the common issues when setting up remote access on the DGX Station


Setting up remote access on the NVIDIA DGX Station commonly runs into problems with network configuration, security, and user management. The most frequent issues are explained below:

1. Network Configuration Issues

- IP Addressing: The DGX Station has two Ethernet ports, and by default both are configured for DHCP. If both ports are connected to the network, each receives its own IP address, and the system can alternate between the two addresses, which causes connectivity problems[4]. To avoid this, connect only one port unless you manually configure static IP addresses for both ports.
- Firewall Settings: Firewalls might block incoming SSH connections. You may need to open specific ports (e.g., port 22 for SSH) to allow remote access[2].
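
A quick way to tell a firewall or addressing problem apart from an SSH configuration problem is to test whether TCP port 22 is reachable at all. The following is a minimal sketch using only the Python standard library; the function name `port_is_reachable` and the host name in the usage comment are placeholders, not part of any NVIDIA tooling.

```python
import socket


def port_is_reachable(host: str, port: int = 22, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unresolvable host names.
        return False


# Example usage (placeholder host name, replace with the real address):
#   if not port_is_reachable("dgx-station.example.com"):
#       print("Port 22 is blocked or the host is unreachable")
```

A `False` result only shows that the TCP connection failed; it does not say whether a firewall, a wrong IP address, or a stopped SSH service is responsible.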

2. VPN Connectivity

- VPN Credentials Management: When a VPN is used to reach the DGX Station, managing VPN credentials can be cumbersome. Credentials stored in plaintext files are a security risk unless access to those files is tightly restricted[1].
- Connection Stability: VPN connections can sometimes be unstable, leading to dropped sessions during remote work. Ensuring a stable VPN connection is crucial for uninterrupted access.
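
If VPN credentials do end up in a file, one basic safeguard is to confirm that only the file's owner can read it. The sketch below assumes a POSIX system (such as the client machine or the Ubuntu-based DGX OS); the function name and the path in the usage comment are hypothetical.

```python
import os
import stat


def credentials_file_is_private(path: str) -> bool:
    """Return True if no group or other permission bits are set on the file."""
    mode = stat.S_IMODE(os.stat(path).st_mode)
    # Any bit in 0o077 grants read, write, or execute access to group or others.
    return (mode & 0o077) == 0


# Example usage (hypothetical path):
#   if not credentials_file_is_private("/home/alice/.vpn-credentials"):
#       print("Tighten permissions, for example with: chmod 600 <file>")
```

This checks file permissions only. It does not encrypt the credentials, so a password manager or keyring remains the safer option where one is available.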

3. SSH Server Configuration

- User Management: The DGX Station uses an Ubuntu-based OS, which means adding users and managing SSH access follows standard Ubuntu practices. However, ensuring that all users have appropriate permissions and access rights can be challenging, especially in multi-user environments[2].
- SSH Server Security: Configuring the SSH server securely is important to prevent unauthorized access. This includes setting up strong passwords, limiting login attempts, and potentially using key-based authentication instead of passwords[2].
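
The hardening points above map onto standard OpenSSH `sshd_config` keywords (`PasswordAuthentication`, `PermitRootLogin`, `MaxAuthTries`). As an illustration, the following sketch scans the text of a configuration file and reports settings that are not explicitly hardened. The function name and the threshold of 3 attempts are assumptions made for this example, and the parser deliberately ignores `Include` files, `Match` blocks, and `key=value` syntax.

```python
from typing import Dict, List


def audit_sshd_config(text: str) -> List[str]:
    """Return warnings for a few hardening-related sshd_config settings."""
    settings: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        parts = line.split(None, 1)
        if len(parts) != 2:
            continue
        key, value = parts[0].lower(), parts[1].strip().lower()
        if key == "match":
            break  # conditional blocks are out of scope for this sketch
        # sshd uses the first value it finds for each keyword.
        settings.setdefault(key, value)

    warnings: List[str] = []
    if settings.get("passwordauthentication") != "no":
        warnings.append("PasswordAuthentication is not explicitly set to 'no'")
    if settings.get("permitrootlogin") not in ("no", "prohibit-password"):
        warnings.append("PermitRootLogin is not explicitly restricted")
    tries = settings.get("maxauthtries")
    # The limit of 3 is an assumed policy value, not an NVIDIA recommendation.
    if tries is None or not tries.isdigit() or int(tries) > 3:
        warnings.append("MaxAuthTries is not explicitly set to 3 or lower")
    return warnings


# Example usage:
#   with open("/etc/ssh/sshd_config") as fh:
#       for warning in audit_sshd_config(fh.read()):
#           print(warning)
```

Because the sketch flags anything that is not set explicitly, it can warn about a setting whose built-in default is already acceptable. Treat the output as a prompt to review the file, not as a verdict.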

4. Remote Graphical Access

- Desktop Environment Installation: Installing a graphical desktop environment (like Ubuntu GNOME) on the DGX Station can sometimes cause system instability or freezing issues, especially if not properly configured for remote access[5].
- VNC or Remote Desktop Setup: Setting up VNC or other remote desktop solutions requires careful configuration to ensure secure and stable connections. This includes setting up SSH tunnels for secure access[5].
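
An SSH tunnel for VNC is usually created with OpenSSH local port forwarding (`ssh -L`). The sketch below only builds the command line so that it can be inspected before it is run. It assumes a VNC server listening on the DGX Station's loopback interface and the common convention that display `:N` uses TCP port 5900 + N; the user and host names are placeholders.

```python
import shlex
from typing import List, Optional


def vnc_tunnel_command(user: str, host: str, display: int = 1,
                       local_port: Optional[int] = None) -> List[str]:
    """Build an ssh command that forwards a local port to a remote VNC display."""
    remote_port = 5900 + display  # VNC display :N conventionally listens on 5900 + N
    if local_port is None:
        local_port = remote_port
    return [
        "ssh",
        "-N",  # do not run a remote command, only forward ports
        "-L", f"{local_port}:localhost:{remote_port}",
        f"{user}@{host}",
    ]


# Example usage (placeholder account and host):
#   print(shlex.join(vnc_tunnel_command("alice", "dgx-station.example.com")))
```

With the tunnel open, the VNC client connects to `localhost` on the local port, so the VNC port itself never needs to be exposed through the firewall.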

5. BMC and KVM Access

- BMC Configuration: The Baseboard Management Controller (BMC) provides remote access to the system console; the cited documentation describes it for the DGX A100. Configuring a static IP for the BMC and keeping it isolated on a dedicated management network is crucial for secure remote management[6][9].
- KVM Access: Using the BMC to launch a KVM session allows for full remote control of the system. However, this requires proper BMC setup and network connectivity[6].
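
When assigning a static address to the BMC, it helps to verify that the address falls inside the management subnet and outside any general-purpose subnet. The following sketch uses Python's `ipaddress` module; the function name and every address shown are hypothetical, and the check covers the addressing plan only, not VLAN or firewall isolation.

```python
import ipaddress
from typing import List


def bmc_is_isolated(bmc_ip: str, management_net: str, data_nets: List[str]) -> bool:
    """Return True if bmc_ip is inside management_net and outside all data_nets."""
    addr = ipaddress.ip_address(bmc_ip)
    in_management = addr in ipaddress.ip_network(management_net)
    in_data = any(addr in ipaddress.ip_network(net) for net in data_nets)
    return in_management and not in_data


# Example usage (hypothetical addressing plan):
#   bmc_is_isolated("10.0.100.15", "10.0.100.0/24", ["192.168.1.0/24"])
```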

6. Software Updates and Compatibility

- DGX OS Updates: Keeping the DGX OS up-to-date is essential for security and functionality. However, updates can sometimes introduce compatibility issues with existing configurations or software[6].
- Third-Party Software: Third-party software may need additional configuration steps, and it may be disabled during an OS update, in which case it has to be re-enabled manually[2].

Addressing these common issues requires careful planning, secure configuration practices, and ongoing maintenance to ensure reliable remote access to the DGX Station.

Citations:
[1] https://hackmd.io/@akshatshah21/vjti-dgx-usage
[2] http://cdn.cnetcontent.com/2f/ed/2fed407b-03de-4bda-86b4-64cb9ec20a50.pdf
[3] https://askubuntu.com/questions/1173817/network-setup-for-an-nvidia-dgx-1
[4] https://docs.nvidia.com/dgx/pdf/DGX-Station-User-Guide.pdf
[5] https://forums.developer.nvidia.com/t/remote-graphical-desktop-with-dgx-a100-server/257638
[6] http://cdn.cnetcontent.com/2f/68/2f6888a0-063f-4d76-94e4-8666b7619dfd.pdf
[7] https://docs.nvidia.com/dgx/dgxa100-user-guide/network-config.html
[8] https://github.com/NVIDIA/deepops/blob/master/docs/deepops/dgx-diagnostic-firmware.md
[9] https://docs.nvidia.com/dgx/dgxa100-user-guide/connecting-to-dgxa100.html