The default memory settings assume that Data Studio is run on a dedicated (or largely dedicated) machine which isn't always the case. To ensure that not all available memory is allocated to Data Studio, you can adjust the maximum memory settings.
You can do this in the Aperture Data Studio Service 64bit.ini file which is in the root of the installation directory (by default,C:\Program Files\Experian\Aperture Data Studio <version>\Aperture Data Studio Service 64bit.ini ).
The following line controls memory allocation:
Virtual Machine Parameters=-Xms66:1000:16000P -Xmx66:1000:16000
These default settings mean that Data Studio will use 66% of the total system memory that's available, up to a maximum of 16 GB.
-Xmx<value>
This is the maximum amount of memory to be used. Example: to specify a fixed 24 GB for the application, enter: -Xms24g
-Xms<value>
This is the initial amount of memory to use upon start up. For the best performance this should be the same as the Xmx
value to avoid fragmentation of memory. Example: to specify that Data Studio uses only 8 GB at a maximum, enter: Virtual Machine Parameters=-Xms8G -Xmx8G
After installing the ODBC drivers, you should be able to create a Data Source Name (DSN) for both 32-bit and 64-bit clients.
The 32-bit ODBC Administrator is found at %systemdrive%/Windows/SysWoW64/odbcad32.exe
.
The 64-bit ODBC Administrator is found at %systemdrive%/Windows/System32/odbcad32.exe
.
%systemdrive%/Windows/SysWoW64/odbcad32.exe
%systemdrive%/Windows/System32/odbcad32.exe
In Data Studio, go to Data Explorer and Click here to create a new data source.
Select Apache Hadoop HDFS as Data source type.
Specify the following:
Hostname: the IP address or DNS name of the Primary NameNode of your Hadoop cluster.
Port: is the value of the fs.default.name
property. Typically, you will find this in the Hadoop configuration file core-site.xml.
Username/Password: the Linux username/password on the Hadoop host machine.
Note that your file access permissions will determine which files are visible to Data Studio.
Root directory: the starting directory for file discovery. If not specified, it will default to the root of the HDFS file system ("/").
Click Test connection to check connectivity and Create to save changes.
To start the service:
View technical recommendations before installing. Note that your setup requirements will depend on the size of your data and Aperture Data Studio usage.
ulimit -a
in the command shell. ulimit -Hn
should give the result 64000. The distribution exists as a .rpm file so you may use yum or rpm to install Data Studio. The installation directory created by the rpm is /home/experian/ApertureDataStudio/ApertureDataStudio_1.6.0.
$ sudo yum _install_ ApertureDataStudio-1.6.0-1.el7.x86_64.rpm
The hierarchy of directories is created at installation and is owned by ApertureDataStudio:experian. The user (ApertureDataStudio) and group (experian) are created, if absent.
The directory and its children are created with permissions of 770, i.e. owner: rwe, group: rwe and all others no access. Also, two service files are created: the first is used to control the Aperture Data Studio server (ApertureDataStudio_1.6.0.service); the second for the Standardize service (Standardize_4.6.14.service).
The following are created:
To re-install Data Studio:
$ sudo yum _reinstall_ ApertureDataStudio-1.6.0-1.el7.x86_64.rpm
Before you can use Data Studio you have to either start it as a service or run the executable directly in a terminal window.
Starting Aperture Data Studio directly
$ cd /home/experian/ApertureDataStudio/ApertureDataStudio_1.6.0
$ sudo java –Xms16g –Xmx16g –cp .:./pserver.jar:"./lib/*" com.experian.ServerMain STARTUP
Starting Aperture Data Studio as a service
# make the services known to the system
$ sudo systemctl daemon-reload
# create the symlinks
$ sudo systemctl enable ApertureDataStudio_1.6.0
# start the service
$ sudo systemctl start ApertureDataStudio_1.6.0
# Check the service status
$ sudo systemctl status –l ApertureDataStudio_1.6.0
Note that these services will restart on failure and at system boot time.
The server will run out-of-the-box, however, you may prefer to relocate the server and the data directories to another partition/directory. Both of these locations may be changed by editing the file server.properties in /home/experian/ApertureDataStudio/ApertureDataStudio_1.6.0
The two properties are:
You may create these directories yourself with the appropriate owner/permissions.
Additional file data sources may be specified in the file filedatastores.properties in /home/experian/ApertureDataStudio/ApertureDataStudio_1.6.0
# Install the prerequisites
$ sudo yum -y install lttng-ust libcurl openssl-libs krb5-libs libicu zlib
# make the services known to the system
$ sudo systemctl daemon-reload
# create the symlinks
$ sudo systemctl enable Standardize_4.6.14
# start the service
$ sudo systemctl start Standardize_4.6.14
# Check the service status
$ sudo systemctl status –l Standardize_4.6.14
This error is most likely to occur when the JVM in which Data Studio runs is attempting to allocate more memory than is available on the system.
The memory settings can be configured in the Aperture Data Studio Service 64bit.ini file which is in the root of the installation directory (by default, C:\Program Files\Experian\Aperture Data Studio <version>\Aperture Data Studio Service 64bit.ini).
This line in the file controls the memory allocation:
Virtual Machine Parameters=-Xms66:1000:16000P -Xmx66:1000:16000
By default, the Java Virtual Machine settings will use 66% of the total system memory that's available, up to a maximum of 16 GB (from 24 available). This assumes that Data Studio is run on a dedicated, or largely dedicated box.
If your environment isn't dedicated to Data Studio and has other applications running, it's possible that this much memory can't be allocated. In this case, the VM parameters can be changed to specify an exact value for maximum memory used. For example, to allocate 8 GB RAM: Virtual Machine Parameters=-Xms8g -Xmx8g
If the service isn't starting after changing the memory setting, other possible causes are: