Python and XCompact3d

XCompact3d 2021 Online Developer Meeting

Felipe N. Schuch, LaSET, School of Technology, PUCRS.

Introduction

Why Python?

Computational cost vs. development cost;
Faster to prototype ideas;
Code interactively using IPython and Jupyter;
It is a great tool for pre and post-processing.

Why Numpy?

It is a Python library that provides a multidimensional array object and an assortment of routines for fast operations on arrays;
Much faster than pure Python loops, because the array operations run in optimized, pre-compiled C code;
With Numpy, we have the best of two worlds, the performance of compiled code in the background, together with the flexibility of Python code for the user.

See https://numpy.org

Numpy - Example

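import matplotlib.pyplot as plt
import numpy as np

x = np.linspace(start=0., stop=2*np.pi, num=50)
y = np.linspace(start=0., stop=2*np.pi, num=50)

# Broadcasting: x varies along axis 0 and y along axis 1
ux = np.sin(x[:,np.newaxis])*np.cos(y[np.newaxis,:])
uy = -np.cos(x[:,np.newaxis])*np.sin(y[np.newaxis,:])

# Integrate ux over x, then over y (named to avoid shadowing the built-in int)
integral = np.trapz(np.trapz(ux, x=x, axis=0), x=y, axis=0)

plt.streamplot(x,y,ux.T,uy.T)
plt.xlabel(r"$x_1$"); plt.ylabel(r"$x_2$");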

Why Xarray?

Xarray introduces labels in the form of dimensions, coordinates and attributes on top of raw NumPy-like multidimensional arrays, which allows for a more intuitive, more concise, and less error-prone developer experience;
Besides, it is integrated with other tools for:
- Plotting (matplotlib, HoloViews and others);
- Parallel computing (Dask);
- I/O (NetCDF).

See http://xarray.pydata.org

Xarray - Example

import numpy as np
import xarray as xr

dataset = xr.Dataset(
    coords={
        "y": np.linspace(start=0.0, stop=2 * np.pi, num=50),
        "x": np.linspace(start=0.0, stop=2 * np.pi, num=50),
    }
)
dataset["ux"] = np.sin(dataset["x"]) * np.cos(dataset["y"])
dataset["uy"] = -np.cos(dataset["x"]) * np.sin(dataset["y"])
dataset

<xarray.Dataset>
Dimensions:  (x: 50, y: 50)
Coordinates:
  * y        (y) float64 0.0 0.1282 0.2565 0.3847 ... 5.899 6.027 6.155 6.283
  * x        (x) float64 0.0 0.1282 0.2565 0.3847 ... 5.899 6.027 6.155 6.283
Data variables:
    ux       (x, y) float64 0.0 0.0 0.0 0.0 ... -2.369e-16 -2.429e-16 -2.449e-16
    uy       (x, y) float64 -0.0 -0.1279 -0.2537 ... 0.2537 0.1279 2.449e-16

Note: This is just the string representation; the dataset will look even better in HTML when running in Jupyter.

`XCompact3d-toolbox`

https://xcompact3d-toolbox.readthedocs.io

The physical and computational parameters are built on top of traitlets:
- IPywidgets for a friendly user interface;
Data structure is provided by xarray, again with:
- Plotting (matplotlib, HoloViews and others);
- Parallel computing (Dask);
- I/O (NetCDF).

Parameters' consistency with Traitlets

>>> prm = x3d.Parameters(loadfile="example.i3d")
>>> # Type checking
>>> prm.iibm = 10.0
TraitError: The 'iibm' trait of a Parameters instance expected an int,
not the float 10.0.
>>> # Limits are imposed
>>> prm.iibm = 5 # <--- This can be only 0, 1 or 2, as x3d expects
TraitError: The value of the 'iibm' trait of a Parameters instance
should not be greater than 2, but a value of 5 was specified

>>> # On change validation
>>> prm.nx = 93
TraitError: Invalid value for mesh points (nx)
>>> prm.nx = 17
>>> # On change callbacks
>>> print(prm.nclx1, prm.nclxn, prm.nx, prm.dx)
2 2 17 0.0625
>>> prm.nclx1 = 0 # <--- Setting periodic BC
>>> print(prm.nclx1, prm.nclxn, prm.nx, prm.dx)
0 0 16 0.0625
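
The behaviour in the session above can be imitated with plain traitlets. The block below is only a minimal sketch, not the toolbox's implementation: `ToyParameters`, `rejects` and `_only_small_factors` are hypothetical names, and the rule that the number of mesh intervals must factor into 2, 3 and 5 is an assumption chosen to reproduce the accepted and rejected values shown on the slide.

```python
from traitlets import Float, HasTraits, Int, TraitError, observe, validate


def _only_small_factors(n):
    """Return True if n >= 1 factors entirely into powers of 2, 3 and 5."""
    if n < 1:
        return False
    for p in (2, 3, 5):
        while n % p == 0:
            n //= p
    return n == 1


class ToyParameters(HasTraits):
    """Hypothetical stand-in for x3d.Parameters (not the toolbox's code)."""

    iibm = Int(0, min=0, max=2)   # type and limits are checked by the trait
    nclx1 = Int(2, min=0, max=2)  # 0 means periodic in this sketch
    nx = Int(17, min=1)
    xlx = Float(1.0)

    @property
    def dx(self):
        # A periodic mesh does not repeat the last point
        return self.xlx / (self.nx if self.nclx1 == 0 else self.nx - 1)

    @validate("nx")
    def _validate_nx(self, proposal):
        # Assumed rule: the number of intervals must factor into 2, 3 and 5
        nx = proposal["value"]
        intervals = nx if self.nclx1 == 0 else nx - 1
        if not _only_small_factors(intervals):
            raise TraitError("Invalid value for mesh points (nx)")
        return nx

    @observe("nclx1")
    def _on_nclx1_change(self, change):
        # Keep nx consistent when switching to or from a periodic boundary
        if change["new"] == 0 and change["old"] != 0:
            self.nx -= 1
        elif change["new"] != 0 and change["old"] == 0:
            self.nx += 1


def rejects(obj, name, value):
    """Return True if assigning value to the trait raises TraitError."""
    try:
        setattr(obj, name, value)
    except TraitError:
        return True
    return False


prm = ToyParameters()
print(rejects(prm, "iibm", 10.0), rejects(prm, "iibm", 5), rejects(prm, "nx", 93))
print(prm.nclx1, prm.nx, prm.dx)
prm.nclx1 = 0  # the observer adjusts nx, and dx follows
print(prm.nclx1, prm.nx, prm.dx)
```

The trait definitions cover type checking and limits, the `@validate` hook covers on-change validation, and the `@observe` hook plays the role of the on-change callback.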

User Interface with IPywidgets (try it online)

Flow Visualization with Passive Scalar Field

`XCompact3d-toolbox` - Example

prm = x3d.Parameters(loadfile="input.i3d")
ds = xr.Dataset()
# Make sure to have enough memory!
for var in "ux uy uz pp".split():
    ds[var] = prm.read_all_fields(f"./data/3d_snapshots/{var}-*.bin")
ds["phi"] = xr.concat(
    [prm.read_all_fields(f"./data/3d_snapshots/phi{n+1}-*.bin") for n in range(prm.numscalar)],
    "n",
).assign_coords(n=("n", range(prm.numscalar)))
ds

<xarray.Dataset>
Dimensions:  (n: 5, t: 76, x: 721, y: 49, z: 721)
Coordinates:
  * x        (x) float32 0.0 0.02083 0.04167 0.0625 ... 14.94 14.96 14.98 15.0
  * z        (z) float32 0.0 0.02083 0.04167 0.0625 ... 14.94 14.96 14.98 15.0
  * y        (y) float32 0.0 0.02083 0.04167 0.0625 ... 0.9375 0.9583 0.9792 1.0
  * n        (n) int32 0 1 2 3 4
  * t        (t) float64 0.0 0.4 0.8 1.2 1.6 2.0 ... 28.4 28.8 29.2 29.6 30.0
Data variables:
    phi      (n, t, x, y, z) float32 dask.array<chunksize=(5, 1, 721, 49, 721), meta=np.ndarray>
    ux       (t, x, y, z) float32 dask.array<chunksize=(1, 721, 49, 721), meta=np.ndarray>
    uy       (t, x, y, z) float32 dask.array<chunksize=(1, 721, 49, 721), meta=np.ndarray>
    uz       (t, x, y, z) float32 dask.array<chunksize=(1, 721, 49, 721), meta=np.ndarray>
    pp       (t, x, y, z) float32 dask.array<chunksize=(1, 721, 49, 721), meta=np.ndarray>

Now we have a real case using an xarray dataset;
This is from a polydisperse turbidity current in an axisymmetric configuration;
We start with an empty dataset, and then populate it with all the variables from our simulation;
You see here the three velocity components and pressure;
With the toolbox, we can read all files at once;
Besides, the five scalar fractions are concatenated into just one array with this command here;
And finally, we can see the dataset, with:
- 5 scalar fractions, from 76 snapshots in time, with this spatial resolution;
- The coordinates are also INCLUDED. With xarray, we can do many operations calling the coordinates by name, which is very powerful;
- and we see the five variables.
For me, it is really impressive to have ALL data AVAILABLE FOR US at once here in this single object;
But JUST MAKE SURE to have enough memory for it!
Now, let's see how to use it.

Xarray - Working with coordinates

ds.phi.sel(t=10.0).mean("y").plot(col="n")

ds['suspended'] = ds.phi.integrate(["x", "y", "z"]); ds.suspended.plot(hue="n")

ds['w1'] = ds.uz.differentiate("y") - ds.uy.x3d.first_derivative("z")

In the first example:
- From the dataset, we select the scalar;
- I’m choosing JUST where time is equal to 10.0;
- Computing a vertical average calling the coordinate by its name;
- And finally a plot for reference, presenting each scalar fraction in a different figure;
- The settling velocity is different for each fraction, so that is why the concentration is decreasing from LEFT to RIGHT;
In the second line, I’m showing how to compute the suspended material. It is defined as the volumetric INTEGRATION of the concentration fields. We can code it in this way, and again a plot for reference;
And the last code shows how to compute the first component of VORTICITY;
- It is equal to duz / dy SUBTRACTING duy / dz;
- We can use the standard second-order scheme from xarray;
- Or the high-order alternative from the toolbox;
From my experience working with xarray, we can solve more complicated PROBLEMS with FEWER lines of code;
Besides, calling the coordinates by their names makes our code VERY READABLE and, CONSEQUENTLY, easier to collaborate on, share and maintain.
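
To try these operations without the simulation files, here is a small sketch on synthetic data. The mesh sizes and the analytical fields are assumptions made for illustration, and both derivatives use xarray's `differentiate` rather than the toolbox's high-order scheme. The fields are linear, so the expected results are known exactly.

```python
import numpy as np
import xarray as xr

# Hypothetical small mesh, standing in for the real simulation grid
coords = {
    "x": np.linspace(0.0, 2.0, 9),
    "y": np.linspace(0.0, 1.0, 5),
    "z": np.linspace(0.0, 2.0, 9),
}
ones = xr.DataArray(np.ones((9, 5, 9)), coords=coords, dims=("x", "y", "z"))

ds = xr.Dataset()
ds["phi"] = ones            # uniform concentration
ds["uy"] = -ones * ones.z   # uy = -z
ds["uz"] = ones * ones.y    # uz = y

# Select by coordinate value, then average along a named dimension
plane_mean = ds.phi.sel(x=1.0, method="nearest").mean("y")

# Volumetric integration over the three named coordinates
ds["suspended"] = ds.phi.integrate(["x", "y", "z"])

# First vorticity component: d(uz)/dy - d(uy)/dz
ds["w1"] = ds.uz.differentiate("y") - ds.uy.differentiate("z")

print(float(ds.suspended))  # volume of the 2 x 1 x 2 box
print(float(ds.w1.mean()))  # uniform vorticity for these linear fields
```

No axis numbers appear anywhere in the block. Every operation refers to a dimension by its name, so the order in which the arrays store their dimensions does not matter.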

Could we handle larger-than-memory Datasets?

Yes, if the files were written as NetCDF:

ds = xr.open_mfdataset("./data/3d_snapshots/*.nc")

Actually, it is just what we did! In the previous example, we handled a 66.5 GB dataset in an 8 GB virtual machine;
Shall we consider implementing I/O with NetCDF in XCompact3d?

Note: I’ve written a script to convert raw binaries to NetCDF, in order to test this concept.
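
The conversion idea can be sketched as follows. This is not the script mentioned in the note. It assumes each raw file holds one single-precision field in Fortran order, uses a hypothetical tiny mesh and fake snapshots written to a temporary folder, and needs Dask plus a NetCDF backend (netCDF4 or SciPy) to be installed.

```python
import tempfile
from pathlib import Path

import numpy as np
import xarray as xr

nx, ny, nz = 8, 4, 8  # hypothetical tiny mesh for the demonstration


def raw_to_netcdf(raw_file, nc_file, name, time):
    """Convert one raw snapshot to NetCDF, assuming float32 in Fortran order."""
    data = np.fromfile(raw_file, dtype=np.float32).reshape((nx, ny, nz), order="F")
    array = xr.DataArray(
        data[np.newaxis],  # add a leading time axis of length one
        dims=("t", "x", "y", "z"),
        coords={"t": [time]},
        name=name,
    )
    array.to_netcdf(nc_file)


with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    for i in range(3):
        # Fake snapshots filled with the snapshot index, standing in for solver output
        raw = folder / f"ux-{i:03d}.bin"
        np.full((nx, ny, nz), float(i), dtype=np.float32).tofile(raw)
        raw_to_netcdf(raw, folder / f"ux-{i:03d}.nc", "ux", time=0.4 * i)

    # Lazy, chunked view over all files; values are only read on .compute()
    with xr.open_mfdataset(str(folder / "*.nc"), combine="by_coords") as ds:
        n_snapshots = ds.sizes["t"]
        time_mean = float(ds.ux.mean("t").max().compute())

print(n_snapshots, time_mean)
```

Because each file carries its own time coordinate, `open_mfdataset` can line the snapshots up along `t` by itself, and Dask loads only the chunks a computation actually needs.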

Integrating Python and XCompact3d

F2PY - Fortran to Python interface generator

! xcompact3d.f90 | mpirun -n 4 ./xcompact3d
program xcompact3d

  use core

  implicit none

  call init_xcompact3d()
  call main_loop()
  call finalise_xcompact3d()

end program xcompact3d

# xcompact3d.py | mpirun -n 4 python xcompact3d.py
from xcompact3d import core

if __name__ == '__main__':
    core.init_xcompact3d()
    core.main_loop()
    core.finalise_xcompact3d()

Note: This example actually works, and with no performance penalty.

Overview / Objectives

Make key subroutines available in Python;
Testing them individually with unittest will increase XCompact3d’s maintainability;
Distributing the compiled code with pip may increase our user base.
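
As an illustration of the testing objective, the sketch below checks a routine against an analytical solution with unittest. `first_derivative` is a hypothetical pure-NumPy stand-in (second-order central differences on a periodic mesh); in practice the call would go to a subroutine exposed through F2PY.

```python
import unittest

import numpy as np


def first_derivative(f, dx):
    """Hypothetical stand-in for a wrapped routine: periodic, second-order."""
    return (np.roll(f, -1) - np.roll(f, 1)) / (2.0 * dx)


class TestFirstDerivative(unittest.TestCase):
    def test_sine_wave(self):
        # d/dx sin(x) = cos(x) on a periodic domain of length 2*pi
        n = 256
        x = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
        dx = x[1] - x[0]
        np.testing.assert_allclose(
            first_derivative(np.sin(x), dx), np.cos(x), atol=1e-3
        )

    def test_constant_field(self):
        # The derivative of a constant field must vanish
        np.testing.assert_allclose(first_derivative(np.ones(32), 0.1), 0.0, atol=1e-12)


suite = unittest.defaultTestLoader.loadTestsFromTestCase(TestFirstDerivative)
result = unittest.TextTestRunner(verbosity=0).run(suite)
```

The same test case could be reused unchanged for the compiled routine by swapping the function under test.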

F2PY - Fortran to Python interface generator

The next step

from xcompact3d import core, solver

if __name__ == "__main__":

    core.init_xcompact3d()
    my_own_initial_conditions() # Low cost, very customizable

    while solver.is_running:

        my_own_boundary_conditions() # Low cost, very customizable
        solver.advance_time() # High performance with Fortran code
        my_own_postprocessing() # Low cost, very customizable

    core.finalise_xcompact3d()

Note 1: Here we have every Python tool at our disposal, like modules for optimization, control, visualization, machine learning, I/O, GPU accelerated computing (CuPy), etc.
Note 2: It results in a very customizable interface without affecting the main code in Fortran.
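
The loop pattern can be tried today with a pure-Python mock. In the sketch below, `ToySolver` is a hypothetical stand-in for the compiled solver (1D periodic upwind advection), so it says nothing about the real interface. It only shows how user-defined initial conditions and post-processing wrap around a time-advancement call. Boundary conditions are omitted because the toy problem is periodic.

```python
import numpy as np


class ToySolver:
    """Hypothetical stand-in for the compiled solver: 1D periodic advection."""

    def __init__(self, nx=64, length=1.0, velocity=1.0, dt=0.005, n_steps=100):
        self.x = np.linspace(0.0, length, nx, endpoint=False)
        self.dx = length / nx
        self.velocity, self.dt, self.n_steps = velocity, dt, n_steps
        self.phi = np.zeros(nx)
        self.step = 0

    @property
    def is_running(self):
        return self.step < self.n_steps

    def advance_time(self):
        # First-order upwind scheme, valid for velocity > 0 and CFL <= 1
        cfl = self.velocity * self.dt / self.dx
        self.phi = self.phi - cfl * (self.phi - np.roll(self.phi, 1))
        self.step += 1


def my_own_initial_conditions(solver):
    # Gaussian pulse centred in the domain
    solver.phi = np.exp(-200.0 * (solver.x - 0.5) ** 2)


def my_own_postprocessing(solver, history):
    # Record the total amount of scalar at every time step
    history.append(solver.phi.sum() * solver.dx)


solver = ToySolver()
history = []
my_own_initial_conditions(solver)
while solver.is_running:
    solver.advance_time()
    my_own_postprocessing(solver, history)

print(len(history), np.allclose(history, history[0]))
```

The post-processing hook here checks that the scalar is conserved, the kind of inexpensive diagnostic that is convenient to write in Python while the time advancement stays in compiled code.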

It is time to discuss the conclusions

Felipe N. Schuch, LaSET, School of Technology, PUCRS.
🏠 fschuch.com ✉ felipe.schuch@edu.pucrs.br
