Skip to content Skip to sidebar Skip to footer

Numpy:finding Difference Between Two Files Containing Float Values

I am trying to use Python to compute the difference between two text files and print the first value and location where they start to diverge. I am not sure how to use loadtxt: i

Solution 1:

You could use

idx = np.where(np.abs(a-b) > 1e-6)[0]
firstidx = idx[0]

to find the first index where the values in a and b differ by more than some nominal amount like 1e-6:

import numpy as np

a = np.loadtxt("path/to/file", float)
b = np.loadtxt("path/to/file2", float)

idx = np.where(np.abs(a-b) > 1e-6)[0]
firstidx = idx[0]
print(firstidx, a[firstidx], b[firstidx])

Note that when dealing with floats, you rarely if ever want to compare with equality, such as with

np.abs(a-b) == 0

or the converse,

np.abs(a-b) != 0

because the inaccuracy of floating point representations can cause a and b to be slightly different even when their values should be exactly the same if their values were represented with infinite precision.

So use something like

np.abs(a-b) > 1e-6

instead. (Note that you have to choose a level of tolerance, e.g. 1e-6).


Here is a simple example demonstrating the pitfall of comparing floats using equality:

In [10]: 1.2-1.0==0.2Out[10]: False

Post a Comment for "Numpy:finding Difference Between Two Files Containing Float Values"