Using Non-hashable Python Objects As Keys In Dictionaries
Solution 1:
Based off solution by Chris Lutz again.
import collections
defhashable(obj):
ifisinstance(obj, collections.Hashable):
items = obj
elifisinstance(obj, collections.Mapping):
items = frozenset((k, hashable(v)) for k, v in obj.iteritems())
elifisinstance(obj, collections.Iterable):
items = tuple(hashable(item) for item in obj)
else:
raise TypeError(type(obj))
return items
Solution 2:
Don't. I agree with Andreys comment on the previous question that is doesn't make sense to have dictionaries as keys, and especially not nested ones. Your data-model is obviously quite complex, and dictionaries are probably not the right answer. You should try some OO instead.
Solution 3:
Based off solution by Chris Lutz. Note that this doesn't handle objects that are changed by iteration, such as streams, nor does it handle cycles.
import collections
defmake_hashable(obj):
"""WARNING: This function only works on a limited subset of objects
Make a range of objects hashable.
Accepts embedded dictionaries, lists or tuples (including namedtuples)"""ifisinstance(obj, collections.Hashable):
#Fine to be hashed without any changesreturn obj
elifisinstance(obj, collections.Mapping):
#Convert into a frozenset instead
items=list(obj.items())
for i, item inenumerate(items):
items[i]=make_hashable(item)
returnfrozenset(items)
elifisinstance(obj, collections.Iterable):
#Convert into a tuple instead
ret=[type(obj)]
for i, item inenumerate(obj):
ret.append(make_hashable(item))
returntuple(ret)
#Use the id of the objectreturnid(obj)
Solution 4:
If you really must, make your objects hashable. Subclass whatever you want to put in as a key, and provide a __hash__ function which returns an unique key to this object.
To illustrate:
>>> ("a",).__hash__()
986073539>>> {'a': 'b'}.__hash__()
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
TypeError: 'NoneType'objectisnotcallable
If your hash is not unique enough you will get collisions. May be slow as well.
Solution 5:
I totally disagree with comments & answers saying that this shouldn't be done for data model purity reason.
A dictionary associates an object with another object using the former one as a key. Dictionaries can't be used as keys because they're not hashable. This doesn't make any less meaningful/practical/necessary to map dictionaries to other objects.
As I understand the Python binding system, you can bind any dictionary to a number of variables (or the reverse, depends on your terminology) which means that these variables all know the same unique 'pointer' to that dictionary. Wouldn't it be possible to use that identifier as the hashing key ? If your data model ensures/enforces that you can't have two dictionaries with the same content used as keys then that seems to be a safe technique to me.
I should add that I have no idea whatsoever of how that can/should be done though.
I'm not entirely whether this should be an answer or a comment. Please correct me if needed.
Post a Comment for "Using Non-hashable Python Objects As Keys In Dictionaries"