README file update, Script update to first release

This commit is contained in:
2018-02-23 11:15:50 +09:00
parent 1aa052bc5d
commit 4b282c3d13
2 changed files with 153 additions and 21 deletions

100
README.md
View File

@@ -1,2 +1,98 @@
# reverse_geolocate # Reverse GeoLocate for XMP Sidecar files
Reverse GeoLocate from XMP files with optional LightRoom DB read
Reverse GeoLocate from XMP sidecar files with optional LightRoom DB read
This script will update any of the Country Code, Country, State, City and Location data that is missing in sidecard files. If a Lightroom DB is set, it will read any data from the database and fill in the fields before it tries to get the location name from google with the Latitude and Longitude found in either the XMP sidecar file or the LR database.
This script used the [Python XMP Tool kit](http://python-xmp-toolkit.readthedocs.io/)
## Command line arguments
reverse_geolocate.py [-h] -x
[XMP SOURCE FOLDER [XMP SOURCE FOLDER ...]]
[-l LIGHTROOM FOLDER]
[-f <overwrite, location, city, state, country, countrycode>]
[-g GOOGLE API KEY] [-n] [-v] [--debug] [--test]
### Arguments
Argument | Argument Value | Description
--- | --- | ---
-x, --xmp | XMP sidecar source folder or XMP sidecar file itself | Must given argument. It sets the path where the script will search for XMP sidecar files. It will traverse into subdirectories. A single XMP sidecar file can also be given. If the same file folder combination is found only one is processed.
-l, --lightroom | Lightroom DB base folder | The folder where the .lrcat file is located. Optional, if this is set, LR values are read before any Google maps connection is done. Fills the Latitude and Longitude and the location names. Lightroom data never overwrites data already set in the XMP sidecar file. It is recommended to have Lightroom write the XMP sidecar file before this script is run
-f, --field | Keyword: overwrite, location, city, state, country, countrycode | In the default no data is overwritten if it is already set. With the 'overwrite' flag all data is set new from the Google Maps location data. Other arguments are each of the location fields and if set only this field will be set. This can be combined with the 'overwrite' flag to overwrite already set data
-n, --nobackup | | Do not create a backup of XMP sidecar file when it is changed
-g, --google | Google Maps API Key | If available, to avoid the access limitations to the reverse location lookup
-v, --verbose | | More verbose output. Currently not used
--debug | | Full detailed debug output. Will print out alot of data
--test | | Does not write any changed back to the XMP sidecar file. For testing purposes
The script will created a backup of the current sidecar file named <original name>.BK.xmp in the same location as the original file.
The Lightroom lookup currently only uses the file name. Not that this can and will fail if there are more than one file with the same name in the database. It is planned to use the base path as additional search key. If more than one is found, no Lightroom data is used.
#### Example
```
reverse_geolocate.py -x Photos/2017/01 -x Photos/2017/02 -l LightRoom/MyCatalogue -f overwrite -g <API KEY>
```
Will find all XMP sidecar files in both folders *Photos/2017/01* and *Photos/2017/02* and all folder below it. Uses the Lightroom database at *LightRoom/MyCatalogue*. The script will overwrite all data, even if it is already set
```
reverse_geolocate.py -x Photos/2017/01/Event-01/some_photo.xmp -f location
```
Only works on *some_photo.xmp* file and will only set the *location* field if it is not yet set.
### Google data priority
Based in the JSON return data the following fields are set in order. If one can not be found for a target set, the next one below is used
order | type | target set
--- | --- | ---
1 | country | Country, CountryCode
2 | administrative_area_level_1 | State
3 | administrative_area_level_2 | State
4 | locality | City
5 | sublocality_level_1 | Location
6 | sublocality_level_2 | Location
7 | route | Location
### Script stats and errors on update
After the script is done the following overview will be printed
```
==============================
Found XMP Files : 3
Updated : 0
Skipped : 2
New GeoLocation Google: 0
GeoLocation from Cache: 0
Failed for Reverse Geo: 1
GeoLoc from Lightroom : 0
No Lightroom data : 0
```
If there are problems with getting data from the Google Maps API the complete errior sting will be printed
```
...
---> Photos/2017/02/some_file.xmp: Error in request: OVER_QUERY_LIMIT You have exceeded your daily request quota for this API. We recommend registering for a key at the Google Developers Console: https://console.developers.google.com/apis/credentials?project=_
(!) Could not geo loaction data [FAILED]
...
```
Also the files that could not be updated will be printed at the end of the run under the stats list
```
...
------------------------------
Files that failed to update:
Photos/2017/02/some_file.xmp
```
### Tested OS
This script has only been tested on macOS

View File

@@ -74,22 +74,31 @@ def reverseGeolocate(longitude, latitude):
# sensor (why?) # sensor (why?)
sensor = 'false' sensor = 'false'
# request to google # request to google
base = "http://maps.googleapis.com/maps/api/geocode/json?" # if a google api key is used, the request has to be via https
protocol = 'https://' if args.google_api_key else 'http://'
base = "maps.googleapis.com/maps/api/geocode/json?"
# build the base params
params = "latlng={lat},{lon}&sensor={sensor}".format(lon = lat_long['longitude'], lat = lat_long['latitude'], sensor = sensor) params = "latlng={lat},{lon}&sensor={sensor}".format(lon = lat_long['longitude'], lat = lat_long['latitude'], sensor = sensor)
# if we have a google api key, add it here
key = "&key={}".format(args.google_api_key) if args.google_api_key else '' key = "&key={}".format(args.google_api_key) if args.google_api_key else ''
url = "{base}{params}{key}".format(base = base, params = params, key = key) # build the full url and send it to google
url = "{protocol}{base}{params}{key}".format(protocol = protocol, base = base, params = params, key = key)
response = requests.get(url) response = requests.get(url)
# sift through the response to get the best matching entry # loop through the json response to get the best matching entry
geolocation = { geolocation = {
'CountryCode': '', 'CountryCode': '',
'Country': '', 'Country': '',
'State': '', 'State': '',
'City': '', 'City': '',
'Location': '' 'Location': '',
# below for error reports
'status': '',
'error_message': ''
} }
# print("Google response: {} => TEXT: {} JSON: {}".format(response, response.text, response.json())) if args.debug and args.verbose >= 1:
print("Google response: {} => TEXT: {} JSON: {}".format(response, response.text, response.json()))
# print("Error: {}".format(response.json()['status'])) # print("Error: {}".format(response.json()['status']))
if response.json()['status'] is not 'INVALID_REQUEST': if response.json()['status'] == 'OK':
# first entry for type = premise # first entry for type = premise
for entry in response.json()['results']: for entry in response.json()['results']:
for sub_entry in entry: for sub_entry in entry:
@@ -130,8 +139,13 @@ def reverseGeolocate(longitude, latitude):
if 'route' in addr['types'] and not geolocation['Location']: if 'route' in addr['types'] and not geolocation['Location']:
geolocation['Location'] = addr['long_name'] geolocation['Location'] = addr['long_name']
# print("Location (R): {}".format(location)) # print("Location (R): {}".format(location))
# write OK status
geolocation['status'] = response.json()['status']
else: else:
print("Error in request: {}".format(response.json()['error_message'])) geolocation['error_message'] = response.json()['error_message']
geolocation['status'] = response.json()['status']
print("Error in request: {} {}".format(geolocation['status'] , geolocation['error_message']))
# return # return
return geolocation return geolocation
@@ -254,17 +268,24 @@ parser.add_argument('-f', '--field',
type = str.lower, # make it lowercase for check type = str.lower, # make it lowercase for check
choices = ['overwrite', 'location', 'city', 'state', 'country', 'countrycode'], choices = ['overwrite', 'location', 'city', 'state', 'country', 'countrycode'],
dest = 'field_controls', dest = 'field_controls',
metavar = 'FIELD CONTROLS', metavar = '<overwrite, location, city, state, country, countrycode>',
help = 'On default only set fields that are not set yet. Options are: Overwrite (write all new), Location, City, State, Country, CountryCode. Multiple can be given. If with overwrite the field will be overwritten if already set, else it will be always skipped' help = 'On default only set fields that are not set yet. Options are: Overwrite (write all new), Location, City, State, Country, CountryCode. Multiple can be given for combination overwrite certain fields only or set only certain fields. If with overwrite the field will be overwritten if already set, else it will be always skipped.'
) )
# Google Maps API key to overcome restrictions # Google Maps API key to overcome restrictions
parser.add_argument('-g', '--google', parser.add_argument('-g', '--google',
dest = 'google_api_key', dest = 'google_api_key',
metavar = 'GOOGLE_API_KEY', metavar = 'GOOGLE API KEY',
help = 'Set a Google API Maps key to overcome the default lookup limitations' help = 'Set a Google API Maps key to overcome the default lookup limitations'
) )
# Do not create backup files
parser.add_argument('-n', '--nobackup',
dest = 'no_xmp_backup',
action = 'store_true',
help = 'Do not create a backup from the XMP file'
)
# verbose args for more detailed output # verbose args for more detailed output
parser.add_argument('-v', '--verbose', parser.add_argument('-v', '--verbose',
action = 'count', action = 'count',
@@ -325,6 +346,8 @@ data_set_original = {}
data_cache = {} data_cache = {}
# work files, all files + folders we need to work on # work files, all files + folders we need to work on
work_files = [] work_files = []
# all failed files
failed_files = []
# error flag # error flag
error = False error = False
# use lightroom # use lightroom
@@ -406,7 +429,7 @@ if args.debug:
print("### Work Files {}".format(work_files)) print("### Work Files {}".format(work_files))
# now we just loop through each file and work on them # now we just loop through each file and work on them
for xmp_file in work_files: for xmp_file in work_files:
print("---> {}".format(xmp_file)) print("---> {}: ".format(xmp_file), end = '')
#### ACTION FLAGs #### ACTION FLAGs
write_file = False write_file = False
lightroom_data_ok = True lightroom_data_ok = True
@@ -463,6 +486,7 @@ for xmp_file in work_files:
# base set done, now check if there is anything unset in the data_set, if yes do a lookup in google # base set done, now check if there is anything unset in the data_set, if yes do a lookup in google
# run this through the overwrite checker to get unset if we have a forced overwrite # run this through the overwrite checker to get unset if we have a forced overwrite
has_unset = False has_unset = False
failed = False
for loc in data_set_loc: for loc in data_set_loc:
if checkOverwrite(data_set[loc], loc, args.field_controls): if checkOverwrite(data_set[loc], loc, args.field_controls):
has_unset = True has_unset = True
@@ -483,6 +507,7 @@ for xmp_file in work_files:
# overwrite sets (note options check here) # overwrite sets (note options check here)
if args.debug: if args.debug:
print("### Google Location: {}".format(google_location)) print("### Google Location: {}".format(google_location))
# must have at least the country set to write anything back
if google_location['Country']: if google_location['Country']:
for loc in data_set_loc: for loc in data_set_loc:
# only write to XMP if overwrite check passes # only write to XMP if overwrite check passes
@@ -492,8 +517,8 @@ for xmp_file in work_files:
if write_file: if write_file:
count['google'] += 1 count['google'] += 1
else: else:
print("(!) Could not geo loaction for: {}".format(xmp_file)) print("(!) Could not geo loaction data ", end = '')
count['failed'] += 1 failed = True
else: else:
if args.debug: if args.debug:
print("Lightroom data use: {}, Lightroom data ok: {}".format(use_lightroom, lightroom_data_ok)) print("Lightroom data use: {}, Lightroom data ok: {}".format(use_lightroom, lightroom_data_ok))
@@ -511,15 +536,22 @@ for xmp_file in work_files:
if write_file: if write_file:
if not args.test: if not args.test:
# use copyfile to create a backup copy # use copyfile to create a backup copy
copyfile(xmp_file, "{}.BK.{}".format(os.path.splitext(xmp_file)[0], os.path.splitext(xmp_file)[1])) if not args.no_xmp_backup:
copyfile(xmp_file, "{}.BK{}".format(os.path.splitext(xmp_file)[0], os.path.splitext(xmp_file)[1]))
# write back to riginal file # write back to riginal file
with open(xmp_file, 'w') as fptr: with open(xmp_file, 'w') as fptr:
fptr.write(xmp.serialize_to_str(omit_packet_wrapper=True)) fptr.write(xmp.serialize_to_str(omit_packet_wrapper=True))
else: else:
print("[TEST] Would write {} to file {}".format(data_set, xmp_file)) print("[TEST] Would write {} ".format(data_set, xmp_file), end = '')
print("[UPDATED]")
count['changed'] += 1 count['changed'] += 1
elif failed:
print("[FAILED]")
count['failed'] += 1
# log data to array for post print
failed_files.append(xmp_file)
else: else:
print(". Data exists: SKIP") print("[SKIP]")
count['skipped'] += 1 count['skipped'] += 1
# close DB connection # close DB connection
@@ -527,15 +559,19 @@ lrdb.close()
# end stats # end stats
print("{}".format('=' * 30)) print("{}".format('=' * 30))
print("Found XMP Files : {:,}".format(count['all'])) print("XMP Files found : {:,}".format(count['all']))
print("Updated : {:,}".format(count['changed'])) print("Updated : {:,}".format(count['changed']))
print("Skipped : {:,}".format(count['skipped'])) print("Skipped : {:,}".format(count['skipped']))
print("New GeoLocation Google: {:,}".format(count['google'])) print("New GeoLocation Google: {:,}".format(count['google']))
print("GeoLocation from Cache: {:,}".format(count['cache'])) print("GeoLocation from Cache: {:,}".format(count['cache']))
print("Failed for Reverse Geo: {:,}".format(count['failed'])) print("Failed for Reverse Geo: {:,}".format(count['failed']))
if use_lightroom: if use_lightroom:
print("Geo from Lightroom : {:,}".format(count['lightroom'])) print("GeoLoc from Lightroom : {:,}".format(count['lightroom']))
print("No Lightroom data : {:,}".format(count['not_found'])) print("No Lightroom data : {:,}".format(count['not_found']))
# if we have failed data
if len(failed_files) > 0:
print("{}".format('-' * 30))
print("Files that failed to update:")
print("{}".format(', '.join(failed_files)))
# __END__ # __END__