GSLA - update handler to handle non gzipped files by lbesnard · Pull Request #264 · aodn/python-aodndata

lbesnard · 2021-04-19T06:57:52Z

Add logic to transform a received NetCDF file into a .nc.gz

historically, files were always sent as *.nc.gz. But as of April 2021, files might be pushed as *.nc
To be consistent with the existing dataset, and gogoduck, we transform this .nc into a .nz.gz

codecov · 2021-04-19T07:01:28Z

Codecov Report

Merging #264 (c2e1e13) into master (9d37dfe) will increase coverage by 0.06%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master     #264      +/-   ##
==========================================
+ Coverage   87.98%   88.04%   +0.06%     
==========================================
  Files          50       50              
  Lines        3221     3229       +8     
  Branches      536      537       +1     
==========================================
+ Hits         2834     2843       +9     
  Misses        243      243              
+ Partials      144      143       -1

Impacted Files	Coverage Δ
aodndata/gsla/handler.py	`94.00% <100.00%> (+1.60%)`	⬆️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 9d37dfe...c2e1e13. Read the comment docs.

lbesnard · 2021-04-19T07:04:53Z

linked to https://github.com/aodn/chef-private/pull/3799

mhidas

I just made a few suggestions to simplify handling of input file and file collection using recently added aodncore functionality.

mhidas · 2021-04-20T01:54:40Z

+            netcdf_collection = self.file_collection.filter_by_attribute_id('file_type', FileType.NETCDF)
+            netcdf_file = netcdf_collection[0]
+            netcdf_file.publish_type = PipelineFilePublishType.NO_ACTION


Since you already know it's a single netCDF file being handled, you can do this more simply...

Suggested change

netcdf_collection = self.file_collection.filter_by_attribute_id('file_type', FileType.NETCDF)

netcdf_file = netcdf_collection[0]

netcdf_file.publish_type = PipelineFilePublishType.NO_ACTION

self.file_collection.set_publish_types(PipelineFilePublishType.NO_ACTION)

mhidas · 2021-04-20T01:55:35Z

+            netcdf_file = netcdf_collection[0]
+            netcdf_file.publish_type = PipelineFilePublishType.NO_ACTION
+
+            gzip_path = os.path.join(self.temp_dir,  os.path.basename(self.input_file + '.gz'))


There's a shortcut available here too... (file_basename property)

Suggested change

gzip_path = os.path.join(self.temp_dir, os.path.basename(self.input_file + '.gz'))

gzip_path = os.path.join(self.temp_dir, self.file_basename + '.gz')

mhidas · 2021-04-20T02:03:45Z

+            netcdf_file_gz = PipelineFile(gzip_path, file_update_callback=self._file_update_callback)
+            netcdf_file_gz.publish_type = PipelineFilePublishType.HARVEST_UPLOAD
+
+            self.file_collection.add(netcdf_file_gz)


And again... (see aodn/python-aodncore#209)

Suggested change

netcdf_file_gz = PipelineFile(gzip_path, file_update_callback=self._file_update_callback)

netcdf_file_gz.publish_type = PipelineFilePublishType.HARVEST_UPLOAD

self.file_collection.add(netcdf_file_gz)

self.add_to_collection(gzip_path, publish_type=PipelineFilePublishType.HARVEST_UPLOAD)

lbesnard · 2021-04-21T02:01:55Z

thanks @mhidas I followed your suggestions

mhidas · 2021-04-21T03:34:43Z

👍 I haven't really looked at the unittests, but they all seem to be passing, so should be ok. Is this ready to merge then?

lbesnard · 2021-04-21T03:40:30Z

it is ! thanks. oh yeah I looked at them quite a lot!

GSLA - update handler to handle non gzipped files

58186ab

mhidas reviewed Apr 20, 2021

View reviewed changes

GSLA: code cleaning

c2e1e13

mhidas merged commit 01adcf2 into master Apr 21, 2021

mhidas deleted the gslaUpdate branch April 21, 2021 04:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GSLA - update handler to handle non gzipped files#264

GSLA - update handler to handle non gzipped files#264
mhidas merged 2 commits into
masterfrom
gslaUpdate

lbesnard commented Apr 19, 2021

Uh oh!

codecov Bot commented Apr 19, 2021 •

edited

Loading

Uh oh!

lbesnard commented Apr 19, 2021

Uh oh!

mhidas left a comment

Uh oh!

mhidas Apr 20, 2021

Uh oh!

mhidas Apr 20, 2021

Uh oh!

mhidas Apr 20, 2021

Uh oh!

lbesnard commented Apr 21, 2021

Uh oh!

mhidas commented Apr 21, 2021

Uh oh!

lbesnard commented Apr 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	gzip_path = os.path.join(self.temp_dir, os.path.basename(self.input_file + '.gz'))
	gzip_path = os.path.join(self.temp_dir, self.file_basename + '.gz')

Conversation

lbesnard commented Apr 19, 2021

Uh oh!

codecov Bot commented Apr 19, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

lbesnard commented Apr 19, 2021

Uh oh!

mhidas left a comment

Choose a reason for hiding this comment

Uh oh!

mhidas Apr 20, 2021

Choose a reason for hiding this comment

Uh oh!

mhidas Apr 20, 2021

Choose a reason for hiding this comment

Uh oh!

mhidas Apr 20, 2021

Choose a reason for hiding this comment

Uh oh!

lbesnard commented Apr 21, 2021

Uh oh!

mhidas commented Apr 21, 2021

Uh oh!

lbesnard commented Apr 21, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

codecov Bot commented Apr 19, 2021 •

edited

Loading