New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

added gaussian process #19

Open

Emaasit wants to merge 6 commits into parsing-science:master from Emaasit:master

Emaasit commented Oct 6, 2018

[1.1.4] - 2018-09-06

Added

Gaussian Process Regression, Sparse Gaussian Process Regression and Students T Process Regression models
Notebooks

Emaasit added 6 commits

October 4, 2018 01:51


          added gp, sparse gp, studentsT gp

f068daa


          Merge pull request #1 from Emaasit/gps

8de1682

added gp, sparse gp, studentsT gp


          added tests for gp, studentsT process

7bc517e


          Merge pull request #2 from Emaasit/gps

184dd3b

added tests for gp, studentsT process


          updated CHANGELOG

315087d


          Merge pull request #3 from Emaasit/gps

aa7244f

updated CHANGELOG

Contributor

rlouf commented Nov 26, 2018

Thanks for the contribution @Emaasit , I think this would make a great contribution to the library. I'll try to have a look at it in the next few days.

parsing-science requested changes

View reviewed changes

Owner

parsing-science left a comment

@Emaasit: Thanks for submitting this! I finally had the time to look at your PR.

I made a bunch of changes to the library recently so could you please merge/rebase in master and resolve any conflicts?

I'm not very familiar with Gaussian processes, but I looked in your notebooks and it seems like the models aren't doing very well. Is that expected behavior? I really like the Criticize the model steps in the notebooks though.

Most of my other comments are style things to make the code consistent with the rest of the library.

pymc3_models/__init__.py

		@@ -1,2 +1,9 @@
		__version__ = "1.1.3"

Owner

parsing-science Jan 8, 2019

I added in a _version.py file in master so you can remove this.

Owner

parsing-science Jan 8, 2019

I think you might have to modify your notebooks too.

pymc3_models/models/GaussianProcessRegression.py

+                  """
+                  def __init__(self, prior_mean=0.0):
+                      self.ppc = None

Owner

parsing-science Jan 8, 2019

Could these properties be alphabetized?

pymc3_models/models/GaussianProcessRegression.py

+                          # cov = signal_variance**2 * pm.gp.cov.ExpQuad(1, length_scale)
+                          cov = signal_variance ** 2 * pm.gp.cov.Matern52(1, length_scale)
+                          # mean_function = pm.gp.mean.Zero()

Owner

parsing-science Jan 8, 2019

Is this comment outdated since you allow the user to specify a prior_mean now?

pymc3_models/models/GaussianProcessRegression.py

+                          signal_variance = pm.HalfCauchy('signal_variance', beta=5, shape=(1))
+                          noise_variance = pm.HalfCauchy('noise_variance', beta=5, shape=(1))
+                          # cov = signal_variance**2 * pm.gp.cov.ExpQuad(1, length_scale)

Owner

parsing-science Jan 8, 2019

Is this line needed?

pymc3_models/models/GaussianProcessRegression.py

+                      if self.cached_model is None:
+                          self.cached_model = self.create_model()
+                      self._set_shared_vars({'model_input': X,

Owner

parsing-science Jan 8, 2019

I prefer to format longer lines like this:

self._set_shared_vars({
    'model_input': X,
    'model_output': np.zeros(num_samples)
})

Could you please change your code to match that style of indenting?

pymc3_models/models/GaussianProcessRegression.py

+                                             'model_output': np.zeros(num_samples)})
+                      with self.cached_model:
+                          f_pred = self.gp.conditional("f_pred", X)

Owner

parsing-science Jan 8, 2019

Please switch to single quotes to be consistent with the rest of the code.

pymc3_models/models/StudentsTProcessRegression.py

+                          'model_output': model_output,
+                      }
+                      self.gp = None

Owner

parsing-science Jan 8, 2019

I believe the gp property is already set to None since it inherits from GaussianProcessRegression. Is this needed here?

tests/models/test_GaussianProcessRegression.py

+                                                                             self.length_scale)
+                      mean_func = pm.gp.mean.Zero()
+                      f_ = np.random.multivariate_normal(mean_func(X).eval(),

Owner

parsing-science Jan 8, 2019

When I started running the unittests on Travis, I realized I forgot to set a random seed so the tests weren't repeatable. Could one to all your tests that generate data? Like here, https://github.com/parsing-science/pymc3_models/blob/master/tests/models/test_LinearRegression.py#L23

tests/models/test_GaussianProcessRegression.py

+                                             int(self.test_GPR.summary['mean']['noise_variance__0']),
+)
+                  # def test_nuts_fit_returns_correct_model(self):

Owner

parsing-science Jan 8, 2019

Is this test commented out because it takes too long to run? If so, could you please leave a comment about that?

tests/models/test_StudentsTProcessRegression.py



		class StudentsTProcessRegressionScoreTestCase(StudentsTProcessRegressionTestCase):
		def test_score_matches_sklearn_performance(self):

Owner

parsing-science Jan 8, 2019

I like this test a lot! Good call comparing to sklearn.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet