profile
viewpoint
If you are wondering where the data of this site comes from, please visit https://api.github.com/users/Carmezim/events. GitMemory does not store any data, but only uses NGINX to cache data for a period of time. The idea behind GitMemory is simply to give users a better reading experience.

Carmezim/MIT-6.S094 75

MIT-6.S094: Deep Learning for Self-Driving Cars Assignments solutions

Carmezim/crypto-twitter-sentiment-analysis 10

Sentiment analysis on Twitter crypto corpus in Keras

Carmezim/MIT-6.S191 7

Clone of Intro to Deep Learning (6.S191) repository with labs assignments

Carmezim/DeepLearningLectures2017 6

Curated list of Deep Learning academic courses

Carmezim/getstocks-cli 2

silly stock data CLI 📈

Carmezim/AIrtsy 1

TensorFlow Implementation of "A Neural Algorithm of Artistic Style".

Carmezim/Article-to-Speech 1

Chrome extension that generates audio from NYT articles.

Carmezim/CS224N 1

Assignments for Stanford CS224N Natural Language Processing

Carmezim/javascript 1

JavaScript Style Guide

Carmezim/numpy-exercises 1

Solutions for numpy exercises

issue closedAICoE/prometheus-anomaly-detector

PAD Crashes Every Ten Minutes

I've over provisioned the deployment running on a single metric over 5 thousand data points in a 1d period.

It does not matter though what resources I configure, parallelism etc the pod crashes and restarts every 10 minutes with exit code 137 (OOMkill).

closed time in 10 days

Carmezim

issue openedAICoE/prometheus-anomaly-detector

PAD Crashes Every Ten Minutes

I've over provisioned the deployment running on a single metric over 5 thousand data points in a 1d period.

It does not matter though what resources I configure, parallelism etc the pod crashes and restarts every 10 minutes with exit code 137 (OOMkill).

created time in 11 days

issue closedAICoE/prometheus-anomaly-detector

Pod Constantly Crashes And Restarts Silently (In DEBUG)

I seem to not be able to get the Tornado serving the endpoints. I am able to train the data and see the results though.

It seems the pod is crashing and restarting each few minutes without any logs in DEBUG mode so it never finishes training and get to start the server.

This is the service

spec:
  ports:
    - name: metrics
      protocol: TCP
      port: 8080
      targetPort: 8080
  selector:
    app.kubernetes.io/name: prometheus-anomaly-detector
  clusterIP: *******
  clusterIPs:
    - ******
  type: ClusterIP
  sessionAffinity: None
status:
  loadBalancer: {}

The deployment ports:

spec:
      containers:
        - name: prometheus-anomaly-detector
          image: prometheus-anomaly-detector-image
          ports:
            - containerPort: 8080
              protocol: TCP
          hostNetwork: false

Running master.

closed time in 13 days

Carmezim

issue commentAICoE/prometheus-anomaly-detector

Pod Constantly Crashes And Restarts Silently (In DEBUG)

That was given under-provisioning the deployment.

Carmezim

comment created time in 13 days

issue openedAICoE/prometheus-anomaly-detector

Cannot Access Web Server

I seem to not be able to get the Tornado serving the endpoints. I am able to train the data and see the results though.

When doing a wget from the container to localhost:8080 I get connection refused and Prometheus is unable to reach the PAD target.

This is the service

spec:
  ports:
    - name: metrics
      protocol: TCP
      port: 8080
      targetPort: 8080
  selector:
    app.kubernetes.io/name: prometheus-anomaly-detector
  clusterIP: *******
  clusterIPs:
    - ******
  type: ClusterIP
  sessionAffinity: None
status:
  loadBalancer: {}

The deployment ports:

spec:
      containers:
        - name: prometheus-anomaly-detector
          image: prometheus-anomaly-detector-image
          ports:
            - containerPort: 8080
              protocol: TCP

Running master.

created time in 17 days