SlideShare a Scribd company logo
1 of 69
Ne#lix	
  Cloud	
  Pla#orm	
  
  Building	
  Blocks	
  
Architectural	
  Building	
  Blocks	
  
             On	
  	
  
   Amazon	
  Public	
  Cloud	
  


         Sudhir	
  Tonse	
  	
  	
  	
  	
  	
  	
  (@stonse)	
  
                    #gitpronflx	
  
IntroducEon	
  
What	
  is	
  Ne#lix?	
  
 With	
  more	
  than	
  26	
  million	
  streaming	
  
 members	
  in	
  the	
  United	
  States,	
  Canada,	
  
 LaEn	
  America,	
  the	
  United	
  Kingdom	
  and	
  
 Ireland,	
  Ne#lix,	
  Inc.	
  (NASDAQ:	
  NFLX)	
  is	
  
 the	
  world's	
  leading	
  internet	
  subscripEon	
  
 service	
  for	
  enjoying	
  movies	
  and	
  TV	
  
 programs.	
  	
  
 …	
  
 In	
  all,	
  more	
  than	
  800	
  devices	
  that	
  stream	
  
 from	
  Ne#lix	
  are	
  available.	
  
 	
  
 (hp://ir.netflix.com)	
  




hp://bit.ly/LWST5w	
  
	
  
Who	
  Am	
  I	
  
•  In	
  the	
  Movie	
  Business	
  J	
  
    –  Manager,	
  Cloud	
  Pla#orm/	
  Infrastructure	
  @	
  Ne#lix	
  
    –  @	
  Ne#lix	
  since	
  2008	
  
    –  Prior	
  day	
  jobs	
  
        •  System	
  Architect/Lead	
  @	
  AOL	
  (Netscape,	
  iPlanet,	
  Sun)	
  
•  	
  	
  	
  	
  	
  	
  @stonse	
  
•  	
  	
  	
  	
  	
  	
  hp://www.linkedin.com/in/sudhirtonse	
  

            Important:	
  This	
  talk	
  is	
  a	
  developer	
  community	
  outreach	
  by	
  me	
  as	
  an	
  individual	
  and	
  	
  
            the	
  content	
  here	
  may	
  or	
  may	
  not	
  reflect	
  Ne#lix’s	
  official	
  view.	
  
Why	
  am	
  I	
  here?	
  
•  Share	
  the	
  Story	
  of	
  Ne,lix	
  and	
  its	
  use	
  of	
  the	
  
   Amazon	
  Cloud	
  
     –  Why	
  did	
  Ne#lix	
  move	
  to	
  the	
  Cloud?	
  
     –  How	
  did	
  we	
  move?	
  
     –  What	
  did	
  we	
  learn?	
  
•  Share	
  Technical	
  Challenges	
  and	
  SoluEons	
  
     –  Contribute	
  back	
  to	
  the	
  community	
  
•  Perhaps	
  Interest	
  you	
  in	
  Helping	
  us	
  Reach	
  the	
  
   Next	
  Steps	
  
     –  Yes,	
  I	
  am	
  Hiring!	
  
What	
  is	
  in	
  it	
  for	
  You?	
  
•    Various	
  Open	
  Source	
  Offerings	
  
•    Tech	
  papers	
  
•    Blogs	
  &	
  ArEcles	
  
•    Meetups	
  and	
  Talks	
  like	
  this	
  J	
  
What’s	
  in	
  it	
  for	
  Ne#lix?	
  
      bird	
  in	
  a	
  Big	
  Cloud	
  
•  Small	
  

•  Tech	
  Community	
  Engagement	
  
•  Open	
  Source	
  ContribuEons	
  
Cloud	
  
Cloud	
  
•  What	
  is	
  it?	
  
•  Why	
  Cloud?	
  
What’s	
  a	
  Cloud?	
  
•  Cloud:	
  
Cloud	
  compu<ng	
  is	
  the	
  delivery	
  of	
  
compuEng	
  and	
  storage	
  capacity	
  [1]	
  as	
  a	
  service	
  
[2]	
  to	
  a	
  heterogeneous	
  community	
  of	
  end-­‐
recipients.	
  




                                                                          Images	
  Courtesy:	
  Wikipedia/Company	
  logos	
  
Cloud	
  Stack	
  
                         Clients	
  
           Browsers,	
  Mobile,	
  Televisions	
  …	
  
                            	
  




                       SaaS/ApplicaEons	
                                         Ne#lix	
  
                              	
                                               Apps/Services	
  


                                   PaaS	
                                          Ne#lix	
  
ExecuEon	
  Env	
  (JVM),	
  Web/App	
  Servers,	
  Frameworks,	
  Tools	
     Cloud	
  Pla#orm	
  

                                    IaaS	
  
      Virtual	
  Machines,	
  Networking,	
  Load	
  Balancers	
  …	
  	
  
                                      	
  
Ne#lix	
  Cloud	
  Pla#orm	
  

•  PaaS	
  Building	
  Blocks	
  on	
  top	
  of	
  Amazon’s	
  IaaS	
  
Pla#orm	
  Blocks	
  
 InternalizaEon/                                     App	
  Infrastructure	
  
                              Messaging	
  
    L10N/Geo	
  

                                                     Server	
       Client	
  

     Security	
                Big	
  Data	
  	
  




Tools/Frameworks	
       Design/Architecture	
  



                            ConfiguraEon	
  
   DiagnosEcs	
             Management	
  
Web	
  Scale	
  
•    Billions	
  of	
  Requests	
  per	
  day	
  
•    Terabytes	
  of	
  data	
  per	
  day	
  
•    Millions	
  of	
  Metric	
  data	
  points	
  per	
  day	
  
•    Hundreds	
  of	
  services	
  	
  
Why	
  Cloud	
  




hp://techblog.ne#lix.com/2011/02/redesigning-­‐ne#lix-­‐api.html	
  
Why	
  Cloud	
  contd	
  …	
  
•  UndifferenEated	
  Heavy	
  Lising	
  

   –  MulE	
  Region	
  
   –  On	
  Demand	
  CompuEng	
  Power	
  
   –  Tons	
  of	
  Features	
  J	
  
On	
  Demand	
  Auto	
  Scaling!	
  
•  Traffic	
  Paerns	
  




                                          Compute	
  




                                                                                  Compute	
  
•  Scale	
  UP	
  &	
  Down	
  
   based	
  on	
  Demand	
  
           –  Use	
  CloudWatch	
                                                                     Time	
  
                 •  RPS	
  
                                                                Time	
  
                 •  Load	
  Average	
                     Slow	
  Growth	
                      Periodic	
  Jobs	
  
                 •  …	
  




                                                                                 Compute	
  
   Compute	
  




                                                Compute	
  




                   Time	
                                         Time	
                              Time	
  
          Predictable	
  Bursts	
                  Unpredictable	
  Bursts	
                    Steady	
  State	
  
Instance	
  
                                                                    Instance	
  
                                 Instance	
  


                                                                                   Instance	
                Instance	
  
                                                                                                                                           Instance	
  
                  Instance	
  
                                                     Instance	
  


                  Scale	
  Up	
                                                                              Instance	
  

                                                                                   Instance	
  
                                      Instance	
  

                                                                                                                            Instance	
  


                                                                                   Instance	
  




Scale	
  Up	
  
Scale	
  Down	
                                                            Instance	
  

                                      Instance	
  


                    Instance	
  

                                                            Instance	
  



                                   Instance	
  
                                                                           Instance	
  



                                                     Instance	
  
Story	
  of	
  Ne#lix	
  
DataCenter	
  to	
  Cloud	
  Timeline	
  
DC	
  to	
  SOA	
  
•  Old	
  DataCenter	
              •  Ne#lix	
  Cloud	
  
   (2008)	
                            (2012)	
  
•  Everything	
  in	
  one	
        •  100s	
  of	
  Fine	
  
   WebApp	
  (.war)	
  	
              Grained	
  Services	
  
Old	
  Lessons	
  
•  One	
  missing	
  Semi	
  Colon	
  can	
  bring	
  your	
  site	
  
   down!!!	
  
    –  Runaway	
  Thread	
  	
  
        •  Lessons	
  
             –  Async	
  execuEon	
  
             –  Timed	
  gets()	
  	
  (i.e.	
  use	
  java	
  Future)	
  
Deployment	
  Concepts	
  
                                              ApplicaEon	
  


                    Cluster	
  1	
                        Cluster	
  2	
               Cluster	
  n	
  


    ASG	
  1	
                 ASG	
  2	
                               ASG	
  1	
  



Instances	
               Instances	
                              Instances	
  
Sample	
  Deployment	
  Architecture	
  
Showcasing	
  Pla#orm	
  Components	
  
What’s	
  in	
  a	
  name?	
  
•  Cloud	
  instances	
  are	
  ephemeral	
  
   –  They	
  have	
  no	
  fixed	
  NAME	
  
   –  The	
  have	
  a	
  public	
  IP	
  address,	
  a	
  private	
  IP	
  address	
  
      and	
  can	
  opEonally	
  be	
  associated	
  with	
  an	
  ElasEc	
  IP	
  
      Address	
  
   –  How	
  can	
  you	
  address	
  your	
  services?	
  
        •  Via	
  ElasEc	
  IP	
  (but	
  these	
  are	
  limited	
  per	
  account)	
  
        •  Route	
  53	
  (A	
  DNS	
  service	
  offered	
  by	
  Amazon)	
  
        •  Ne#lix	
  uses	
  in-­‐house	
  app	
  called	
  Discovery	
  Service	
  
             –  Keeper	
  of	
  addresses	
  and	
  metadata	
  of	
  running	
  instances	
  


                                                                                           Shakespeare	
  
Inter	
  Process	
  CommunicaEon	
  
•  Ne#lix	
  uses	
  NIWS	
  
   –  Ne#lix	
  Internal	
  Web	
  Services	
  
   –  Common	
  infrastructural	
  library	
  that	
  aids	
  in	
  RPC	
  
       •  Based	
  on	
  JSR-­‐311	
  (Jersey)	
  
       •  Uses	
  Discovery	
  Service	
  to	
  obtain	
  instances	
  of	
  every	
  
          service	
  
       •  Has	
  an	
  in	
  built	
  Mid	
  Tier	
  s/w	
  LoadBalancer	
  




                                                                                  Sudhir	
  Tonse	
  
BiopSys	
  
                                                                            Danny	
  Yuan	
  

•  Search	
  Logs	
  on	
  1000s	
  of	
  Amazon	
  Instances	
  
   –  Per	
  Cluster,	
  Apps,	
  Instances,	
  Time	
  Range	
  etc.	
  
S3	
  DiagnosEcs	
  
•  Help	
  Debug	
  S3	
  Latency	
  if	
  any	
  	
  




                                                         Sudhir	
  Tonse	
  
Metrics	
  
•  One	
  cannot	
  fully	
  Understand	
  what	
  One	
  cannot	
  
   Observe	
  J	
  
•  Ne#lix	
  Pla#orm	
  has	
  several	
  Metrics/Data	
  
   CollecEon	
  components	
  
   –  Servo	
  (	
  @Monitors)	
  
   –  Tracers/Counters	
  
   –  Chukwa	
  (for	
  Log	
  Events	
  and	
  Business	
  Metrics)	
  
   –  More	
  J	
  
Metrics	
  




              @royrapaport	
  
Cassandra	
  Dashboard	
  
•  Visualize	
  Status	
  of	
  MulEple	
  Cassandra	
  Clusters	
  




                                                             Eran	
  Landau	
  
Lessons	
  Learned	
  
•  Roman	
  Riding	
  is	
  hard	
  
    –  e.g.	
  sharing	
  traffic	
  between	
  Datacenter	
  (SQL)	
  and	
  
       Cloud	
  (NoSQL)	
  
•  Plan	
  for	
  Failure	
  
    –  Test	
  for	
  Failure	
  (Chaos	
  Monkey	
  &	
  Simian	
  Army)	
  
Commandments	
  of	
  Web	
  Scale	
  	
  
    Cloud	
  Deployment	
  
Cloud	
  Commandments	
  
1.  Thou shalt not have Sticky in-memory sessions	

     –  Hard to Scale
2.  Thou shalt not direclty use a Central SQL database in the user request path	

     –  Atleast not one that uses locks and transactions
3.  Thou shalt not store important data on ephemeral instances	

     –  These are lost when instances go down. Use EBS volumes, S3 or other persistence stores
4.  Thou shalt embrace a homogenous architecture	

     –  Much easier to achieve operational efficiency
5.  Thou shalt understand and embrace the CAP theorem	

     –  Choice between CP and AP. Most web scale deployments choose AP
6.  Thou shalt gaurd all external calls using the 
    Dependency Command Pattern	

     –  Idea is to effectively gaurd user request procesing threads
7.  Thou shalt be prepared to scale according to thy needs	

     –  Web traffic can come in bursts, its important to scale up/down the whole SOA stack based on
        resources needed
Cloud	
  Commandments	
  contd…	
  
8.  Thou shalt keep a wary eye on thy cost	

     –  It all adds up eventually. Plenty of low hanging fruits avaialble to save costs
9.  Thou shalt secure thy data and instances	

     –  Encrypt data; secure access to instances. (Pay attention to Security Groups)
10. Thou shalt instrument thy code	

     –  You cant trust what you cant see
11. Thou shalt effectively monitor thy access points	

     –  Its the cloud and things can go wrong or go reaaaal slooow
12. Thou shalt deploy thy instances in multiple regions 
    and zones	

     –  For maximizing SLAs and availability
13. Thou shalt be wary of SPOF	

     –  Mantra of distributed system design
14. Thou shalt always plan for failure	

     –  Its just a question of when, not if. Have a good backup plan
Concepts	
  
•  Throling/Metering	
  
   –  Thundering	
  Herd	
  (Retry	
  Storms)	
  
•  Graceful	
  DegradaEon	
  
   –  Appropriate	
  Fallbacks	
  
Metering	
  
•  Protocol	
  Level	
  
    –  NIWS	
  features	
  
         •  Client	
  Side	
  Guards	
  
         •  Service	
  Side	
  Metering	
  
•  Client	
  API	
  level	
  
    –  Dependency	
  Command	
  Paern	
  
Dependency	
  Command	
  Paern	
  




      hp://techblog.ne#lix.com/search/label/fault-­‐tolerance	
  
      	
                                                             Ben	
  Christensen	
  
Dependency	
  Command	
  	
  
  Effect	
  of	
  Latency	
  …	
  
Dependency	
  Command	
  
Dependency	
  Command	
  
•  network	
  Emeouts	
  and	
  retries	
  
•  separate	
  threads	
  on	
  per-­‐dependency	
  thread	
  
   pools	
  
•  semaphores	
  (via	
  a	
  
   tryAcquire,	
  not	
  a	
  blocking	
  call)	
  
•  circuit	
  breakers	
  
Failures	
  
•  Failures	
  will	
  happen	
  
    –  It’s	
  a	
  quesEon	
  of	
  when	
  and	
  how	
  NOT	
  “if”	
  
    –  Plan	
  
         •  Regularly	
  Test	
  for	
  possible	
  Failures	
  
              –  Ne#lix	
  Simian	
  Army:	
  e.g.	
  Chaos	
  Monkey,	
  Latency	
  Monkey	
  …	
  
         •  Severity	
  
              –  Minimize	
  the	
  impact	
  of	
  a	
  failure	
  
         •  Occurrence	
  
              –  Minimize	
  the	
  frequency	
  of	
  a	
  failure	
  
         •  Observability	
  
              –  Minimize	
  the	
  Eme	
  to	
  detect	
  and	
  respond	
  
Simian	
  Army	
  
Chaos	
  Monkey	
  	
  
    •	
  Simulates	
  hard	
  failures	
  in	
  AWS	
  by	
  killing	
  a	
  few	
  
    instances	
  per	
  ASG	
  (e.g.	
  Auto	
  Scale	
  Group)	
  	
  
                •	
  Similar	
  to	
  how	
  EC2	
  instances	
  can	
  be	
  killed	
  by	
  
                AWS	
  with	
  lile	
  warning	
  	
  
    •	
  Tests	
  clientsʼ	
  ability	
  to	
  gracefully	
  deal	
  with	
  
    broken	
  connecEons,	
  interrupted	
  calls,	
  etc...	
  	
  
    •	
  Verifies	
  that	
  all	
  services	
  are	
  running	
  within	
  the	
  
    protecEon	
  of	
  AWS	
  Auto	
  Scale	
  Groups,	
  which	
  
    reincarnates	
  killed	
  instances	
  
    •	
  If	
  not,	
  the	
  Chaos	
  monkey	
  will	
  win!	
  	
  


  Conformity	
  Monkey	
  .	
  	
  
      •	
  Verifies	
  that	
  all	
  services	
  are	
  running	
  
      within	
  the	
  protecEon	
  of	
  AWS	
  Auto	
  Scale	
  
      Groups,	
  which	
  reincarnates	
  killed	
  
      instances	
  
      •	
  If	
  not,	
  app/service	
  team	
  is	
  noEfied	
  
Simian	
  Army	
  …	
  
Latency	
  Monkey	
  	
  
    •	
  Simulates	
  sos	
  failures	
  -­‐-­‐	
  i.e.	
  a	
  service	
  
    gets	
  slower	
  	
  
    •	
  Injects	
  random	
  delays	
  in	
  NIWS	
  (client-­‐
    side)	
  or	
  Server	
  (server-­‐side)	
  of	
  a	
  client-­‐
    server	
  interacEon	
  
    •	
  Tests	
  the	
  ability	
  of	
  applicaEons	
  to	
  detect	
  
    and	
  recover	
  (i.e.	
  Graceful	
  DegradaEon)	
  
    from	
  the	
  harder	
  problem	
  of	
  delays,	
  that	
  
    leads	
  to	
  thundering	
  herd	
  and	
  Emeouts	
  	
  

Other	
  Monkeys	
  	
  
    •  Security	
  Monkey	
                                                   Chaos	
  Gorilla	
  
    •  Janitor	
  Monkey	
                                                    •  Simulates	
  Zone	
  Outage	
  
    •  Efficiency	
  Monkey	
  
    •  ..	
  more	
  
    	
  
Building	
  Redundancy	
  and	
  Availability	
  

•  Deploy	
  in	
  mulEple	
  zone	
  and	
  consider	
  mulEple	
  regions	
  
•  Pay	
  aenEon	
  to	
  various	
  modes	
  of	
  failures	
  
Three	
  Balanced	
  Availability	
  Zones	
  
                                        	
  

                            Load	
  Balancers	
  




       Zone	
  A	
                  Zone	
  B	
                Zone	
  C	
  

 Persistence	
  Store	
      Persistence	
  Store	
     Persistence	
  Store	
  




                                                         Courtesy	
  @adrianco	
  
Triple	
  Replicated	
  Persistence	
  
                           	
  
                           Load	
  Balancers	
  




      Zone	
  A	
                  Zone	
  B	
                Zone	
  C	
  

Persistence	
  Store	
      Persistence	
  Store	
     Persistence	
  Store	
  
Isolated	
  Regions	
  

                           US-­‐East	
  Load	
  Balancers	
                                                             EU-­‐West	
  Load	
  Balancers	
  




      Zone	
  A	
                             Zone	
  B	
                    Zone	
  C	
                Zone	
  A	
                        Zone	
  B	
                 Zone	
  C	
  

Persistence	
  Store	
                 Persistence	
  Store	
         Persistence	
  Store	
     Cassandra	
  Replicas	
            Cassandra	
  Replicas	
     Cassandra	
  Replicas	
  
Cassandra	
  Global	
  Ring	
  




     Reference:	
  hp://techblog.ne#lix.com	
  
     	
  
Tips	
  	
  Guidelines	
  
Tips	
  	
  Guidelines	
  contd	
  …	
  
•  Amazon	
  CloudWatch	
  
    –  Is	
  your	
  friend!	
  Ne#lix	
  Servo	
  (
       hp://github.com/ne#lix/servo)	
  helps	
  you	
  publish	
  
       metrics	
  to	
  CloudWatch	
  
•  ELB	
  
    –  Always	
  keep	
  your	
  Zones	
  Balanced!	
  
    –  Healthcheck	
  URLs	
  are	
  important	
  	
  
•  Auto	
  Scaling	
  Groups	
  
    –  This	
  is	
  an	
  amazing	
  feature	
  that	
  can	
  really	
  save	
  you	
  $$
       $s	
  and	
  help	
  you	
  run	
  more	
  efficiently.	
  Read	
  
       hp://bit.ly/NgwS0K	
  
Tips	
  	
  Guidelines	
  contd	
  …	
  
•  Keep	
  acEve	
  track	
  of	
  Usage	
  Costs	
  
    –  Usage	
  costs	
  can	
  surprise	
  you!	
  
    –  Ne#lix	
  has	
  an	
  internal	
  tool	
  which	
  we	
  may	
  open	
  
       source.	
  Watch	
  @Ne#lixOSS	
  
•  Reserve	
  Instances	
  
    –  ReservaEon	
  can	
  save	
  you	
  $$$s	
  (upto	
  71%	
  !!)	
  
       (YMMV)	
  
    –  Guarantees	
  availability	
  when	
  you	
  need	
  it	
  
Tips/Guidelines	
  
•  S3	
  Best	
  PracEces	
  
    –  Amazon	
  doc:	
  hp://bit.ly/MW93xj	
  
    –  Know	
  when	
  to	
  use	
  Regional	
  S3	
  Endpoints	
  
         •  Important	
  when	
  your	
  dev/test	
  team	
  and	
  deployments	
  are	
  in	
  
            different	
  regions	
  
    –  Use	
  Smart	
  Bucket/Key	
  naming	
  
         •  Use	
  3	
  to	
  63	
  characters.	
  
         •  Use	
  only	
  lower	
  case	
  leers	
  (at	
  least	
  one),	
  numbers,	
  '.'	
  and	
  '-­‐'.	
  
         •  Don't	
  start	
  or	
  end	
  the	
  bucket	
  name	
  with	
  '.'	
  and	
  don't	
  follow	
  or	
  
            precede	
  a	
  '.'	
  with	
  a	
  '-­‐'.	
  
    –  Compress	
  Data	
  
    –  Use	
  TTLs	
  	
  
    –  Many	
  more	
  …	
  
         	
  
Open	
  Source	
  
Open	
  Source	
  	
  
•  @Ne#lixOSS	
  
•  hp://ne#lix.github.com/	
  
•  Built	
  for	
  the	
  CLOUD	
  
How	
  can	
  you	
  benefit?	
  
Deployment	
  Tool	
  
•  ASGARD	
  
ConfiguraEon	
  Management	
  
•  Archaius	
  (ProperEes	
  Management)	
  
•  More	
  
   Coming	
  
   Soon	
  …	
  
NoSQL	
  Persistence	
  
            Cassandra	
  based	
  offerings	
  
•  Priam	
  (Token	
  Management)	
  
•  Astyanax	
  (Cassandra	
  Client)	
  
•  Jmeter	
  plugin	
  for	
  Load	
  Tests	
  
Technical	
  Knowledge	
  Sharing	
  
•  hp://techblog.ne#lix.com	
  
   –  Cloud	
  Usage	
  
   –  PersonalizaEon	
  	
  RecommendaEons	
  
   –  Hadoop	
  and	
  Big	
  Data	
  papers	
  
   –  CDN	
  (Content	
  Delivery	
  Networks)	
  
   –  General	
  Architectural	
  Guidelines	
  
   –  Performance	
  	
  Scalability	
  
•  Slideshare	
  
   –  hp://www.slideshare.net/search/slideshow?
      searchfrom=headerq=Ne#lix	
  
New	
  Challenges	
  
•  More	
  Global	
  Expansion	
  
•  Real	
  Time	
  Data	
  Infrastructure	
  
•  March	
  towards	
  Connuous	
  IntegraEon	
  and	
  
   Deployment	
  
Ne#lix	
  
•  Freedom	
  and	
  Responsibility	
  
   –  Empower	
  engineers	
  
   –  #DevOps	
  
   –  Context	
  not	
  Control	
  
Want	
  to	
  Join	
  us?	
  
hp://jobs.ne#lix.com/jobs.html	
  
	
  
Credits	
  
Adrian	
  Cockros	
  	
  (@adrianco),	
  	
  
Ruslan	
  Meshenberg	
  (@rusmeshenberg),	
  Yury	
  Izrailevsky,	
  	
  
Joe	
  Sondow	
  (@joesondow),	
  Ben	
  Christensen	
  (@benchristensen),	
  
Jordan	
  Zimmerman	
  (@randgalt),	
  Ariel	
  Tseltlin	
  (@atseitlin),	
  	
  
Allen	
  Wang,	
  
Eran	
  Landau,	
  Danny	
  Yuan,	
  	
  Pradeep	
  Kamath	
  
	
  
And	
  	
  
	
  
Members	
  of	
  the	
  Ne#lix	
  Cloud	
  Pla#orm	
  Team	
  
	
  
Q	
  	
  A	
  
•  	
  	
  	
  	
  	
  	
  	
  	
  @stonse	
  
Amazon Cloud Terminology Reference
     See http://aws.amazon.com/ This is not a full list of Amazon Web Service features
                                 (courtesy @adrianco)

•    AWS	
  –	
  Amazon	
  Web	
  Services	
  (common	
  name	
  for	
  Amazon	
  cloud)	
  
•    AMI	
  –	
  Amazon	
  Machine	
  Image	
  (archived	
  boot	
  disk,	
  Linux,	
  Windows	
  etc.	
  plus	
  applicaEon	
  code)	
  
•    EC2	
  –	
  ElasEc	
  Compute	
  Cloud	
  
       –    Range	
  of	
  virtual	
  machine	
  types	
  m1,	
  m2,	
  c1,	
  cc,	
  cg.	
  Varying	
  memory,	
  CPU	
  and	
  disk	
  configuraEons.	
  
       –    Instance	
  –	
  a	
  running	
  computer	
  system.	
  Ephemeral,	
  when	
  it	
  is	
  de-­‐allocated	
  nothing	
  is	
  kept.	
  
       –    Reserved	
  Instances	
  –	
  pre-­‐paid	
  to	
  reduce	
  cost	
  for	
  long	
  term	
  usage	
  
       –    Availability	
  Zone	
  –	
  datacenter	
  with	
  own	
  power	
  and	
  cooling	
  hosEng	
  cloud	
  instances	
  
       –    Region	
  –	
  group	
  of	
  Availability	
  Zones	
  –	
  US-­‐East,	
  US-­‐West,	
  EU-­‐Eire,	
  Asia-­‐Singapore,	
  Asia-­‐Japan	
  
•    ASG	
  –	
  Auto	
  Scaling	
  Group	
  (instances	
  booEng	
  from	
  the	
  same	
  AMI)	
  
•    S3	
  –	
  Simple	
  Storage	
  Service	
  (hp	
  access)	
  
•    EBS	
  –	
  ElasEc	
  Block	
  Storage	
  (network	
  disk	
  filesystem	
  can	
  be	
  mounted	
  on	
  an	
  instance)	
  
•    RDS	
  –	
  RelaEonal	
  Database	
  Service	
  (managed	
  MySQL	
  master	
  and	
  slaves)	
  
•    SDB	
  –	
  Simple	
  Data	
  Base	
  (hosted	
  hp	
  based	
  NoSQL	
  data	
  store)	
  
•    SQS	
  –	
  Simple	
  Queue	
  Service	
  (hp	
  based	
  message	
  queue)	
  
•    SNS	
  –	
  Simple	
  NoEficaEon	
  Service	
  (hp	
  and	
  email	
  based	
  topics	
  and	
  messages)	
  
•    EMR	
  –	
  ElasEc	
  Map	
  Reduce	
  (automaEcally	
  managed	
  Hadoop	
  cluster)	
  
•    ELB	
  –	
  ElasEc	
  Load	
  Balancer	
  
•    EIP	
  –	
  ElasEc	
  IP	
  (stable	
  IP	
  address	
  mapping	
  assigned	
  to	
  instance	
  or	
  ELB)	
  
•    VPC	
  –	
  Virtual	
  Private	
  Cloud	
  (extension	
  of	
  enterprise	
  datacenter	
  network	
  into	
  cloud)	
  
•    IAM	
  –	
  IdenEty	
  and	
  Access	
  Management	
  (fine	
  grain	
  role	
  based	
  security	
  keys)	
  

More Related Content

What's hot

AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~
AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~
AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~Daisuke Ikeda
 
ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317Amazon Web Services
 
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...Amazon Web Services
 
Amazon Kinesis Familyを活用したストリームデータ処理
Amazon Kinesis Familyを活用したストリームデータ処理Amazon Kinesis Familyを活用したストリームデータ処理
Amazon Kinesis Familyを活用したストリームデータ処理Amazon Web Services Japan
 
ヤフーでは開発迅速性と品質のバランスをどう取ってるか
ヤフーでは開発迅速性と品質のバランスをどう取ってるかヤフーでは開発迅速性と品質のバランスをどう取ってるか
ヤフーでは開発迅速性と品質のバランスをどう取ってるかYahoo!デベロッパーネットワーク
 
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018Amazon Web Services
 
マネジメントレスのススメ - VPoE-Meetup by manaboo
マネジメントレスのススメ - VPoE-Meetup by manabooマネジメントレスのススメ - VPoE-Meetup by manaboo
マネジメントレスのススメ - VPoE-Meetup by manabooNoriaki Kadota
 
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例Amazon Web Services Japan
 
serverspecでサーバ環境のテストを書いてみよう
serverspecでサーバ環境のテストを書いてみようserverspecでサーバ環境のテストを書いてみよう
serverspecでサーバ環境のテストを書いてみようDaisuke Ikeda
 
A Starter Guide to IT Managed Services
A Starter Guide to IT Managed ServicesA Starter Guide to IT Managed Services
A Starter Guide to IT Managed ServicesDavid Castro
 
浸透するサーバーレス 実際に見るユースケースと実装パターン
浸透するサーバーレス 実際に見るユースケースと実装パターン浸透するサーバーレス 実際に見るユースケースと実装パターン
浸透するサーバーレス 実際に見るユースケースと実装パターンAmazon Web Services Japan
 
AWS認定12冠制覇への道
AWS認定12冠制覇への道AWS認定12冠制覇への道
AWS認定12冠制覇への道Junji Koide
 
Amazon S3による静的Webサイトホスティング
Amazon S3による静的WebサイトホスティングAmazon S3による静的Webサイトホスティング
Amazon S3による静的WebサイトホスティングYasuhiro Horiuchi
 
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnight
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnightYahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnight
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnightYahoo!デベロッパーネットワーク
 
もしSIerのエンジニアがSRE本を読んだら
もしSIerのエンジニアがSRE本を読んだらもしSIerのエンジニアがSRE本を読んだら
もしSIerのエンジニアがSRE本を読んだらTomoki Ando
 
4つの戦犯から考えるサービスづくりの失敗
4つの戦犯から考えるサービスづくりの失敗4つの戦犯から考えるサービスづくりの失敗
4つの戦犯から考えるサービスづくりの失敗toshihiro ichitani
 
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門Shuji Kikuchi
 
글로벌 ITSM시장 트렌드, Global ITSM Market trends
글로벌 ITSM시장 트렌드, Global ITSM Market trends글로벌 ITSM시장 트렌드, Global ITSM Market trends
글로벌 ITSM시장 트렌드, Global ITSM Market trendsHyunmyung Kim
 
キーワード駆動によるシステムテストの自動化について 2015
キーワード駆動によるシステムテストの自動化について 2015キーワード駆動によるシステムテストの自動化について 2015
キーワード駆動によるシステムテストの自動化について 2015Toru Koido
 

What's hot (20)

AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~
AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~
AWSを含めたハイブリッド環境の監視の実現 ~zabbixのクラウド対応モジュールHyClops~
 
Modeling Microservices
Modeling MicroservicesModeling Microservices
Modeling Microservices
 
ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317ABD317_Building Your First Big Data Application on AWS - ABD317
ABD317_Building Your First Big Data Application on AWS - ABD317
 
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...
(ENT209) Netflix Cloud Migration, DevOps and Distributed Systems | AWS re:Inv...
 
Amazon Kinesis Familyを活用したストリームデータ処理
Amazon Kinesis Familyを活用したストリームデータ処理Amazon Kinesis Familyを活用したストリームデータ処理
Amazon Kinesis Familyを活用したストリームデータ処理
 
ヤフーでは開発迅速性と品質のバランスをどう取ってるか
ヤフーでは開発迅速性と品質のバランスをどう取ってるかヤフーでは開発迅速性と品質のバランスをどう取ってるか
ヤフーでは開発迅速性と品質のバランスをどう取ってるか
 
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018
Amazon VPC: Security at the Speed Of Light (NET313) - AWS re:Invent 2018
 
マネジメントレスのススメ - VPoE-Meetup by manaboo
マネジメントレスのススメ - VPoE-Meetup by manabooマネジメントレスのススメ - VPoE-Meetup by manaboo
マネジメントレスのススメ - VPoE-Meetup by manaboo
 
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例
[よくわかるクラウドデータベース] CassandraからAmazon DynamoDBへの移行事例
 
serverspecでサーバ環境のテストを書いてみよう
serverspecでサーバ環境のテストを書いてみようserverspecでサーバ環境のテストを書いてみよう
serverspecでサーバ環境のテストを書いてみよう
 
A Starter Guide to IT Managed Services
A Starter Guide to IT Managed ServicesA Starter Guide to IT Managed Services
A Starter Guide to IT Managed Services
 
浸透するサーバーレス 実際に見るユースケースと実装パターン
浸透するサーバーレス 実際に見るユースケースと実装パターン浸透するサーバーレス 実際に見るユースケースと実装パターン
浸透するサーバーレス 実際に見るユースケースと実装パターン
 
AWS認定12冠制覇への道
AWS認定12冠制覇への道AWS認定12冠制覇への道
AWS認定12冠制覇への道
 
Amazon S3による静的Webサイトホスティング
Amazon S3による静的WebサイトホスティングAmazon S3による静的Webサイトホスティング
Amazon S3による静的Webサイトホスティング
 
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnight
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnightYahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnight
Yahoo! JAPANが持つデータ分析ソリューションの紹介 #yjdsnight
 
もしSIerのエンジニアがSRE本を読んだら
もしSIerのエンジニアがSRE本を読んだらもしSIerのエンジニアがSRE本を読んだら
もしSIerのエンジニアがSRE本を読んだら
 
4つの戦犯から考えるサービスづくりの失敗
4つの戦犯から考えるサービスづくりの失敗4つの戦犯から考えるサービスづくりの失敗
4つの戦犯から考えるサービスづくりの失敗
 
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門
[AKIBA.AWS] AWS Elemental MediaConvertから学ぶコーデック入門
 
글로벌 ITSM시장 트렌드, Global ITSM Market trends
글로벌 ITSM시장 트렌드, Global ITSM Market trends글로벌 ITSM시장 트렌드, Global ITSM Market trends
글로벌 ITSM시장 트렌드, Global ITSM Market trends
 
キーワード駆動によるシステムテストの自動化について 2015
キーワード駆動によるシステムテストの自動化について 2015キーワード駆動によるシステムテストの自動化について 2015
キーワード駆動によるシステムテストの自動化について 2015
 

Viewers also liked

Netflix Global Cloud Architecture
Netflix Global Cloud ArchitectureNetflix Global Cloud Architecture
Netflix Global Cloud ArchitectureAdrian Cockcroft
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elo7
 
DevCommerce Conference 2016: Busca e Data Lake Analytics
DevCommerce Conference 2016: Busca e Data Lake AnalyticsDevCommerce Conference 2016: Busca e Data Lake Analytics
DevCommerce Conference 2016: Busca e Data Lake AnalyticsiMasters
 
NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013aspyker
 
Netflix cloud architecture...continued
Netflix cloud architecture...continuedNetflix cloud architecture...continued
Netflix cloud architecture...continuedCloud Genius
 
Netflix Velocity Conference 2011
Netflix Velocity Conference 2011Netflix Velocity Conference 2011
Netflix Velocity Conference 2011Adrian Cockcroft
 
Devops at Netflix (re:Invent)
Devops at Netflix (re:Invent)Devops at Netflix (re:Invent)
Devops at Netflix (re:Invent)Jeremy Edberg
 
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagens
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagensTalk7 - Da concepção ao desenvolvimento de um aplicativo de mensagens
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagensElo7
 
Back-end de extrema performance
Back-end de extrema performanceBack-end de extrema performance
Back-end de extrema performanceElo7
 
A influência dos usuários no desenvolvimento do Talk7
A influência dos usuários no desenvolvimento do Talk7A influência dos usuários no desenvolvimento do Talk7
A influência dos usuários no desenvolvimento do Talk7Elo7
 
Times de Alta Performance
Times de Alta PerformanceTimes de Alta Performance
Times de Alta PerformanceElo7
 
Big Data Pipeline and Analytics Platform
Big Data Pipeline and Analytics PlatformBig Data Pipeline and Analytics Platform
Big Data Pipeline and Analytics PlatformSudhir Tonse
 
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...Sudhir Tonse
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosElo7
 
Implementando realtime no frontend
Implementando realtime no frontendImplementando realtime no frontend
Implementando realtime no frontendElo7
 
SV Forum Platform Architecture SIG - Netflix Open Source Platform
SV Forum Platform Architecture SIG - Netflix Open Source PlatformSV Forum Platform Architecture SIG - Netflix Open Source Platform
SV Forum Platform Architecture SIG - Netflix Open Source PlatformAdrian Cockcroft
 
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...Amazon Web Services LATAM
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareJustin Basilico
 
Models and Service Layers, Hemoglobin and Hobgoblins
Models and Service Layers, Hemoglobin and HobgoblinsModels and Service Layers, Hemoglobin and Hobgoblins
Models and Service Layers, Hemoglobin and HobgoblinsRoss Tuck
 
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...Adrian Cockcroft
 

Viewers also liked (20)

Netflix Global Cloud Architecture
Netflix Global Cloud ArchitectureNetflix Global Cloud Architecture
Netflix Global Cloud Architecture
 
Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data Elytics - Construindo uma plataforma de big data
Elytics - Construindo uma plataforma de big data
 
DevCommerce Conference 2016: Busca e Data Lake Analytics
DevCommerce Conference 2016: Busca e Data Lake AnalyticsDevCommerce Conference 2016: Busca e Data Lake Analytics
DevCommerce Conference 2016: Busca e Data Lake Analytics
 
NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013NetflixOSS for Triangle Devops Oct 2013
NetflixOSS for Triangle Devops Oct 2013
 
Netflix cloud architecture...continued
Netflix cloud architecture...continuedNetflix cloud architecture...continued
Netflix cloud architecture...continued
 
Netflix Velocity Conference 2011
Netflix Velocity Conference 2011Netflix Velocity Conference 2011
Netflix Velocity Conference 2011
 
Devops at Netflix (re:Invent)
Devops at Netflix (re:Invent)Devops at Netflix (re:Invent)
Devops at Netflix (re:Invent)
 
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagens
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagensTalk7 - Da concepção ao desenvolvimento de um aplicativo de mensagens
Talk7 - Da concepção ao desenvolvimento de um aplicativo de mensagens
 
Back-end de extrema performance
Back-end de extrema performanceBack-end de extrema performance
Back-end de extrema performance
 
A influência dos usuários no desenvolvimento do Talk7
A influência dos usuários no desenvolvimento do Talk7A influência dos usuários no desenvolvimento do Talk7
A influência dos usuários no desenvolvimento do Talk7
 
Times de Alta Performance
Times de Alta PerformanceTimes de Alta Performance
Times de Alta Performance
 
Big Data Pipeline and Analytics Platform
Big Data Pipeline and Analytics PlatformBig Data Pipeline and Analytics Platform
Big Data Pipeline and Analytics Platform
 
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...
Big Data Pipeline and Analytics Platform Using NetflixOSS and Other Open Sour...
 
Desenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntosDesenvolvimento Mobile: Android e iOS caminhando juntos
Desenvolvimento Mobile: Android e iOS caminhando juntos
 
Implementando realtime no frontend
Implementando realtime no frontendImplementando realtime no frontend
Implementando realtime no frontend
 
SV Forum Platform Architecture SIG - Netflix Open Source Platform
SV Forum Platform Architecture SIG - Netflix Open Source PlatformSV Forum Platform Architecture SIG - Netflix Open Source Platform
SV Forum Platform Architecture SIG - Netflix Open Source Platform
 
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...
Como o Magazine Luiza inova suas operações utilizando as soluções de IoT e Bi...
 
Recommendations for Building Machine Learning Software
Recommendations for Building Machine Learning SoftwareRecommendations for Building Machine Learning Software
Recommendations for Building Machine Learning Software
 
Models and Service Layers, Hemoglobin and Hobgoblins
Models and Service Layers, Hemoglobin and HobgoblinsModels and Service Layers, Hemoglobin and Hobgoblins
Models and Service Layers, Hemoglobin and Hobgoblins
 
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
CMG2013 Workshop: Netflix Cloud Native, Capacity, Performance and Cost Optimi...
 

Similar to Netflix Cloud Platform Building Blocks

Windows Azure Platform Overview
Windows Azure Platform OverviewWindows Azure Platform Overview
Windows Azure Platform OverviewRobert MacLean
 
Lap around windows azure
Lap around windows azureLap around windows azure
Lap around windows azureManish Corriea
 
Oracle+golden+gate+introduction
Oracle+golden+gate+introductionOracle+golden+gate+introduction
Oracle+golden+gate+introductionxiakaicd
 
20120614 innovation days - Windows Azure
20120614   innovation days - Windows Azure20120614   innovation days - Windows Azure
20120614 innovation days - Windows AzureRicardo González
 
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure PlatformVitor Tomaz
 
BOI 2011 - Be what's next
BOI 2011 - Be what's nextBOI 2011 - Be what's next
BOI 2011 - Be what's nextTudor Damian
 
Windows Azure Platform - The Color of Cloud Computing
Windows Azure Platform - The Color of Cloud ComputingWindows Azure Platform - The Color of Cloud Computing
Windows Azure Platform - The Color of Cloud ComputingGeorge Kanellopoulos
 
An Introduction To Space Based Architecture
An Introduction To Space Based ArchitectureAn Introduction To Space Based Architecture
An Introduction To Space Based ArchitectureAmin Abbaspour
 
039 keynote mike-culver_-_amazon_web_services
039 keynote mike-culver_-_amazon_web_services039 keynote mike-culver_-_amazon_web_services
039 keynote mike-culver_-_amazon_web_servicesGeneXus
 
Seattle Scalability - GigaSpaces / Cassandra
Seattle Scalability - GigaSpaces / CassandraSeattle Scalability - GigaSpaces / Cassandra
Seattle Scalability - GigaSpaces / Cassandraclive boulton
 
Leverage Azure and SQL Azure to build SaaS applications
Leverage Azure and SQL Azure to build SaaS applications Leverage Azure and SQL Azure to build SaaS applications
Leverage Azure and SQL Azure to build SaaS applications Common Sense
 
Complex Event Processing: What?, Why?, How?
Complex Event Processing: What?, Why?, How?Complex Event Processing: What?, Why?, How?
Complex Event Processing: What?, Why?, How?Fabien Coppens
 
Continuous Deployment Practices, with Production, Test and Development Enviro...
Continuous Deployment Practices, with Production, Test and Development Enviro...Continuous Deployment Practices, with Production, Test and Development Enviro...
Continuous Deployment Practices, with Production, Test and Development Enviro...Amazon Web Services
 
Developing polyglot applications on Cloud Foundry (#oredev 2012)
Developing polyglot applications on Cloud Foundry (#oredev 2012)Developing polyglot applications on Cloud Foundry (#oredev 2012)
Developing polyglot applications on Cloud Foundry (#oredev 2012)Chris Richardson
 
Cloud becomingreality
Cloud becomingrealityCloud becomingreality
Cloud becomingrealityJosh Holmes
 
Developing modular, polyglot applications with Spring (SpringOne India 2012)
Developing modular, polyglot applications with Spring (SpringOne India 2012)Developing modular, polyglot applications with Spring (SpringOne India 2012)
Developing modular, polyglot applications with Spring (SpringOne India 2012)Chris Richardson
 
First Operational Technology (OT) High Performance Messaging Patterns for Ent...
First Operational Technology (OT) High Performance Messaging Patterns for Ent...First Operational Technology (OT) High Performance Messaging Patterns for Ent...
First Operational Technology (OT) High Performance Messaging Patterns for Ent...Real-Time Innovations (RTI)
 
vert.x - asynchronous event-driven web applications on the JVM
vert.x - asynchronous event-driven web applications on the JVMvert.x - asynchronous event-driven web applications on the JVM
vert.x - asynchronous event-driven web applications on the JVMjbandi
 

Similar to Netflix Cloud Platform Building Blocks (20)

Windows Azure Platform Overview
Windows Azure Platform OverviewWindows Azure Platform Overview
Windows Azure Platform Overview
 
Lap around windows azure
Lap around windows azureLap around windows azure
Lap around windows azure
 
Oracle+golden+gate+introduction
Oracle+golden+gate+introductionOracle+golden+gate+introduction
Oracle+golden+gate+introduction
 
Windows Azure Overview
Windows Azure OverviewWindows Azure Overview
Windows Azure Overview
 
20120614 innovation days - Windows Azure
20120614   innovation days - Windows Azure20120614   innovation days - Windows Azure
20120614 innovation days - Windows Azure
 
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform
[.Net Juniors Academy] Introdução ao Cloud Computing e Windows Azure Platform
 
BOI 2011 - Be what's next
BOI 2011 - Be what's nextBOI 2011 - Be what's next
BOI 2011 - Be what's next
 
Windows Azure Platform - The Color of Cloud Computing
Windows Azure Platform - The Color of Cloud ComputingWindows Azure Platform - The Color of Cloud Computing
Windows Azure Platform - The Color of Cloud Computing
 
An Introduction To Space Based Architecture
An Introduction To Space Based ArchitectureAn Introduction To Space Based Architecture
An Introduction To Space Based Architecture
 
039 keynote mike-culver_-_amazon_web_services
039 keynote mike-culver_-_amazon_web_services039 keynote mike-culver_-_amazon_web_services
039 keynote mike-culver_-_amazon_web_services
 
Seattle Scalability - GigaSpaces / Cassandra
Seattle Scalability - GigaSpaces / CassandraSeattle Scalability - GigaSpaces / Cassandra
Seattle Scalability - GigaSpaces / Cassandra
 
Leverage Azure and SQL Azure to build SaaS applications
Leverage Azure and SQL Azure to build SaaS applications Leverage Azure and SQL Azure to build SaaS applications
Leverage Azure and SQL Azure to build SaaS applications
 
Complex Event Processing: What?, Why?, How?
Complex Event Processing: What?, Why?, How?Complex Event Processing: What?, Why?, How?
Complex Event Processing: What?, Why?, How?
 
Continuous Deployment Practices, with Production, Test and Development Enviro...
Continuous Deployment Practices, with Production, Test and Development Enviro...Continuous Deployment Practices, with Production, Test and Development Enviro...
Continuous Deployment Practices, with Production, Test and Development Enviro...
 
Developing polyglot applications on Cloud Foundry (#oredev 2012)
Developing polyglot applications on Cloud Foundry (#oredev 2012)Developing polyglot applications on Cloud Foundry (#oredev 2012)
Developing polyglot applications on Cloud Foundry (#oredev 2012)
 
Cloud becomingreality
Cloud becomingrealityCloud becomingreality
Cloud becomingreality
 
Developing modular, polyglot applications with Spring (SpringOne India 2012)
Developing modular, polyglot applications with Spring (SpringOne India 2012)Developing modular, polyglot applications with Spring (SpringOne India 2012)
Developing modular, polyglot applications with Spring (SpringOne India 2012)
 
First Operational Technology (OT) High Performance Messaging Patterns for Ent...
First Operational Technology (OT) High Performance Messaging Patterns for Ent...First Operational Technology (OT) High Performance Messaging Patterns for Ent...
First Operational Technology (OT) High Performance Messaging Patterns for Ent...
 
vert.x - asynchronous event-driven web applications on the JVM
vert.x - asynchronous event-driven web applications on the JVMvert.x - asynchronous event-driven web applications on the JVM
vert.x - asynchronous event-driven web applications on the JVM
 
Vert.x
Vert.xVert.x
Vert.x
 

More from Sudhir Tonse

Big Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at UberBig Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at UberSudhir Tonse
 
ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017Sudhir Tonse
 
Stream Computing & Analytics at Uber
Stream Computing & Analytics at UberStream Computing & Analytics at Uber
Stream Computing & Analytics at UberSudhir Tonse
 
Pros and Cons of a MicroServices Architecture talk at AWS ReInvent
Pros and Cons of a MicroServices Architecture talk at AWS ReInventPros and Cons of a MicroServices Architecture talk at AWS ReInvent
Pros and Cons of a MicroServices Architecture talk at AWS ReInventSudhir Tonse
 
MicroServices at Netflix - challenges of scale
MicroServices at Netflix - challenges of scaleMicroServices at Netflix - challenges of scale
MicroServices at Netflix - challenges of scaleSudhir Tonse
 
Architecting for the Cloud using NetflixOSS - Codemash Workshop
Architecting for the Cloud using NetflixOSS - Codemash WorkshopArchitecting for the Cloud using NetflixOSS - Codemash Workshop
Architecting for the Cloud using NetflixOSS - Codemash WorkshopSudhir Tonse
 
Web Scale Applications using NeflixOSS Cloud Platform
Web Scale Applications using NeflixOSS Cloud PlatformWeb Scale Applications using NeflixOSS Cloud Platform
Web Scale Applications using NeflixOSS Cloud PlatformSudhir Tonse
 

More from Sudhir Tonse (7)

Big Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at UberBig Data Pipelines and Machine Learning at Uber
Big Data Pipelines and Machine Learning at Uber
 
ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017ML and Data Science at Uber - GITPro talk 2017
ML and Data Science at Uber - GITPro talk 2017
 
Stream Computing & Analytics at Uber
Stream Computing & Analytics at UberStream Computing & Analytics at Uber
Stream Computing & Analytics at Uber
 
Pros and Cons of a MicroServices Architecture talk at AWS ReInvent
Pros and Cons of a MicroServices Architecture talk at AWS ReInventPros and Cons of a MicroServices Architecture talk at AWS ReInvent
Pros and Cons of a MicroServices Architecture talk at AWS ReInvent
 
MicroServices at Netflix - challenges of scale
MicroServices at Netflix - challenges of scaleMicroServices at Netflix - challenges of scale
MicroServices at Netflix - challenges of scale
 
Architecting for the Cloud using NetflixOSS - Codemash Workshop
Architecting for the Cloud using NetflixOSS - Codemash WorkshopArchitecting for the Cloud using NetflixOSS - Codemash Workshop
Architecting for the Cloud using NetflixOSS - Codemash Workshop
 
Web Scale Applications using NeflixOSS Cloud Platform
Web Scale Applications using NeflixOSS Cloud PlatformWeb Scale Applications using NeflixOSS Cloud Platform
Web Scale Applications using NeflixOSS Cloud Platform
 

Recently uploaded

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rick Flair
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????blackmambaettijean
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 

Recently uploaded (20)

unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...Rise of the Machines: Known As Drones...
Rise of the Machines: Known As Drones...
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
What is Artificial Intelligence?????????
What is Artificial Intelligence?????????What is Artificial Intelligence?????????
What is Artificial Intelligence?????????
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 

Netflix Cloud Platform Building Blocks

  • 1. Ne#lix  Cloud  Pla#orm   Building  Blocks   Architectural  Building  Blocks   On     Amazon  Public  Cloud   Sudhir  Tonse              (@stonse)   #gitpronflx  
  • 3. What  is  Ne#lix?   With  more  than  26  million  streaming   members  in  the  United  States,  Canada,   LaEn  America,  the  United  Kingdom  and   Ireland,  Ne#lix,  Inc.  (NASDAQ:  NFLX)  is   the  world's  leading  internet  subscripEon   service  for  enjoying  movies  and  TV   programs.     …   In  all,  more  than  800  devices  that  stream   from  Ne#lix  are  available.     (hp://ir.netflix.com)   hp://bit.ly/LWST5w    
  • 4. Who  Am  I   •  In  the  Movie  Business  J   –  Manager,  Cloud  Pla#orm/  Infrastructure  @  Ne#lix   –  @  Ne#lix  since  2008   –  Prior  day  jobs   •  System  Architect/Lead  @  AOL  (Netscape,  iPlanet,  Sun)   •             @stonse   •             hp://www.linkedin.com/in/sudhirtonse   Important:  This  talk  is  a  developer  community  outreach  by  me  as  an  individual  and     the  content  here  may  or  may  not  reflect  Ne#lix’s  official  view.  
  • 5. Why  am  I  here?   •  Share  the  Story  of  Ne,lix  and  its  use  of  the   Amazon  Cloud   –  Why  did  Ne#lix  move  to  the  Cloud?   –  How  did  we  move?   –  What  did  we  learn?   •  Share  Technical  Challenges  and  SoluEons   –  Contribute  back  to  the  community   •  Perhaps  Interest  you  in  Helping  us  Reach  the   Next  Steps   –  Yes,  I  am  Hiring!  
  • 6. What  is  in  it  for  You?   •  Various  Open  Source  Offerings   •  Tech  papers   •  Blogs  &  ArEcles   •  Meetups  and  Talks  like  this  J  
  • 7. What’s  in  it  for  Ne#lix?   bird  in  a  Big  Cloud   •  Small   •  Tech  Community  Engagement   •  Open  Source  ContribuEons  
  • 9. Cloud   •  What  is  it?   •  Why  Cloud?  
  • 10. What’s  a  Cloud?   •  Cloud:   Cloud  compu<ng  is  the  delivery  of   compuEng  and  storage  capacity  [1]  as  a  service   [2]  to  a  heterogeneous  community  of  end-­‐ recipients.   Images  Courtesy:  Wikipedia/Company  logos  
  • 11. Cloud  Stack   Clients   Browsers,  Mobile,  Televisions  …     SaaS/ApplicaEons   Ne#lix     Apps/Services   PaaS   Ne#lix   ExecuEon  Env  (JVM),  Web/App  Servers,  Frameworks,  Tools   Cloud  Pla#orm   IaaS   Virtual  Machines,  Networking,  Load  Balancers  …      
  • 12. Ne#lix  Cloud  Pla#orm   •  PaaS  Building  Blocks  on  top  of  Amazon’s  IaaS  
  • 13. Pla#orm  Blocks   InternalizaEon/ App  Infrastructure   Messaging   L10N/Geo   Server   Client   Security   Big  Data     Tools/Frameworks   Design/Architecture   ConfiguraEon   DiagnosEcs   Management  
  • 14. Web  Scale   •  Billions  of  Requests  per  day   •  Terabytes  of  data  per  day   •  Millions  of  Metric  data  points  per  day   •  Hundreds  of  services    
  • 16. Why  Cloud  contd  …   •  UndifferenEated  Heavy  Lising   –  MulE  Region   –  On  Demand  CompuEng  Power   –  Tons  of  Features  J  
  • 17. On  Demand  Auto  Scaling!   •  Traffic  Paerns   Compute   Compute   •  Scale  UP  &  Down   based  on  Demand   –  Use  CloudWatch   Time   •  RPS   Time   •  Load  Average   Slow  Growth   Periodic  Jobs   •  …   Compute   Compute   Compute   Time   Time   Time   Predictable  Bursts   Unpredictable  Bursts   Steady  State  
  • 18. Instance   Instance   Instance   Instance   Instance   Instance   Instance   Instance   Scale  Up   Instance   Instance   Instance   Instance   Instance   Scale  Up  
  • 19. Scale  Down   Instance   Instance   Instance   Instance   Instance   Instance   Instance  
  • 21. DataCenter  to  Cloud  Timeline  
  • 22. DC  to  SOA   •  Old  DataCenter   •  Ne#lix  Cloud   (2008)   (2012)   •  Everything  in  one   •  100s  of  Fine   WebApp  (.war)     Grained  Services  
  • 23. Old  Lessons   •  One  missing  Semi  Colon  can  bring  your  site   down!!!   –  Runaway  Thread     •  Lessons   –  Async  execuEon   –  Timed  gets()    (i.e.  use  java  Future)  
  • 24. Deployment  Concepts   ApplicaEon   Cluster  1   Cluster  2   Cluster  n   ASG  1   ASG  2   ASG  1   Instances   Instances   Instances  
  • 27. What’s  in  a  name?   •  Cloud  instances  are  ephemeral   –  They  have  no  fixed  NAME   –  The  have  a  public  IP  address,  a  private  IP  address   and  can  opEonally  be  associated  with  an  ElasEc  IP   Address   –  How  can  you  address  your  services?   •  Via  ElasEc  IP  (but  these  are  limited  per  account)   •  Route  53  (A  DNS  service  offered  by  Amazon)   •  Ne#lix  uses  in-­‐house  app  called  Discovery  Service   –  Keeper  of  addresses  and  metadata  of  running  instances   Shakespeare  
  • 28. Inter  Process  CommunicaEon   •  Ne#lix  uses  NIWS   –  Ne#lix  Internal  Web  Services   –  Common  infrastructural  library  that  aids  in  RPC   •  Based  on  JSR-­‐311  (Jersey)   •  Uses  Discovery  Service  to  obtain  instances  of  every   service   •  Has  an  in  built  Mid  Tier  s/w  LoadBalancer   Sudhir  Tonse  
  • 29. BiopSys   Danny  Yuan   •  Search  Logs  on  1000s  of  Amazon  Instances   –  Per  Cluster,  Apps,  Instances,  Time  Range  etc.  
  • 30. S3  DiagnosEcs   •  Help  Debug  S3  Latency  if  any     Sudhir  Tonse  
  • 31. Metrics   •  One  cannot  fully  Understand  what  One  cannot   Observe  J   •  Ne#lix  Pla#orm  has  several  Metrics/Data   CollecEon  components   –  Servo  (  @Monitors)   –  Tracers/Counters   –  Chukwa  (for  Log  Events  and  Business  Metrics)   –  More  J  
  • 32. Metrics   @royrapaport  
  • 33. Cassandra  Dashboard   •  Visualize  Status  of  MulEple  Cassandra  Clusters   Eran  Landau  
  • 34. Lessons  Learned   •  Roman  Riding  is  hard   –  e.g.  sharing  traffic  between  Datacenter  (SQL)  and   Cloud  (NoSQL)   •  Plan  for  Failure   –  Test  for  Failure  (Chaos  Monkey  &  Simian  Army)  
  • 35. Commandments  of  Web  Scale     Cloud  Deployment  
  • 36.
  • 37. Cloud  Commandments   1.  Thou shalt not have Sticky in-memory sessions –  Hard to Scale 2.  Thou shalt not direclty use a Central SQL database in the user request path –  Atleast not one that uses locks and transactions 3.  Thou shalt not store important data on ephemeral instances –  These are lost when instances go down. Use EBS volumes, S3 or other persistence stores 4.  Thou shalt embrace a homogenous architecture –  Much easier to achieve operational efficiency 5.  Thou shalt understand and embrace the CAP theorem –  Choice between CP and AP. Most web scale deployments choose AP 6.  Thou shalt gaurd all external calls using the Dependency Command Pattern –  Idea is to effectively gaurd user request procesing threads 7.  Thou shalt be prepared to scale according to thy needs –  Web traffic can come in bursts, its important to scale up/down the whole SOA stack based on resources needed
  • 38. Cloud  Commandments  contd…   8.  Thou shalt keep a wary eye on thy cost –  It all adds up eventually. Plenty of low hanging fruits avaialble to save costs 9.  Thou shalt secure thy data and instances –  Encrypt data; secure access to instances. (Pay attention to Security Groups) 10. Thou shalt instrument thy code –  You cant trust what you cant see 11. Thou shalt effectively monitor thy access points –  Its the cloud and things can go wrong or go reaaaal slooow 12. Thou shalt deploy thy instances in multiple regions and zones –  For maximizing SLAs and availability 13. Thou shalt be wary of SPOF –  Mantra of distributed system design 14. Thou shalt always plan for failure –  Its just a question of when, not if. Have a good backup plan
  • 39. Concepts   •  Throling/Metering   –  Thundering  Herd  (Retry  Storms)   •  Graceful  DegradaEon   –  Appropriate  Fallbacks  
  • 40. Metering   •  Protocol  Level   –  NIWS  features   •  Client  Side  Guards   •  Service  Side  Metering   •  Client  API  level   –  Dependency  Command  Paern  
  • 41. Dependency  Command  Paern   hp://techblog.ne#lix.com/search/label/fault-­‐tolerance     Ben  Christensen  
  • 42. Dependency  Command     Effect  of  Latency  …  
  • 44. Dependency  Command   •  network  Emeouts  and  retries   •  separate  threads  on  per-­‐dependency  thread   pools   •  semaphores  (via  a   tryAcquire,  not  a  blocking  call)   •  circuit  breakers  
  • 45. Failures   •  Failures  will  happen   –  It’s  a  quesEon  of  when  and  how  NOT  “if”   –  Plan   •  Regularly  Test  for  possible  Failures   –  Ne#lix  Simian  Army:  e.g.  Chaos  Monkey,  Latency  Monkey  …   •  Severity   –  Minimize  the  impact  of  a  failure   •  Occurrence   –  Minimize  the  frequency  of  a  failure   •  Observability   –  Minimize  the  Eme  to  detect  and  respond  
  • 46. Simian  Army   Chaos  Monkey     •  Simulates  hard  failures  in  AWS  by  killing  a  few   instances  per  ASG  (e.g.  Auto  Scale  Group)     •  Similar  to  how  EC2  instances  can  be  killed  by   AWS  with  lile  warning     •  Tests  clientsʼ  ability  to  gracefully  deal  with   broken  connecEons,  interrupted  calls,  etc...     •  Verifies  that  all  services  are  running  within  the   protecEon  of  AWS  Auto  Scale  Groups,  which   reincarnates  killed  instances   •  If  not,  the  Chaos  monkey  will  win!     Conformity  Monkey  .     •  Verifies  that  all  services  are  running   within  the  protecEon  of  AWS  Auto  Scale   Groups,  which  reincarnates  killed   instances   •  If  not,  app/service  team  is  noEfied  
  • 47. Simian  Army  …   Latency  Monkey     •  Simulates  sos  failures  -­‐-­‐  i.e.  a  service   gets  slower     •  Injects  random  delays  in  NIWS  (client-­‐ side)  or  Server  (server-­‐side)  of  a  client-­‐ server  interacEon   •  Tests  the  ability  of  applicaEons  to  detect   and  recover  (i.e.  Graceful  DegradaEon)   from  the  harder  problem  of  delays,  that   leads  to  thundering  herd  and  Emeouts     Other  Monkeys     •  Security  Monkey   Chaos  Gorilla   •  Janitor  Monkey   •  Simulates  Zone  Outage   •  Efficiency  Monkey   •  ..  more    
  • 48. Building  Redundancy  and  Availability   •  Deploy  in  mulEple  zone  and  consider  mulEple  regions   •  Pay  aenEon  to  various  modes  of  failures  
  • 49. Three  Balanced  Availability  Zones     Load  Balancers   Zone  A   Zone  B   Zone  C   Persistence  Store   Persistence  Store   Persistence  Store   Courtesy  @adrianco  
  • 50. Triple  Replicated  Persistence     Load  Balancers   Zone  A   Zone  B   Zone  C   Persistence  Store   Persistence  Store   Persistence  Store  
  • 51. Isolated  Regions   US-­‐East  Load  Balancers   EU-­‐West  Load  Balancers   Zone  A   Zone  B   Zone  C   Zone  A   Zone  B   Zone  C   Persistence  Store   Persistence  Store   Persistence  Store   Cassandra  Replicas   Cassandra  Replicas   Cassandra  Replicas  
  • 52. Cassandra  Global  Ring   Reference:  hp://techblog.ne#lix.com    
  • 54. Tips    Guidelines  contd  …   •  Amazon  CloudWatch   –  Is  your  friend!  Ne#lix  Servo  ( hp://github.com/ne#lix/servo)  helps  you  publish   metrics  to  CloudWatch   •  ELB   –  Always  keep  your  Zones  Balanced!   –  Healthcheck  URLs  are  important     •  Auto  Scaling  Groups   –  This  is  an  amazing  feature  that  can  really  save  you  $$ $s  and  help  you  run  more  efficiently.  Read   hp://bit.ly/NgwS0K  
  • 55. Tips    Guidelines  contd  …   •  Keep  acEve  track  of  Usage  Costs   –  Usage  costs  can  surprise  you!   –  Ne#lix  has  an  internal  tool  which  we  may  open   source.  Watch  @Ne#lixOSS   •  Reserve  Instances   –  ReservaEon  can  save  you  $$$s  (upto  71%  !!)   (YMMV)   –  Guarantees  availability  when  you  need  it  
  • 56. Tips/Guidelines   •  S3  Best  PracEces   –  Amazon  doc:  hp://bit.ly/MW93xj   –  Know  when  to  use  Regional  S3  Endpoints   •  Important  when  your  dev/test  team  and  deployments  are  in   different  regions   –  Use  Smart  Bucket/Key  naming   •  Use  3  to  63  characters.   •  Use  only  lower  case  leers  (at  least  one),  numbers,  '.'  and  '-­‐'.   •  Don't  start  or  end  the  bucket  name  with  '.'  and  don't  follow  or   precede  a  '.'  with  a  '-­‐'.   –  Compress  Data   –  Use  TTLs     –  Many  more  …    
  • 58. Open  Source     •  @Ne#lixOSS   •  hp://ne#lix.github.com/   •  Built  for  the  CLOUD  
  • 59. How  can  you  benefit?  
  • 61. ConfiguraEon  Management   •  Archaius  (ProperEes  Management)   •  More   Coming   Soon  …  
  • 62. NoSQL  Persistence   Cassandra  based  offerings   •  Priam  (Token  Management)   •  Astyanax  (Cassandra  Client)   •  Jmeter  plugin  for  Load  Tests  
  • 63. Technical  Knowledge  Sharing   •  hp://techblog.ne#lix.com   –  Cloud  Usage   –  PersonalizaEon    RecommendaEons   –  Hadoop  and  Big  Data  papers   –  CDN  (Content  Delivery  Networks)   –  General  Architectural  Guidelines   –  Performance    Scalability   •  Slideshare   –  hp://www.slideshare.net/search/slideshow? searchfrom=headerq=Ne#lix  
  • 64. New  Challenges   •  More  Global  Expansion   •  Real  Time  Data  Infrastructure   •  March  towards  Connuous  IntegraEon  and   Deployment  
  • 65. Ne#lix   •  Freedom  and  Responsibility   –  Empower  engineers   –  #DevOps   –  Context  not  Control  
  • 66. Want  to  Join  us?   hp://jobs.ne#lix.com/jobs.html    
  • 67. Credits   Adrian  Cockros    (@adrianco),     Ruslan  Meshenberg  (@rusmeshenberg),  Yury  Izrailevsky,     Joe  Sondow  (@joesondow),  Ben  Christensen  (@benchristensen),   Jordan  Zimmerman  (@randgalt),  Ariel  Tseltlin  (@atseitlin),     Allen  Wang,   Eran  Landau,  Danny  Yuan,    Pradeep  Kamath     And       Members  of  the  Ne#lix  Cloud  Pla#orm  Team    
  • 68. Q    A   •                 @stonse  
  • 69. Amazon Cloud Terminology Reference See http://aws.amazon.com/ This is not a full list of Amazon Web Service features (courtesy @adrianco) •  AWS  –  Amazon  Web  Services  (common  name  for  Amazon  cloud)   •  AMI  –  Amazon  Machine  Image  (archived  boot  disk,  Linux,  Windows  etc.  plus  applicaEon  code)   •  EC2  –  ElasEc  Compute  Cloud   –  Range  of  virtual  machine  types  m1,  m2,  c1,  cc,  cg.  Varying  memory,  CPU  and  disk  configuraEons.   –  Instance  –  a  running  computer  system.  Ephemeral,  when  it  is  de-­‐allocated  nothing  is  kept.   –  Reserved  Instances  –  pre-­‐paid  to  reduce  cost  for  long  term  usage   –  Availability  Zone  –  datacenter  with  own  power  and  cooling  hosEng  cloud  instances   –  Region  –  group  of  Availability  Zones  –  US-­‐East,  US-­‐West,  EU-­‐Eire,  Asia-­‐Singapore,  Asia-­‐Japan   •  ASG  –  Auto  Scaling  Group  (instances  booEng  from  the  same  AMI)   •  S3  –  Simple  Storage  Service  (hp  access)   •  EBS  –  ElasEc  Block  Storage  (network  disk  filesystem  can  be  mounted  on  an  instance)   •  RDS  –  RelaEonal  Database  Service  (managed  MySQL  master  and  slaves)   •  SDB  –  Simple  Data  Base  (hosted  hp  based  NoSQL  data  store)   •  SQS  –  Simple  Queue  Service  (hp  based  message  queue)   •  SNS  –  Simple  NoEficaEon  Service  (hp  and  email  based  topics  and  messages)   •  EMR  –  ElasEc  Map  Reduce  (automaEcally  managed  Hadoop  cluster)   •  ELB  –  ElasEc  Load  Balancer   •  EIP  –  ElasEc  IP  (stable  IP  address  mapping  assigned  to  instance  or  ELB)   •  VPC  –  Virtual  Private  Cloud  (extension  of  enterprise  datacenter  network  into  cloud)   •  IAM  –  IdenEty  and  Access  Management  (fine  grain  role  based  security  keys)