
Cloudera Administration PDF

481 Pages·2017·8.82 MB·English


Cloudera Administration

Important Notice

© 2010-2021 Cloudera, Inc. All rights reserved.

Cloudera, the Cloudera logo, and any other product or service names or slogans contained in this document are trademarks of Cloudera and its suppliers or licensors, and may not be copied, imitated or used, in whole or in part, without the prior written permission of Cloudera or the applicable trademark holder. If this documentation includes code, including but not limited to, code examples, Cloudera makes this available to you under the terms of the Apache License, Version 2.0, including any required notices. A copy of the Apache License Version 2.0, including any notices, is included herein. A copy of the Apache License Version 2.0 can also be found here: https://opensource.org/licenses/Apache-2.0

Hadoop and the Hadoop elephant logo are trademarks of the Apache Software Foundation. All other trademarks, registered trademarks, product names and company names or logos mentioned in this document are the property of their respective owners. Reference to any products, services, processes or other information, by trade name, trademark, manufacturer, supplier or otherwise does not constitute or imply endorsement, sponsorship or recommendation thereof by us.

Complying with all applicable copyright laws is the responsibility of the user. Without limiting the rights under copyright, no part of this document may be reproduced, stored in or introduced into a retrieval system, or transmitted in any form or by any means (electronic, mechanical, photocopying, recording, or otherwise), or for any purpose, without the express written permission of Cloudera.

Cloudera may have patents, patent applications, trademarks, copyrights, or other intellectual property rights covering subject matter in this document. Except as expressly provided in any written license agreement from Cloudera, the furnishing of this document does not give you any license to these patents, trademarks, copyrights, or other intellectual property. For information about patents covering Cloudera products, see http://tiny.cloudera.com/patents.
The information in this document is subject to change without notice. Cloudera shall not be liable for any damages resulting from technical errors or omissions which may be present in this document, or from use of this document.

Cloudera, Inc.
395 Page Mill Road
Palo Alto, CA 94306
[email protected]
US: 1-888-789-1488
Intl: 1-650-362-0488
www.cloudera.com

Release Information
Version: Cloudera Enterprise 5.5.x
Date: February 3, 2021

Table of Contents

About Cloudera Administration
Managing CDH and Managed Services
  Managing CDH and Managed Services Using Cloudera Manager
    Configuration Overview
    Managing Clusters
    Managing Services
    Managing Roles
    Managing Hosts
    Maintenance Mode
  Managing CDH Using the Command Line
    Starting CDH Services Using the Command Line
    Stopping CDH Services Using the Command Line
    Migrating Data between Clusters Using distcp
  Managing Individual Services
    Managing Flume
    Managing the HBase Service
    Managing HDFS
    Managing Hive
    Managing Hue
    Managing Impala
    Managing Key-Value Store Indexer
    Managing MapReduce and YARN
    Managing Oozie
    Managing Solr
    Managing Spark
    Managing the Sqoop 1 Client
    Managing Sqoop 2
    Managing ZooKeeper
    Configuring Services to Use the GPL Extras Parcel
Resource Management
  Schedulers
  Cloudera Manager Resource Management
  Linux Control Groups (cgroups)
    Resource Management with Control Groups
    Configuring Resource Parameters
  Static Service Pools
  Dynamic Resource Pools
    Managing Dynamic Resource Pools
    YARN Pool Status and Configuration Options
    Assigning Applications and Queries to Resource Pools
    Configuration Sets
    Scheduling Rules
  Managing Impala Admission Control
  Managing the Impala Llama ApplicationMaster
    Enabling Integrated Resource Management Using Cloudera Manager
    Disabling Integrated Resource Management Using Cloudera Manager
    Configuring Llama Using Cloudera Manager
  Impala Resource Management
    Admission Control and Query Queuing
    Integrated Resource Management with YARN
Performance Management
  Optimizing Performance in CDH
  Choosing a Data Compression Format
  Tuning the Solr Server
    Tuning to Complete During Setup
    General Tuning
    Other Resources
  Tuning Spark Applications
  Tuning YARN
    Overview
    Cluster Configuration
    YARN Configuration
    MapReduce Configuration
    Step 7: MapReduce Configuration
    Step 7A: MapReduce Sanity Checking
    Configuring Your Cluster In Cloudera Manager
High Availability
  HDFS High Availability
    Introduction to HDFS High Availability
    Configuring Hardware for HDFS HA
    Enabling HDFS HA
    Disabling and Redeploying HDFS HA
    Configuring Other CDH Components to Use HDFS HA
    Administering an HDFS High Availability Cluster
    Changing a Nameservice Name for Highly Available HDFS Using Cloudera Manager
  MapReduce (MRv1) and YARN (MRv2) High Availability
    YARN (MRv2) ResourceManager High Availability
    Work Preserving Recovery for YARN Components
    MapReduce (MRv1) JobTracker High Availability
  Cloudera Navigator Key Trustee Server High Availability
    Configuring Key Trustee Server High Availability Using Cloudera Manager
    Configuring Key Trustee Server High Availability Using the Command Line
    Recovering a Key Trustee Server
  Key Trustee KMS High Availability
  High Availability for Other CDH Components
    HBase High Availability
    Hive Metastore High Availability
    Hue High Availability
    Llama High Availability
    Configuring Oozie for High Availability
    Search High Availability
  Configuring Cloudera Manager for High Availability With a Load Balancer
    Introduction to Cloudera Manager Deployment Architecture
    Prerequisites for Setting up Cloudera Manager High Availability
    High-Level Steps to Configure Cloudera Manager High Availability
    Database High Availability Configuration
    TLS and Kerberos Configuration for Cloudera Manager High Availability
Backup and Disaster Recovery
  Port Requirements for Backup and Disaster Recovery
  Data Replication
    Designating a Replication Source
    HDFS Replication
    Hive Replication
    Impala Metadata Replication
    Using Snapshots with Replication
    Enabling Replication Between Clusters in Different Kerberos Realms
    Replication of Encrypted Data
    HBase Replication
  Snapshots
    Cloudera Manager Snapshot Policies
    Managing HBase Snapshots
    Managing HDFS Snapshots
Cloudera Manager Administration
  Starting, Stopping, and Restarting the Cloudera Manager Server
  Configuring Cloudera Manager Server Ports
  Moving the Cloudera Manager Server to a New Host
  Managing the Cloudera Manager Server Log
    Viewing the Log
    Setting the Cloudera Manager Server Log Location
  Cloudera Manager Agents
    Starting, Stopping, and Restarting Cloudera Manager Agents
    Configuring Cloudera Manager Agents
    Managing Cloudera Manager Agent Logs
  Changing Hostnames
  Configuring Network Settings
  Alerts
    Managing Alerts
  Managing Licenses
  Sending Usage and Diagnostic Data to Cloudera
    Configuring a Proxy Server
    Managing Anonymous Usage Data Collection
    Managing Hue Analytics Data Collection
    Diagnostic Data Collection
  Exporting and Importing Cloudera Manager Configuration
  Backing up Cloudera Manager
    Backing up Databases
  Other Cloudera Manager Tasks and Settings
    Settings
    Alerts
    Users
    Kerberos
    License
    User Interface Language
    Peers
  Cloudera Management Service
Cloudera Navigator Data Management Component Administration
  Cloudera Navigator Audit Server
    Publishing Audit Events
  Cloudera Navigator Metadata Server
    Managing Metadata Extraction
    Managing Metadata Policies
Appendix: Apache License, Version 2.0

About Cloudera Administration

This guide describes how to configure and administer a Cloudera deployment. Administrators manage resources, availability, and backup and recovery configurations. In addition, this guide shows how to implement high availability, and discusses integration.
Managing CDH and Managed Services

If you use Cloudera Manager to manage your cluster, configuring and managing your cluster, as well as individual services and hosts, uses a different paradigm than if you use CDH without Cloudera Manager. For this reason, many of these configuration tasks offer two different subtasks: one for clusters managed by Cloudera Manager and one for clusters that do not use Cloudera Manager. Often, the tasks are not interchangeable. For instance, if you use Cloudera Manager you cannot use standard Hadoop command-line utilities to start and stop services. Instead, you use Cloudera Manager to perform these tasks.

Managing CDH and Managed Services Using Cloudera Manager

You manage CDH and managed services using the Cloudera Manager Admin Console and Cloudera Manager API. The following sections focus on the Cloudera Manager Admin Console.

Configuration Overview

When Cloudera Manager configures a service, it allocates roles that are required for that service to the hosts in your cluster. The role determines which service daemons run on a host.

For example, for an HDFS service instance, Cloudera Manager configures:
• One host to run the NameNode role.
• One host to run the secondary NameNode role.
• One host to run the Balancer role.
• Remaining hosts to run DataNode roles.

A role group is a set of configuration properties for a role type, as well as a list of role instances associated with that group. Cloudera Manager automatically creates a default role group named RoleType Default Group for each role type.

When you run the installation or upgrade wizard, Cloudera Manager configures the default role groups it adds, and adds any other required role groups for a given role type. For example, a DataNode role on the same host as the NameNode might require a different configuration than DataNode roles running on other hosts. Cloudera Manager creates a separate role group for the DataNode role running on the NameNode host and uses the default configuration for DataNode roles running on other hosts.
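The role allocation and default role group naming described above can be pictured with a small Python model. This is only an illustrative sketch: the function names, the host names, and the "first three hosts get the master roles" placement rule are assumptions made for the example, not Cloudera Manager's actual placement logic or API.

```python
from collections import defaultdict


def assign_hdfs_roles(hosts):
    """Toy allocation mirroring the bullet list above.

    Assumption: the first three hosts receive the NameNode, secondary
    NameNode, and Balancer roles. Cloudera Manager's real placement
    logic is not described in this guide excerpt.
    """
    if len(hosts) < 4:
        raise ValueError("this sketch expects at least four hosts")
    assignment = {
        hosts[0]: ["NameNode"],
        hosts[1]: ["SecondaryNameNode"],
        hosts[2]: ["Balancer"],
    }
    # Every remaining host runs a DataNode role.
    for host in hosts[3:]:
        assignment[host] = ["DataNode"]
    return assignment


def default_role_groups(assignment):
    """Group role instances by role type under '<RoleType> Default Group'."""
    groups = defaultdict(list)
    for host, roles in assignment.items():
        for role in roles:
            groups[f"{role} Default Group"].append(host)
    return dict(groups)


if __name__ == "__main__":
    # Hypothetical host names, used only for illustration.
    roles = assign_hdfs_roles(["h1", "h2", "h3", "h4", "h5"])
    for group, members in default_role_groups(roles).items():
        print(group, "->", members)
```

In this model a role group is just a name plus its member role instances; a real role group also carries the configuration properties shared by those instances.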
Cloudera Manager wizards autoconfigure role group properties based on the resources available on the hosts. For properties that are not dependent on host resources, Cloudera Manager default values typically align with CDH default values for that configuration. Cloudera Manager deviates when the CDH default is not a recommended configuration or when the default values are illegal.

Cloudera Manager Configuration Layout

After running the Installation wizard, use Cloudera Manager to reconfigure the existing services and add and configure additional hosts and services.

Cloudera Manager configuration screens offer two layout options: new (the default) and classic. You can switch between layouts using the Switch to XXX layout link at the top right of the page. Keep the following in mind when you select a layout:
• If you switch to the classic layout, Cloudera Manager preserves that setting when you upgrade to a new version.
• Selections made in one layout are not preserved when you switch.
• Certain features, including controls for configuring Navigator audit events and HDFS log redaction, are supported only in the new layout.

New layout pages contain controls that allow you to filter configuration properties based on configuration status, category, and group. For example, to display the JournalNode maximum log size property (JournalNode Max Log Size), click the CATEGORY > JournalNode and GROUP > Logs filters.

When a configuration property has been set to a value different from the default, a reset to default value icon displays.

Classic layout pages are organized by role group and categories within the role group. For example, to display the JournalNode maximum log size property (JournalNode Max Log Size), select JournalNode Default Group > Logs.

When a configuration property has been set to a value different from the default, a Reset to the default value link displays.

There is no mechanism for resetting to an autoconfigured value. However, you can use the configuration history and rollback feature to revert any configuration changes.
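The filtering and reset-to-default behavior described above can be sketched as a small Python model. Everything here is hypothetical and for illustration only: the ConfigProperty class, the filter_properties function, and the numeric values are assumptions, not part of Cloudera Manager or its API.

```python
from dataclasses import dataclass
from typing import Any, List, Optional


@dataclass
class ConfigProperty:
    name: str
    category: str
    group: str
    default: Any
    value: Optional[Any] = None  # None means "no override set"

    @property
    def is_overridden(self) -> bool:
        # Mirrors the condition under which the reset control is shown:
        # the property has been set to a value different from the default.
        return self.value is not None and self.value != self.default

    @property
    def effective(self) -> Any:
        return self.default if self.value is None else self.value

    def reset_to_default(self) -> None:
        # Reverts to the default value, not to an autoconfigured value.
        self.value = None


def filter_properties(props: List[ConfigProperty],
                      category: Optional[str] = None,
                      group: Optional[str] = None) -> List[ConfigProperty]:
    """Keep properties matching every filter that was supplied."""
    return [
        p for p in props
        if (category is None or p.category == category)
        and (group is None or p.group == group)
    ]


if __name__ == "__main__":
    # Property values below are made up for the example.
    props = [
        ConfigProperty("JournalNode Max Log Size", "JournalNode", "Logs", 200),
        ConfigProperty("JournalNode Log Directory", "JournalNode", "Logs", "/var/log"),
        ConfigProperty("DataNode Max Log Size", "DataNode", "Logs", 200),
    ]
    for p in filter_properties(props, category="JournalNode", group="Logs"):
        print(p.name, p.effective, p.is_overridden)
```

Treating "no override" as a distinct state, rather than storing a copy of the default, is what lets the model tell an untouched property apart from one explicitly set to its default value.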
Modifying Configuration Properties Using Cloudera Manager

Note: This topic discusses how to configure properties using the Cloudera Manager "new layout." The older layout, called the "classic layout" is still available. For instructions on using the classic layout, see Modifying Configuration Properties (Classic Layout) on page 15.
To switch between the layouts, click either the Switch to the new layout or Switch to the classic layout links in the upper-right portion of all configuration pages.

Minimum Required Role: Configurator (also provided by Cluster Administrator, Full Administrator)

When a service is added to Cloudera Manager, either through the installation or upgrade wizard or with the Add Services workflow, Cloudera Manager automatically sets the configuration properties, based on the needs of the service and characteristics of the cluster in which it will run. These configuration properties include both service-wide configuration properties, as well as specific properties for each role type associated with the service, managed through role groups. A role group is a set of configuration properties for a role type, as well as a list of role instances associated with that group. Cloudera Manager automatically creates a default role group named RoleType Default Group for each role type. See Role Groups on page 48.

Changing the Configuration of a Service or Role Instance

1. Go to the service status page. (Cluster > service name)
2. Click the Configuration tab.
3. Locate the property you want to edit. You can type all or part of the property name in the search box, or use the filters on the left side of the screen:
   • The Status section limits the displayed properties by their status. Possible statuses include:
