Data Scraping Glossary
A comprehensive glossary of terms related to web scraping, automation, CAPTCHA solving, data extraction, proxies, and browser automation.
#
1 term
A
50 terms
Access Control List ACLAccount Takeover AtoAccuracyActorActor RunActor TaskAd BlockerAd FraudAd HidingAd InjectionAd NetworkAd StackingAdvertising ArbitrageAdvertising BotAffiliate HijackingAffiliate Id SwappingAffiliate MarketingAffiliate Marketing FraudAged LeadAi Training Data CollectionAI Web ScrapingAIOAlert FatigueAlternative DataAnonymous ProxyAnti-Scraping MechanismsAntidetect BrowserAPIApi CallApi CreditsApi EndpointApi KeyApi LibraryApi ParametersApi SecurityApi Terms Of ServiceApi TestingApp Shell ModelApplication SecurityArrayArtificial BotAsnAssortmentAsync ApiAudiocontext FingerprintingAuto DetectAuto Pagination DetectionAutomated ExtractionAvailabilityAxios
B
21 terms
Backconnect ProxyBacklink Checker BotBad ResultBannersBaseline PriceBeautiful SoupBenchmarkingBig DataBotBot AdvertisingBot DetectionBot FraudBot ManagementBot PreventionBot Protection SoftwareBot TrafficBotnetBreadcrumbs Data ContextBreadth First SearchBrowser As A ServiceBrowser Behavior Profiling
C
57 terms
C Plus PlusC SharpCacheCallbackCallback Lead ScamsCampaignCaptchaCardingCDNCDPChainingChatbotCheerioCicd For ScrapersCivil Investigative Demand (Cid)Click BaitClick BotClick FloodingClick FraudClick InjectionClick SpammingClick TagClick-Through RateClickjackingCloud ExtractionCloud Governance Cloud MigrationCloud SecurityCloudflare TurnstileCollyCompetitive IntelligenceCompliance FiltersConcurrenciesConsistent Click PatternsContainer SecurityContainerized ScrapingCookieCookie StuffingCopyright BotCopyright Infringement DetectionCounter BotCpa (Cost Per Action)Cpc (Cost Per Click)Cpl (Cost Per Lead)Cpm (Cost Per Mille)CrawlCrawl RunCrawleeCrawlerCSSCss SelectorCsvCtv FraudCustom TaskCyber Security SolutionsCyber Security ThreatsCyber Warfare
D
107 terms
DaasData AnalysisData BlendingData BreachData CenterData Center ProxiesData Center TrafficData CleansingData CollectionData CurationData DeduplicationData DiscoveryData ExtractionData FederationData FeedData FusionData GovernanceData IntegrityData LineageData LiteracyData Loss Prevention (DLP)Data MartsData MashupData MigrationData MiningData ModelingData NormalizationData ObfuscationData PipelineData ProfilingData ProtectionData ProvenanceData QualityData Quality AssuranceData Readiness LevelsData ReconciliationData RecoveryData ReductionData RefinementData RegistriesData ReportData RepurposingData ResilienceData RetentionData RetrievalData Science PlatformsData SecurityData SemanticsData SerializationData ServerData ServiceData SinkData StagingData StandardsData StewardData StreamingData StructuringData SubsettingData TaxonomyData TracingData Traffic AnalysisData Transformation ServicesData Transmutation Data UtilizationData Value ChainData VerificationData Visualization ToolsData WarehouseDatabase DesignDatabase IndexingDatabase SecurityDatacenter ProxyDataframeDatasetDDoS AttackDecision Support SystemsDemand Side Platform (DSP) Denial Of ServiceDepth First SearchDescriptive AnalyticsDevice FingerprintingDevice Spoofing (Ua Spoofing)Diagnostic AnalyticsDifferential PrivacyDigital ForensicsDigital MarketingDigital Marketing ArbitrageDigital ShelfDimensional ModelingDirect ResultsDisplay FraudDLPDMCA Takedown NoticesDNSDns ProtectionDNS ProxyDocument StorageDomDom TreeDomainDomain SpoofingDropDummy WebsitesDuplicate Ad RequestsDynamic PageDynamic RenderingDynamic Scraping
E
18 terms
F
23 terms
Fake LeadsFake ReviewsFalse PositiveFeature ExtractionFeatured SnippetFederated LearningFeed DeliveryFeed Fetcher BotFetchField Level EncryptionFile Format ConversionFile-Sharing BotFinancial Services CybersecurityFont FingerprintingFor LoopForced RedirectsForensic Data AnalysisForm BotFraud PreventionFraud RatesFraud TrendsFraudulent LeadsFully Managed Service
H
28 terms
HacktivistHaystackHeadless BrowserHealthcare CybersecurityHeuristic AnalysisHidden Api ScrapingHidden Web DataHierarchical Data FormatHigh Performance ComputingHipaa Privacy RuleHomomorphic EncryptionHoneypot TechniqueHostnameHTMLHTML AttributeHtml ParsingHtml TagHtml/Xml ParserHtmlAgilityPackHTTPHTTP HeaderHTTP MethodHTTP RedirectHTTP RequestHTTP ResponseHTTP TransactionHttpartyHybrid Data Models
I
26 terms
IdIdempotencyImpersonator BotImpression BotIn Memory ComputingIncremental LearningIndexingInformation ArchitectureInformation RetrievalInformation TheoryIngestionInsightsInstanceInternet Service ProviderIntersection Observer ApiIntrusion Detection PreventionInvalid Proxy TrafficInvalid Traffic (IVT)IoTIP AddressIp BlockingIP IntegrityIP Masking Or IP SpoofingIP RotationIpv4ISP Proxies
M
21 terms
Machine DataMachine LearningMacrosMagecartMalvertisingMap Minimum Advertised PriceMarket IntelligenceMaster Data ManagementMetadataMetadata HarvestingMetadata ManagementMimeMismatched Referral DataMismatched User AgentMobileMobile ProxiesMonitoringMonitoring BotMulti Dimensional AnalysisMulti-Threaded Web ScrapingMySQL
P
28 terms
Pagerank AlgorithmPaginationParsingPartitioningPatent FilingsPay Per ClickPayloadPci Dss CertificationPenetration TestingPhishingPlaying BotPlaywrightPost RequestPotential SavingsPre-BidPre-FetchingPrevent BotsPrevent DDOS AttacksPrice IndexPrice IntelligencePrivileged User MonitoringProduct DataPrometheus MonitoringPromotion ProxyProxy SubnetPuppeteerPython Requests
R
33 terms
RAGRansom Ddos RddosRate Backoff AlgorithmsRate LimitingRate ThrottlingRdfaReal-Time ResponseRecaptchaRecycled Aged LeadReferrerReflected Xss AttacksRegexRenderingRendering EnginesRequestRequest QueueRequest RateRequests (Library)Residential ProxyResponsive DesignRestful ApiReverse EngineeringReverse ProxyReviews And RatingsRoasRobots TxtRoi (Return On Investment)RootingRotating ProxiesRotating Residential ProxiesRPARule SetsRvest
S
55 terms
SaasSampling Sast Iast DastScalingSchema ScraperScraper BlockingScraper BotScrapingScraping Resilience MetricsScrapyScrapySharpSdk SpoofingSearch & Social Protectâ„¢Search ArbitrageSearch Engine BotSearch Engine Optimization (Seo)SeleniumSelenium GridSelenium WebDriverSelf-DealingSentiment AnalysisSERPServer ResponseServingShardingSIEMSingle Sign-On (SSO)SivtSmart Proxy RoutingSMBSneaker BotSoc 2 ComplianceSocial Media BotSocial Network BotSophisticated Invalid TrafficSourceSpamSpam BotSpiderSpoofingSpy BotSQLStatic PageStatic ScrapingStatus CodeStickinessStock LevelStop BotsStop DDOS AttacksStructured DataSuccess RateSupply Side Platform (SSP)SuspectSynchronous Request
T
21 terms
Template TaskTerms Of Use And Privacy PoliciesThoroughnessTimeoutTls FingerprintingTls Ja3 Hash Collsion TokenTrademark ProtectionTrader BotTraffic Acquisition Cost (TAC)Traffic ArbitrageTraffic OriginTrainingTransfer BotTransformationTransparencyTransparent ProxyTraversing The DomTrustworthy Accountability Group (TAG)Two-Factor Authentication (2FA)Typo-Squatting
X
2 terms
Y
1 term
Z
2 terms