术语表
您的工作流自动化、网页抓取、AI 智能体及相关领域的全面参考。
A
AI Agent
An AI agent is an autonomous software system that uses large language models to perceive its environment, make decisions, and take actions to accomplish goals with minimal human direction.
Agentic AI
Agentic AI refers to artificial intelligence systems that autonomously plan, execute, and adapt multi-step tasks by using tools and reasoning loops, going beyond simple prompt-response interactions.
AI Worker
An AI worker is an AI-powered digital employee that autonomously performs business tasks traditionally handled by humans, including data processing, research, reporting, and cross-application workflows, operating continuously without breaks or manual supervision.
AI Software Engineer
An AI software engineer is an AI system capable of writing, debugging, testing, and deploying code autonomously. It interprets requirements in natural language and produces working software, handling tasks from bug fixes to full feature development.
AI Marketer
An AI marketer is an AI-powered agent that autonomously handles marketing tasks including content creation, campaign management, SEO optimization, social media scheduling, competitor analysis, and performance reporting without continuous human direction.
AI Automation
AI automation uses artificial intelligence to automate tasks that require judgment, learning, and adaptation, going beyond rule-based systems to handle unstructured data, dynamic environments, and context-dependent decisions.
API Integration
API integration is the process of connecting two or more applications through their APIs so they can exchange data and trigger actions automatically. It eliminates manual data entry by letting systems communicate directly with each other in real time.
API Gateway
An API gateway is a server that acts as the single entry point for all API requests, handling routing, authentication, rate limiting, and load balancing. It sits between clients and backend services, simplifying client interactions and centralizing cross-cutting concerns.
API Rate Limit
An API rate limit is a restriction on the maximum number of API requests a client can make within a defined time window. Rate limits protect servers from overload, ensure fair resource distribution among users, and prevent abuse of API services.
AI Employee
An AI employee is a job-ready AI agent packaged with scoped permissions, connected applications, guardrails, persistent memory, and monitoring dashboards that performs knowledge work autonomously within an organization, handling tasks from data entry to customer outreach without continuous human oversight.
AI Sales Agent
An AI sales agent is an autonomous AI system that handles prospecting, lead qualification, outreach sequences, follow-up scheduling, and CRM updates, executing the repetitive mechanics of sales so human reps can focus on closing deals and building relationships.
AI Customer Service Agent
An AI customer service agent is an autonomous AI system that handles support tickets, live chat conversations, email responses, and customer inquiries by understanding intent, retrieving relevant information, and resolving issues without human intervention for routine cases.
AI Data Analyst
An AI data analyst is an autonomous AI system that collects, cleans, processes, and analyzes data from multiple sources, generating reports, dashboards, visualizations, and actionable insights without requiring manual SQL queries or spreadsheet work.
AI Researcher
An AI researcher is an autonomous AI agent that systematically gathers, evaluates, and synthesizes information from multiple sources — websites, databases, documents, and APIs — to produce comprehensive research reports, literature reviews, and competitive analyses.
AI Writer
An AI writer is an AI system that generates written content — blog posts, articles, reports, documentation, and other text — based on prompts, outlines, or reference material, producing draft-quality or publish-ready copy at scale.
AI Copywriter
An AI copywriter is an AI system specialized in writing persuasive marketing copy — advertisements, email subject lines, landing pages, product descriptions, and social media ads — optimized for conversion, engagement, and brand consistency.
AI Bookkeeper
An AI bookkeeper is an AI system that automatically categorizes transactions, reconciles bank accounts, processes receipts, manages accounts payable and receivable, and maintains accurate financial records with minimal human oversight.
AI Recruiter
An AI recruiter is an AI system that automates hiring workflows — screening resumes, sourcing candidates, scheduling interviews, managing applicant communication, and maintaining hiring pipeline data — to accelerate recruitment while reducing bias and manual effort.
AI Project Manager
An AI project manager is an AI system that tracks tasks, monitors deadlines, identifies risks and bottlenecks, generates status reports, and coordinates team workflows — keeping projects on schedule without the overhead of manual project administration.
AI Executive Assistant
An AI executive assistant is an AI system that manages calendars, triages emails, prepares meeting briefs, coordinates travel logistics, handles administrative tasks, and acts as a personal productivity layer for busy professionals and executives.
AI Social Media Manager
An AI social media manager is an AI system that creates, schedules, publishes, and analyzes social media content across platforms, maintaining consistent posting cadences and optimizing engagement without constant manual effort.
AI Content Creator
An AI content creator is an AI system that produces multiple content types — blog posts, videos, images, infographics, presentations, and social media content — enabling organizations to maintain high-volume, multi-format content output across channels.
AI SEO Specialist
An AI SEO specialist is an AI system that performs keyword research, content optimization, technical SEO auditing, backlink analysis, SERP monitoring, and competitive analysis to improve a website's search engine rankings and organic traffic.
AI Email Marketer
An AI email marketer is an AI system that writes email copy, segments audiences, personalizes messages, optimizes send times, runs A/B tests, and manages campaign performance — automating the full email marketing workflow from strategy to delivery.
AI Web Scraper
An AI web scraper is an AI-powered data extraction tool that understands page structure, identifies relevant data, and extracts information from websites without requiring manual CSS selector configuration or custom scripting for each target site.
AI QA Tester
An AI QA tester is an AI system that automatically generates test cases, executes functional and regression tests, identifies bugs, validates user interfaces, and reports defects — reducing the manual testing burden while improving coverage and consistency.
AI Chatbot
An AI chatbot is a conversational AI interface that interacts with users through text or voice to answer questions, provide information, complete tasks, and guide users through processes, ranging from simple FAQ bots to sophisticated AI-powered conversational agents.
AI Translator
An AI translator is an AI system that translates text, documents, and content between languages while preserving context, tone, cultural nuance, and domain-specific terminology with near-human accuracy for most language pairs.
AI Transcriber
An AI transcriber is an AI system that converts audio and video recordings into accurate text, identifying individual speakers, adding punctuation, and formatting output for readability — handling meetings, interviews, podcasts, and calls.
AI Legal Assistant
An AI legal assistant is an AI system that reviews contracts, identifies key clauses and risks, summarizes legal documents, tracks compliance deadlines, and performs legal research — automating the document-heavy aspects of legal work.
AI Financial Analyst
An AI financial analyst is an AI system that analyzes financial data, builds forecasting models, monitors market conditions, generates investment reports, and identifies financial trends — automating the quantitative and reporting aspects of financial analysis.
AI HR Manager
An AI HR manager is an AI system that handles employee onboarding workflows, answers policy questions, tracks performance data, manages compliance documentation, and provides workforce analytics — automating the administrative backbone of human resources.
AI Operations Manager
An AI operations manager is an AI system that monitors business workflows, identifies bottlenecks and inefficiencies, optimizes processes, manages resource allocation, and provides real-time operational visibility across an organization's systems and teams.
AI Supply Chain Analyst
An AI supply chain analyst is an AI system that forecasts demand, monitors inventory levels, optimizes logistics routing, tracks supplier performance, and manages procurement workflows — bringing data-driven intelligence to supply chain operations.
B
Business Process Automation
Business process automation (BPA) is the use of technology to automate repeatable, multi-step business processes, reducing manual effort, eliminating errors, and enabling organizations to scale operations without proportionally increasing headcount.
Browser Automation
Browser automation is the use of software to control a web browser programmatically, performing tasks like clicking buttons, filling forms, and extracting data without manual human interaction.
Bot Detection
Bot detection is a set of techniques websites use to identify and block automated traffic, distinguishing between human visitors and software-controlled browsers or scripts.
C
CSV
CSV (Comma-Separated Values) is a plain text file format that stores tabular data with each row on a new line and columns separated by commas. It is one of the most widely used formats for data exchange and export.
CSS Selector
A CSS selector is a pattern used to identify and target specific HTML elements on a web page, widely used in browser automation to locate buttons, forms, text, and other interactive elements.
CAPTCHA
CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is a challenge-response mechanism designed to distinguish human users from automated bots on websites.
Cookies
Cookies are small data files that websites store in a user's browser to maintain session state, track preferences, and manage authentication across page loads and visits.
Cron Job
A cron job is a time-based task scheduler in Unix-like systems that automatically runs scripts or commands at specified intervals, such as every hour, daily, or on specific days of the week.
CI/CD
CI/CD (Continuous Integration/Continuous Delivery) is a software development practice that automates code building, testing, and deployment, enabling teams to release updates frequently and reliably through an automated pipeline.
Containerization
Containerization is a technology that packages an application and all its dependencies into a lightweight, isolated unit called a container, ensuring it runs consistently across any environment without conflicts.
Cloud Computing
Cloud computing is the delivery of computing services — servers, storage, databases, networking, software, and analytics — over the internet on a pay-as-you-go basis. It eliminates the need to own and maintain physical infrastructure, enabling organizations to scale resources up or down on demand.
D
Digital Transformation
Digital transformation is the strategic integration of digital technology into all areas of a business, fundamentally changing how it operates, delivers value to customers, and competes in the market through data-driven decision-making and automated processes.
Data Extraction
Data extraction is the process of retrieving structured or unstructured data from various sources — websites, documents, databases, APIs, or files — and converting it into a usable format for analysis, storage, or further processing.
Data Pipeline
A data pipeline is an automated sequence of steps that moves data from one or more sources to a destination, applying transformations, validations, and enrichments along the way.
Data Transformation
Data transformation is the process of converting data from one format, structure, or value system to another, making it suitable for analysis, storage, or consumption by downstream systems.
Data Enrichment
Data enrichment is the process of enhancing raw or incomplete data by merging it with additional context from external sources. It adds missing fields, validates existing values, and appends supplementary attributes to make datasets more complete and actionable.
Data Scraping
Data scraping is the broad practice of programmatically extracting data from any digital source — websites, applications, databases, documents, or APIs. It encompasses web scraping, screen scraping, and other automated extraction techniques.
Data Cleaning
Data cleaning is the process of detecting and correcting corrupt, inaccurate, incomplete, or irrelevant records in a dataset. Also called data cleansing or data wrangling, it ensures data quality before analysis, reporting, or integration with downstream systems.
DOM
The DOM (Document Object Model) is a tree-structured representation of a web page's HTML that browsers create in memory, allowing scripts and automation tools to read and manipulate page content programmatically.
Data Integration
Data integration is the process of combining data from multiple disparate sources into a unified, consistent view. It involves extracting data from various systems, transforming it into a common format, and loading it into a destination where it can be accessed and analyzed holistically.
DevOps
DevOps is a set of practices, cultural philosophies, and tools that unify software development (Dev) and IT operations (Ops) to shorten the development lifecycle and deliver high-quality software continuously. It emphasizes automation, collaboration, monitoring, and rapid iteration.
E
ETL (Extract, Transform, Load)
ETL stands for Extract, Transform, Load — a three-phase data integration process that pulls data from source systems, converts it into a consistent format, and loads it into a destination such as a data warehouse or database.
Error Handling
Error handling is the practice of anticipating, detecting, and responding to errors in software execution, including catching exceptions, logging failures, and implementing fallback logic to keep automated workflows running reliably.
G
GraphQL
GraphQL is a query language and runtime for APIs that lets clients request exactly the data they need in a single request. Developed by Facebook, it solves the over-fetching and under-fetching problems common with REST APIs.
H
Human-in-the-Loop
Human-in-the-loop (HITL) is a design pattern where automated systems pause at critical decision points to request human review, approval, or input before proceeding.
Hyperautomation
Hyperautomation is a business-driven strategy, coined by Gartner, for rapidly identifying and automating as many business processes as possible using a combination of AI, RPA, machine learning, process mining, and integration tools.
Headless Browser
A headless browser is a web browser that operates without a visible graphical user interface, executing page loads, JavaScript, and rendering entirely in the background for faster automated tasks.
HTTP
HTTP (Hypertext Transfer Protocol) is the foundational protocol of the web that defines how clients and servers communicate. Every API call, web page load, and webhook delivery travels over HTTP.
I
Intelligent Automation
Intelligent automation (IA) combines artificial intelligence, robotic process automation, and process analytics to automate complex business processes that require decision-making, pattern recognition, and adaptation beyond what rule-based systems can handle.
Iframe
An iframe (inline frame) is an HTML element that embeds a separate web page within a parent page, creating an independent browsing context with its own DOM, scripts, and security boundaries.
IFTTT
IFTTT (If This Then That) is a consumer automation platform that connects apps and smart devices using simple conditional recipes. Each recipe follows a single trigger-action pattern, making it accessible for personal automation but limited for business workflows.
iPaaS (Integration Platform as a Service)
iPaaS (Integration Platform as a Service) is a cloud-based platform that enables organizations to build, deploy, and manage integrations between applications without maintaining integration infrastructure. It provides pre-built connectors, visual workflow builders, and centralized monitoring for connecting SaaS and on-premise systems.
Idempotent
An operation is idempotent if performing it multiple times produces the same result as performing it once. In automation, idempotent tasks can be safely retried without causing duplicate data or unintended side effects.
IP Rotation
IP rotation is the practice of automatically cycling through multiple IP addresses when making requests, distributing traffic across different addresses to avoid detection, rate limits, and IP-based blocking.
J
JSON
JSON (JavaScript Object Notation) is a lightweight, text-based data interchange format that uses human-readable key-value pairs and arrays to represent structured data. It is the dominant format for web APIs and configuration files.
L
Low-Code Automation
Low-code automation is an approach to building automated workflows using visual interfaces with minimal hand-coding, allowing both technical and non-technical users to create, modify, and deploy automation faster than traditional development.
Load Balancing
Load balancing is the process of distributing incoming network traffic or workloads across multiple servers or resources to ensure no single server is overwhelmed, improving reliability, performance, and availability.
Latency
Latency is the time delay between initiating a request and receiving a response, measured in milliseconds. In automation, lower latency means faster workflow execution and more responsive interactions with websites and APIs.
M
Machine Learning
Machine learning (ML) is a subset of artificial intelligence where computer systems learn patterns from data and improve their performance on tasks over time without being explicitly programmed for each specific scenario.
Middleware
Middleware is software that sits between two systems or layers of an application, handling communication, data transformation, authentication, or routing. In integration contexts, middleware brokers data exchange between applications that cannot communicate directly.
Make (formerly Integromat)
Make (formerly Integromat) is a visual automation platform that connects apps and designs complex workflows using a drag-and-drop canvas. It offers more advanced data transformation and branching capabilities than simple trigger-action tools, making it popular for technically-inclined users.
Microservices
Microservices is a software architecture pattern where applications are built as a collection of small, independently deployable services, each responsible for a specific business capability. Services communicate via APIs and can be developed, deployed, and scaled independently.
N
No-Code Automation
No-code automation enables non-technical users to build, deploy, and manage automated workflows using visual interfaces and natural-language instructions instead of writing programming code.
Natural Language Processing
Natural language processing (NLP) is a branch of artificial intelligence that enables computers to understand, interpret, and generate human language, powering applications from chatbots and search engines to document analysis and automated content creation.
n8n
n8n is an open-source workflow automation tool that offers both self-hosted and cloud-hosted deployment options. It provides a visual workflow editor with the flexibility to add custom code, appealing to developers who want control over their automation infrastructure.
O
OCR (Optical Character Recognition)
OCR (Optical Character Recognition) is technology that converts images of text — from scanned documents, photographs, screenshots, or PDFs — into machine-readable text that can be searched, edited, and processed by software.
OAuth
OAuth is an open authorization standard that lets users grant third-party applications limited access to their accounts without sharing passwords. It is the protocol behind every 'Sign in with Google' button and most API authorization flows.
P
Process Mining
Process mining is a data-driven technique that uses event logs from business systems to discover, visualize, and analyze how processes actually execute, revealing bottlenecks, deviations, and automation opportunities that manual observation misses.
Prompt Engineering
Prompt engineering is the practice of designing, structuring, and optimizing inputs to AI language models to elicit accurate, relevant, and useful outputs, encompassing techniques like few-shot examples, chain-of-thought reasoning, and role prompting.
Pagination
Pagination is the practice of dividing large datasets or content lists into discrete pages, requiring sequential navigation to access all records. In data extraction, handling pagination means automatically traversing all pages to collect the complete dataset.
PDF Parsing
PDF parsing is the process of extracting text, tables, images, and structured data from PDF documents programmatically. It converts the visual layout of a PDF into machine-readable data for analysis and processing.
Playwright
Playwright is Microsoft's open-source browser automation framework that provides a single API to control Chromium, Firefox, and WebKit browsers. It supports headless and headed modes, auto-waiting, and network interception, making it a leading choice for testing and web scraping.
Puppeteer
Puppeteer is a Node.js library developed by Google that provides a high-level API to control headless Chrome and Chromium browsers. It is widely used for web scraping, automated testing, PDF generation, and screenshot capture.
Proxy Server
A proxy server is an intermediary server that routes network requests between a client and a destination server, masking the client's identity and enabling access control, caching, and geographic distribution of traffic.
R
RPA (Robotic Process Automation)
Robotic Process Automation (RPA) uses software robots to mimic human interactions with application user interfaces, automating repetitive tasks like data entry, form filling, and cross-system data transfer.
Regex (Regular Expressions)
Regex (regular expressions) is a pattern-matching language used to search, match, and extract text based on character patterns. It is widely used in data extraction, validation, and text processing.
REST API
A REST API is a web service interface that follows Representational State Transfer (REST) architectural principles, using standard HTTP methods to create, read, update, and delete resources. It is the most widely adopted API style for web and mobile applications.
Rate Limiting
Rate limiting is a technique that controls the number of requests a client can make to a server within a given time window, preventing abuse and ensuring fair resource distribution.
Retry Logic
Retry logic is a fault-tolerance pattern that automatically re-attempts failed operations with configurable delays, backoff strategies, and maximum attempt limits to handle transient errors without manual intervention.
S
Screen Scraping
Screen scraping is a technique for extracting data from an application's visual display rather than its underlying data source. It captures what appears on screen, translating visual output into structured data.
Structured Data
Structured data is information organized in a predefined, predictable format — typically rows and columns in databases, spreadsheets, or tabular files like CSV and JSON. Each field has a defined type and position, making it easy to query, filter, and analyze programmatically.
Selenium
Selenium is an open-source suite of tools for automating web browsers, widely used for testing web applications and web scraping. Its WebDriver API provides cross-browser control through a standardized protocol supported by all major browsers.
Shadow DOM
Shadow DOM is a web standard that encapsulates a DOM subtree inside a web component, isolating its markup, styles, and behavior from the rest of the page. It creates a boundary that standard CSS selectors and DOM queries cannot cross without explicit handling.
SDK
An SDK (Software Development Kit) is a packaged set of tools, libraries, and documentation that developers use to build applications for a specific platform or integrate with a specific service. SDKs simplify API interaction by abstracting low-level HTTP details into native-language function calls.
Session Management
Session management is the process of maintaining stateful interactions between a client and server across multiple requests, typically using cookies, tokens, or server-side session stores to track authentication and user context.
Serverless Computing
Serverless computing is a cloud execution model where the cloud provider dynamically manages server infrastructure, automatically allocating resources on demand. Developers deploy code as functions that run in response to events, paying only for actual compute time rather than reserved capacity.
T
Task Scheduling
Task scheduling is the process of automating when and how tasks execute, whether on a fixed timetable, triggered by events, or managed through a dependency-aware queue that ensures tasks run in the correct order.
Throughput
Throughput is the amount of work or data processed by a system within a given time period. In automation, it measures how many tasks, requests, or workflow executions a platform can complete per unit of time.
U
Unstructured Data
Unstructured data is information that lacks a predefined format or schema — including emails, PDFs, images, social media posts, and free-form text. It requires specialized techniques like NLP, OCR, or AI to extract meaningful, structured information from it.
User Agent
A user agent is a string that browsers send with every HTTP request identifying the browser type, version, and operating system, used by websites to serve appropriate content and detect automated traffic.
W
Workflow Automation
Workflow automation is the use of software to execute recurring business processes with minimal human intervention, routing tasks, data, and decisions through a predefined sequence of steps.
Web Scraping
Web scraping is the automated process of extracting data from websites by programmatically reading and parsing HTML content. It enables businesses to collect structured information from web pages at scale without manual copy-and-paste.
Web Crawler
A web crawler (also called a spider or bot) is a program that systematically browses the internet by following links from page to page, discovering and indexing web content at scale.
Web Scraping API
A web scraping API is an API endpoint that handles the complexities of web scraping — proxy rotation, browser rendering, CAPTCHA solving, and anti-bot evasion — returning extracted data in a structured format from a single API call.
Web Element
A web element is any individual component rendered on a web page — buttons, links, input fields, images, text blocks, dropdown menus, and other HTML elements. In browser automation, web elements are the targets of interactions like clicking, typing, and data extraction.
Webhook
A webhook is an HTTP callback that automatically sends data from one application to another when a specific event occurs. Unlike polling an API repeatedly, webhooks push updates in real time the moment something happens.
X
XPath
XPath (XML Path Language) is a query language for selecting nodes in XML and HTML documents, offering powerful navigation capabilities including parent traversal and conditional filtering.
Z
Zapier
Zapier is a popular no-code automation platform that connects over 6,000 apps using trigger-action workflows called Zaps. It enables non-technical users to automate repetitive tasks like syncing data between apps, sending notifications, and updating records without writing code.
几分钟内即可构建 AI 驱动的工作流。