How to Solve Cloudflare in 2024: Solve Cloudflare Turnstile and Challenge By Using CapSolver
Cloudflare offers robust protection measures, including the Cloudflare Challenge and Turnstile CAPTCHA, to secure websites from malicious bots and automated scrapers. Understanding and overcoming these challenges can be crucial for those needing access beyond these protective barriers for legitimate purposes.
Understanding Cloudflare's Security Measures
Cloudflare employs various techniques to detect and mitigate unwanted traffic. Among these are the Cloudflare Challenge, which often involves JavaScript challenges or CAPTCHA tests, and Turnstile, a more user-friendly CAPTCHA that minimizes user interaction. These systems are designed to distinguish between human users and automated bots effectively.
Strategies for Solving Cloudflare Challenges
-
Automated Browsers:
- Tools like Selenium or Puppeteer can simulate real user interactions. To mask their automated nature, plugins such as
puppeteer-extra-plugin-stealth
can be used to pass the Cloudflare Challenge by emulating natural browser behavior.
- Tools like Selenium or Puppeteer can simulate real user interactions. To mask their automated nature, plugins such as
-
High-Quality Proxies:
- Utilizing residential proxies that rotate IP addresses can help mimic genuine user access patterns, thereby reducing the likelihood of being blocked by Cloudflare's security measures.
-
CAPTCHA Solving Services:
- For challenges / captchas that require solving
Detailed Guide to solving Cloudflare Challenge
First, let's take a look of how cloudflare challenge looks like:
Sometimes, this page could have turnstile
If have a similar page where you must wait until the verification is completed and usually have additional checks with turnstile, then it's cloudflare challenge.
There are some requeriments for solve this type of challenge:
Token
returned in the response of the method getTaskResult is the value of the cookiecf_clearance
that you will need to create.- Must use the same user-agent that the method getTaskResult return
- Must use the same proxy IP used for solve the challenge
- Must use the cookies of the response
- Must use the headers of the response
- Use TLS chrome 120 version
Example of how to solve cloudflare 5s interface with Python
# -*- coding: utf-8 -*-
import time
import requests
import tls_client
API_KEY = "" # TODO: your api key
page_url = '' # TODO: your page url 'https://example.com
proxy = "" # TODO: your proxy
def call_capsolver():
data = {
"clientKey": API_KEY,
"task": {
"type": 'AntiCloudflareTask',
"websiteURL": page_url,
"proxy": proxy,
}
}
uri = 'https://api.capsolver.com/createTask'
res = requests.post(uri, json=data)
resp = res.json()
task_id = resp.get('taskId')
if not task_id:
print("no get taskId:", res.text)
return
print('created taskId:', task_id)
while True:
time.sleep(1)
data = {
"clientKey": API_KEY,
"taskId": task_id
}
response = requests.post('https://api.capsolver.com/getTaskResult', json=data)
resp = response.json()
status = resp.get('status', '')
if status == "ready":
print("successfully => ", response.text)
return resp.get('solution')
if status == "failed" or resp.get("errorId"):
print("failed! => ", response.text)
return
def request_site(solution):
# TODO if you want to use tls_client
session = tls_client.Session(
client_identifier="chrome_120",
random_tls_extension_order=True
)
return session.get(
page_url,
headers=solution.get('headers'),
cookies=solution.get('cookies'),
proxy=proxy,
allow_redirects=True,
)
def main():
# first request:
res = request_site({})
print('1. response status code:', res.status_code)
if res.status_code != 403:
print("your proxy is good and didn't get the cloudflare challenge")
return
elif 'window._cf_chl_opt' not in res.text:
print('==== proxy blocked ==== ')
return
# call capSolver:
solution = call_capsolver()
if not solution:
return
# second request (verify solution):
res = request_site(solution)
print('2. response status code:', res.status_code)
if res.status_code == 200:
##print("2. response text:\n", res.text)
print("successfully passed the cloudflare challenge")
if __name__ == '__main__':
main()
Detailed Guide to solving Cloudflare Turnstile Captcha
Cloudflare offer 3 types of cloudflare turnstile captcha
-
Managed challenge
-
Non-interactive challenge
-
Invisible challenge
not visible, you can check on the network / scripts loaded and see if turnstile is used
Turnstile CAPTCHA is Cloudflare’s replacement for traditional CAPTCHA challenges, focusing on user experience and security. Here’s how to approach solving it:
-
Understand the Challenge:
- Turnstile CAPTCHA might not always appear visibly; sometimes, it operates in the background, analyzing user interactions to verify authenticity.
-
Integration of CAPTCHA Solving APIs:
- Some CAPTCHA solving services have begun offering solutions for Turnstile by analyzing network traffic and interaction patterns. Utilize these APIs to submit Turnstile challenges for resolution.
-
Maintain Browser Consistency:
- Ensure that your automated solutions maintain consistent browser signatures, as Turnstile can analyze these to detect automation.
-
Adapt to JavaScript Challenges:
- Cloudflare may deploy JavaScript computations to test the browser's environment. Tools like Puppeteer can be scripted to solve these challenges dynamically by executing JavaScript code as a regular browser would.
Example of how to solve cloudflare turnstile captcha with Python
import time
from curl_cffi import requests
CAPSOLVER_API_KEY = "Your CAPSOLVER.COM API KEY"
PAGE_URL = ""
WEBSITE_KEY = ""
def solvecf(metadata_action=None, metadata_cdata=None):
url = "https://api.capsolver.com/createTask"
task = {
"type": "AntiTurnstileTaskProxyLess",
"websiteURL": PAGE_URL,
"websiteKey": WEBSITE_KEY,
}
if metadata_action or metadata_cdata:
task["metadata"] = {}
if metadata_action:
task["metadata"]["action"] = metadata_action
if metadata_cdata:
task["metadata"]["cdata"] = metadata_cdata
data = {
"clientKey": CAPSOLVER_API_KEY,
"task": task
}
response_data = requests.post(url, json=data).json()
print(response_data)
return response_data['taskId']
def solutionGet(taskId):
url = "https://api.capsolver.com/getTaskResult"
status = ""
while status != "ready":
data = {"clientKey": CAPSOLVER_API_KEY, "taskId": taskId}
response_data = requests.post(url, json=data).json()
print(response_data)
status = response_data.get('status', '')
print(status)
if status == "ready":
return response_data['solution']
time.sleep(2)
def main():
start_time = time.time()
taskId = solvecf()
solution = solutionGet(taskId)
if solution:
user_agent = solution['userAgent']
token = solution['token']
print("User_Agent:", user_agent)
print("Solved Turnstile Captcha, token:", token)
end_time = time.time()
elapsed_time = end_time - start_time
print(f"Time to solve the captcha: {elapsed_time} seconds")
if __name__ == "__main__":
main()
Example of how to solve cloudflare turnstile captcha with Nodejs
const axios = require('axios');
const CAPSOLVER_API_KEY = "";
const PAGE_URL = "";
const WEBSITE_KEY = "";
async function solvecf(metadata_action = null, metadata_cdata = null) {
const url = "https://api.capsolver.com/createTask";
const task = {
type: "AntiTurnstileTaskProxyLess",
websiteURL: PAGE_URL,
websiteKey: WEBSITE_KEY,
};
if (metadata_action || metadata_cdata) {
task.metadata = {};
if (metadata_action) {
task.metadata.action = metadata_action;
}
if (metadata_cdata) {
task.metadata.cdata = metadata_cdata;
}
}
const data = {
clientKey: CAPSOLVER_API_KEY,
task: task
};
const response = await axios.post(url, data);
console.log(response.data);
return response.data.taskId;
}
async function solutionGet(taskId) {
const url = "https://api.capsolver.com/getTaskResult";
let status = "";
while (status !== "ready") {
const data = { clientKey: CAPSOLVER_API_KEY, taskId: taskId };
const response = await axios.post(url, data);
console.log(response.data);
status = response.data.status;
console.log(status);
if (status === "ready") {
return response.data.solution;
}
await new Promise(resolve => setTimeout(resolve, 2000));
}
}
async function main() {
const start_time = Date.now();
const taskId = await solvecf();
const solution = await solutionGet(taskId);
if (solution) {
const user_agent = solution.userAgent;
const token = solution.token;
console.log("User_Agent:", user_agent);
console.log("Solved Turnstile Captcha, token:", token);
}
const end_time = Date.now();
const elapsed_time = (end_time - start_time) / 1000;
console.log(`Time to solve the captcha: ${elapsed_time} seconds`);
}
main().catch(console.error);