URL

History

Source Code: lib/url.js

稳定性：2 - 稳定

node:url 模块提供用于 URL 解析和处理的工具。可以使用以下方式访问它：

import url from 'node:url';

URL 字符串和 URL 对象

URL 字符串是一个包含多个有意义组件的结构化字符串。解析后，将返回一个 URL 对象，其中包含这些组件各自的属性。

node:url 模块提供了两个用于处理 URL 的 API：一个是 Node.js 特有的旧版 API，另一个是实现与 Web 浏览器相同的 WHATWG URL 标准的新版 API。

下面提供了 WHATWG API 和旧版 API 之间的比较。在 URL 'https://user:pass@sub.example.com:8080/p/a/t/h?query=string#hash' 上方，显示了旧版 url.parse() 返回的对象的属性。在其下方是 WHATWG URL 对象的属性。

WHATWG URL 的 origin 属性包括 protocol 和 host，但不包括 username 或 password。

┌────────────────────────────────────────────────────────────────────────────────────────────────┐
│                                              href                                              │
├──────────┬──┬─────────────────────┬────────────────────────┬───────────────────────────┬───────┤
│ protocol │  │        auth         │          host          │           path            │ hash  │
│          │  │                     ├─────────────────┬──────┼──────────┬────────────────┤       │
│          │  │                     │    hostname     │ port │ pathname │     search     │       │
│          │  │                     │                 │      │          ├─┬──────────────┤       │
│          │  │                     │                 │      │          │ │    query     │       │
"  https:   //    user   :   pass   @ sub.example.com : 8080   /p/a/t/h  ?  query=string   #hash "
│          │  │          │          │    hostname     │ port │          │                │       │
│          │  │          │          ├─────────────────┴──────┤          │                │       │
│ protocol │  │ username │ password │          host          │          │                │       │
├──────────┴──┼──────────┴──────────┼────────────────────────┤          │                │       │
│   origin    │                     │         origin         │ pathname │     search     │ hash  │
├─────────────┴─────────────────────┴────────────────────────┴──────────┴────────────────┴───────┤
│                                              href                                              │
└────────────────────────────────────────────────────────────────────────────────────────────────┘
(All spaces in the "" line should be ignored. They are purely for formatting.)

text

使用 WHATWG API 解析 URL 字符串：

const myURL =
  new URL('https://user:pass@sub.example.com:8080/p/a/t/h?query=string#hash');

JavaScript

使用旧版 API 解析 URL 字符串：

import url from 'node:url';
const myURL =
  url.parse('https://user:pass@sub.example.com:8080/p/a/t/h?query=string#hash');

JavaScript

从组件部分构造 URL 并获取构造后的字符串

可以使用属性设置器或模板字符串从组件部分构造 WHATWG URL：

const myURL = new URL('https://example.org');
myURL.pathname = '/a/b/c';
myURL.search = '?d=e';
myURL.hash = '#fgh';

JavaScript

要获取构造后的 URL 字符串，请使用 href 属性访问器：

console.log(myURL.href);

协议	端口
"ftp"	21
"file"
"http"	80
"https"	443
"ws"	80
"wss"	443

以类似于 <Map> 构造函数的方法使用可迭代映射实例化一个新的 URLSearchParams 对象。iterable 可以是 Array 或任何可迭代对象。这意味着 iterable 可以是另一个 URLSearchParams，在这种情况下，构造函数将简单地创建提供的 URLSearchParams 的克隆。iterable 的元素是键值对，并且它们本身可以是任何可迭代对象。

允许重复键。

let params;

// 使用数组
params = new URLSearchParams([
  ['user', 'abc'],
  ['query', 'first'],
  ['query', 'second'],
]);
console.log(params.toString());
// 打印 'user=abc&query=first&query=second'

// 使用 Map 对象
const map = new Map();
map.set('user', 'abc');
map.set('query', 'xyz');
params = new URLSearchParams(map);
console.log(params.toString());
// 打印 'user=abc&query=xyz'

// 使用生成器函数
function* getQueryPairs() {
  yield ['user', 'abc'];
  yield ['query', 'first'];
  yield ['query', 'second'];
}
params = new URLSearchParams(getQueryPairs());
console.log(params.toString());
// 打印 'user=abc&query=first&query=second'

// 每个键值对必须正好有两个元素
new URLSearchParams([
  ['user', 'abc', 'error'],
]);
// 抛出 TypeError [ERR_INVALID_TUPLE]:
//        每个查询对必须是可迭代的 [name, value] 元组

JavaScript

urlSearchParams.append

urlSearchParams.append(name, value): void

Attributes

name:<string>

value:<string>

将新的名称 - 值对附加到查询字符串。

urlSearchParams.delete

History

urlSearchParams.delete(name, value?): void

Attributes

name:<string>

value:<string>

如果提供了 value，则移除所有名称为 name 且值为 value 的名称 - 值对。

如果未提供 value，则移除所有名称为 name 的名称 - 值对。

urlSearchParams.entries

urlSearchParams.entries(): void

返回：<Iterator>

返回一个 ES6 Iterator，遍历查询中的每个名称 - 值对。迭代器的每个项都是一个 JavaScript Array。Array 的第一项是 name，Array 的第二项是 value。

urlSearchParams[Symbol.iterator]() 的别名。

urlSearchParams.forEach

History

urlSearchParams.forEach(fn, thisArg?): void

Attributes

fn:<Function>

为查询中的每个名称 - 值对调用

thisArg:<Object>

当调用 fn 时用作 this 值

遍历查询中的每个名称 - 值对并调用给定函数。

const myURL = new URL('https://example.org/?a=b&c=d');
myURL.searchParams.forEach((value, name, searchParams) => {
  console.log(name, value, myURL.searchParams === searchParams);
});
// 打印：
//   a b true
//   c d true

JavaScript

urlSearchParams.get

urlSearchParams.get(name): void

Attributes

name:<string>

返回： <string> | <null> 一个字符串，如果没有具有给定 name 的名称 - 值对，则为 null 。

返回名称为 name 的第一个名称 - 值对的值。如果没有这样的对，则返回 null。

urlSearchParams.getAll

urlSearchParams.getAll(name): void

Attributes

name:<string>

返回： <string[]>

返回名称为 name 的所有名称 - 值对的值。如果没有这样的对，则返回一个空数组。

urlSearchParams.has

History

urlSearchParams.has(name, value?): void

Attributes

name:<string>

value:<string>

返回： <boolean>

检查 URLSearchParams 对象是否包含基于 name 和可选 value 参数的键值对。

如果提供了 value，当存在具有相同 name 和 value 的名称 - 值对时，返回 true。

如果未提供 value，如果存在至少一个名称为 name 的名称 - 值对，则返回 true。

urlSearchParams.keys

urlSearchParams.keys(): void

返回：<Iterator>

返回一个 ES6 Iterator，遍历每个名称 - 值对的名称。

const params = new URLSearchParams('foo=bar&foo=baz');
for (const name of params.keys()) {
  console.log(name);
}
// 打印：
//   foo
//   foo

JavaScript

urlSearchParams.set

urlSearchParams.set(name, value): void

Attributes

name:<string>

value:<string>

将 URLSearchParams 对象中与 name 关联的值设置为 value。如果存在任何名称为 name 的现有名称 - 值对，则将第一个这样的对的值设置为 value 并移除所有其他对。如果不存在，则将名称 - 值对附加到查询字符串。

const params = new URLSearchParams();
params.append('foo', 'bar');
params.append('foo', 'baz');
params.append('abc', 'def');
console.log(params.toString());
// 打印 foo=bar&foo=baz&abc=def

params.set('foo', 'def');
params.set('xyz', 'opq');
console.log(params.toString());
// 打印 foo=def&abc=def&xyz=opq

JavaScript

urlSearchParams.size

History

参数条目的总数。

urlSearchParams.sort

History

urlSearchParams.sort(): void

就地按其名称对所有现有名称 - 值对进行排序。排序使用稳定排序算法完成，因此具有相同名称的名称 - 值对之间的相对顺序保持不变。

此方法特别可用于增加缓存命中率。

const params = new URLSearchParams('query[]=abc&type=search&query[]=123');
params.sort();
console.log(params.toString());
// 打印 query%5B%5D=abc&query%5B%5D=123&type=search

JavaScript

urlSearchParams.toString

urlSearchParams.toString(): void

返回：<string>

返回序列化为字符串的搜索参数，必要时对字符进行百分号编码。

urlSearchParams.values

urlSearchParams.values(): void

返回：<Iterator>

返回一个 ES6 Iterator，遍历每个名称 - 值对的值。

urlSearchParams[Symbol.iterator]

urlSearchParams[Symbol.iterator](): void

返回：<Iterator>

返回一个 ES6 Iterator，遍历查询字符串中的每个名称 - 值对。迭代器的每个项都是一个 JavaScript Array。Array 的第一项是 name，Array 的第二项是 value。

urlSearchParams.entries() 的别名。

const params = new URLSearchParams('foo=bar&xyz=baz');
for (const [name, value] of params) {
  console.log(name, value);
}
// 打印：
//   foo bar
//   xyz baz

JavaScript

url.domainToASCII

History

url.domainToASCII(domain): void

Attributes

domain:<string>

返回： <string>

返回 domain 的 Punycode ASCII 序列化。如果 domain 是无效的域，则返回空字符串。

它执行与 url.domainToUnicode() 相反的操作。

import url from 'node:url';

console.log(url.domainToASCII('español.com'));
// 打印 xn--espaol-zwa.com
console.log(url.domainToASCII('中文.com'));
// 打印 xn--fiq228c.com
console.log(url.domainToASCII('xn--iñvalid.com'));
// 打印空字符串

JavaScript

url.domainToUnicode

History

url.domainToUnicode(domain): void

Attributes

domain:<string>

返回： <string>

返回 domain 的 Unicode 序列化。如果 domain 是无效的域，则返回空字符串。

它执行与 url.domainToASCII() 相反的操作。

import url from 'node:url';

console.log(url.domainToUnicode('xn--espaol-zwa.com'));
// 打印 español.com
console.log(url.domainToUnicode('xn--fiq228c.com'));
// 打印 中文.com
console.log(url.domainToUnicode('xn--iñvalid.com'));
// 打印空字符串

JavaScript

url.fileURLToPath

History

url.fileURLToPath(url, options?): void

Attributes

url:<URL> | <string>

要转换为路径的文件 URL 字符串或 URL 对象。

options:<Object>

windows:<boolean> | <undefined>

如果 path 应该作为 windows 文件路径返回，则为 true ，如果是 posix 则为 false ，如果是系统默认值则为 undefined 。 默认值： undefined 。

返回： <string> 完全解析的特定于平台的 Node.js 文件路径。

此函数确保正确解码百分号编码的字符，并确保跨平台有效的绝对路径字符串。

安全注意事项：

此函数解码百分号编码的字符，包括编码的点段（%2e 作为 . 和 %2e%2e 作为 ..），然后规范化结果路径。这意味着编码的目录遍历序列（例如 %2e%2e）被解码并作为实际路径遍历处理，即使编码的斜杠（%2F、%5C）被正确拒绝。

应用程序不得仅依赖 fileURLToPath() 来防止目录遍历攻击。 在使用返回的路径值进行文件系统操作之前，始终对返回的路径值执行显式路径验证和安全检查，以确保它保持在预期边界内。

import { fileURLToPath } from 'node:url';

const __filename = fileURLToPath(import.meta.url);

new URL('file:///C:/path/').pathname;      // 不正确：/C:/path/
fileURLToPath('file:///C:/path/');         // 正确：C:\path\ (Windows)

new URL('file://nas/foo.txt').pathname;    // 不正确：/foo.txt
fileURLToPath('file://nas/foo.txt');       // 正确：\\nas\foo.txt (Windows)

new URL('file:///你好.txt').pathname;      // 不正确：/%E4%BD%A0%E5%A5%BD.txt
fileURLToPath('file:///你好.txt');         // 正确：/你好.txt (POSIX)

new URL('file:///hello world').pathname;   // 不正确：/hello%20world
fileURLToPath('file:///hello world');      // 正确：/hello world (POSIX)

JavaScript

url.fileURLToPathBuffer

url.fileURLToPathBuffer(url, options?): void

Attributes

url:<URL> | <string>

要转换为路径的文件 URL 字符串或 URL 对象。

options:<Object>

windows:<boolean> | <undefined>

如果 path 应该作为 windows 文件路径返回，则为 true ，如果是 posix 则为 false ，如果是系统默认值则为 undefined 。 默认值： undefined 。

返回：{Buffer} 完全解析的特定于平台的 Node.js 文件路径作为 {Buffer}。

类似于 url.fileURLToPath(...)，不同之处在于它返回路径的 Buffer 而不是字符串表示。当输入 URL 包含不是有效 UTF-8 / Unicode 序列的百分号编码段时，此转换很有用。

安全注意事项：

此函数具有与 url.fileURLToPath() 相同的安全注意事项。它解码百分号编码的字符，包括编码的点段（%2e 作为 . 和 %2e%2e 作为 ..），并规范化路径。应用程序不得仅依赖此函数来防止目录遍历攻击。 在使用返回的 buffer 值进行文件系统操作之前，始终对返回的 buffer 值执行显式路径验证。

url.format

History

url.format(URL, options?): void

Attributes

URL:<URL>

一个 WHATWG URL 对象

options:<Object>

auth:<boolean>

如果序列化的 URL 字符串应包含用户名和密码，则为 true ，否则为 false 。 默认值： true 。

fragment:<boolean>

如果序列化的 URL 字符串应包含片段，则为 true ，否则为 false 。 默认值： true 。

search:<boolean>

如果序列化的 URL 字符串应包含搜索查询，则为 true ，否则为 false 。 默认值： true 。

unicode:<boolean>

如果 URL 字符串主机组件中出现的 Unicode 字符应直接编码而不是进行 Punycode 编码，则为 true 。 默认值： false 。

返回： <string>

返回 WHATWG URL 对象的 URL String 表示的可自定义序列化。

URL 对象具有 toString() 方法和 href 属性，它们返回 URL 的字符串序列化。然而，这些在任何方面都不可自定义。 url.format(URL[, options]) 方法允许对输出进行基本自定义。

import url from 'node:url';
const myURL = new URL('https://a:b@測試？abc#foo');

console.log(myURL.href);
// 打印 https://a:b@xn--g6w251d/?abc#foo

console.log(myURL.toString());
// 打印 https://a:b@xn--g6w251d/?abc#foo

console.log(url.format(myURL, { fragment: false, unicode: true, auth: false }));
// 打印 'https://測試/?abc'

JavaScript

url.pathToFileURL

History

url.pathToFileURL(path, options?): void

Attributes

path:<string>

要转换为文件 URL 的路径。

options:<Object>

windows:<boolean> | <undefined>

如果 path 应该被视为 windows 文件路径，则为 true ，如果是 posix 则为 false ，如果是系统默认值则为 undefined 。 默认值： undefined 。

返回： <URL> 文件 URL 对象。

此函数确保 path 被绝对解析，并且在转换为文件 URL 时正确编码 URL 控制字符。

import { pathToFileURL } from 'node:url';

new URL('/foo#1', 'file:');           // 不正确：file:///foo#1
pathToFileURL('/foo#1');              // 正确：file:///foo%231 (POSIX)

new URL('/some/path%.c', 'file:');    // 不正确：file:///some/path%.c
pathToFileURL('/some/path%.c');       // 正确：file:///some/path%25.c (POSIX)

JavaScript

url.urlToHttpOptions

History

url.urlToHttpOptions(url): void

Attributes

url:<URL>

要转换为选项对象的 WHATWG URL 对象。

返回： <Object> 选项对象

protocol:<string>

要使用的协议。

hostname:<string>

发出请求的服务器的域名或 IP 地址。

hash:<string>

URL 的片段部分。

search:<string>

URL 的序列化查询部分。

pathname:<string>

URL 的路径部分。

path:<string>

请求路径。如果有的话，应包括查询字符串。例如 '/index.html?page=12' 。当请求路径包含非法字符时会抛出异常。目前，只拒绝空格，但这将来可能会改变。

href:<string>

序列化的 URL。

port:<number>

远程服务器的端口。

auth:<string>

基本认证，即 'user:password' ，用于计算 Authorization 头。

此实用函数将 URL 对象转换为 http.request() 和 https.request() API 预期的普通选项对象。

import { urlToHttpOptions } from 'node:url';
const myURL = new URL('https://a:b@測試？abc#foo');

console.log(urlToHttpOptions(myURL));
/*
{
  protocol: 'https:',
  hostname: 'xn--g6w251d',
  hash: '#foo',
  search: '?abc',
  pathname: '/',
  path: '/?abc',
  href: 'https://a:b@xn--g6w251d/?abc#foo',
  auth: 'a:b'
}
*/

JavaScript

旧版 URL API

History

稳定性：3 - 旧版：请改用 WHATWG URL API。

旧版 `urlObject`

History

旧版 urlObject（require('node:url').Url 或 import { Url } from 'node:url'）由 url.parse() 函数创建并返回。

urlObject.auth

auth 属性是 URL 的用户名和密码部分，也称为 userinfo。此字符串子集跟随 protocol 和双斜杠（如果存在）之后，并在 host 组件之前，由 @ 分隔。该字符串要么是用户名，要么是由 : 分隔的用户名和密码。

例如：'user:pass'。

urlObject.hash

hash 属性是 URL 的片段标识符部分，包括前导 # 字符。

例如：'#hash'。

urlObject.host

host 属性是 URL 的完整小写主机部分，包括指定的 port（如果存在）。

例如：'sub.example.com:8080'。

urlObject.hostname

hostname 属性是 host 组件的小写主机名部分，不包含 port。

例如：'sub.example.com'。

urlObject.href

href 属性是完整的 URL 字符串，解析时 protocol 和 host 组件已转换为小写。

例如：'http://user:pass@sub.example.com:8080/p/a/t/h?query=string#hash'。

urlObject.path

path 属性是 pathname 和 search 组件的连接。

例如：'/p/a/t/h?query=string'。

不对 path 进行解码。

urlObject.pathname

pathname 属性由 URL 的整个路径部分组成。这是跟随 host（包括 port）之后且在 query 或 hash 组件开始之前的所有内容，由 ASCII 问号（?）或哈希（#）字符分隔。

例如：'/p/a/t/h'。

不对路径字符串进行解码。

urlObject.port

port 属性是 host 组件的数字端口部分。

例如：'8080'。

urlObject.protocol

protocol 属性标识 URL 的小写协议方案。

例如：'http:'。

urlObject.query

query 属性要么是不带前导 ASCII 问号（?）的查询字符串，要么是由 querystring 模块的 parse() 方法返回的对象。query 属性是字符串还是对象取决于传递给 url.parse() 的 parseQueryString 参数。

例如：'query=string' 或 {'query': 'string'}。

如果作为字符串返回，则不对查询字符串进行解码。如果作为对象返回，则键和值都会被解码。

urlObject.search

search 属性由 URL 的整个“查询字符串”部分组成，包括前导 ASCII 问号（?）字符。

例如：'?query=string'。

不对查询字符串进行解码。

urlObject.slashes

如果 protocol 中的冒号后面需要两个 ASCII 正斜杠字符（/），则 slashes 属性是一个值为 true 的 boolean。

url.format

History

url.format(urlObject): void

Attributes

urlObject:<Object>

一个 URL 对象（由 url.parse() 返回或以其他方式构造）。

url.format() 方法返回源自 urlObject 的格式化 URL 字符串。

const url = require('node:url');
url.format({
  protocol: 'https',
  hostname: 'example.com',
  pathname: '/some/path',
  query: {
    page: 1,
    format: 'json',
  },
});

// => 'https://example.com/some/path?page=1&format=json'

JavaScript

如果 urlObject 不是对象或字符串，url.format() 将抛出 TypeError。

格式化过程操作如下：

创建一个新的空字符串 result。
如果 urlObject.protocol 是字符串，则按原样附加到 result。
否则，如果 urlObject.protocol 不是 undefined 且不是字符串，则抛出 Error。
对于所有_不以_ ASCII 冒号（:）字符结尾的 urlObject.protocol 字符串值，字面字符串 : 将附加到 result。
如果以下任一条件为真，则字面字符串 // 将附加到 result：
- urlObject.slashes 属性为 true；
- urlObject.protocol 以 http、https、ftp、gopher 或 file 开头；
如果 urlObject.auth 属性的值为真值，并且 urlObject.host 或 urlObject.hostname 都不是 undefined，则 urlObject.auth 的值将被强制转换为字符串并附加到 result，后跟字面字符串 @。
如果 urlObject.host 属性为 undefined，则：
- 如果 urlObject.hostname 是字符串，则附加到 result。
- 否则，如果 urlObject.hostname 不是 undefined 且不是字符串，则抛出 Error。
- 如果 urlObject.port 属性值为真值，且 urlObject.hostname 不是 undefined：
  - 字面字符串 : 附加到 result，并且
  - urlObject.port 的值被强制转换为字符串并附加到 result。
否则，如果 urlObject.host 属性值为真值，则 urlObject.host 的值被强制转换为字符串并附加到 result。
如果 urlObject.pathname 属性是一个非空字符串：
- 如果 urlObject.pathname 不以 ASCII 正斜杠（/）开头，则字面字符串 '/' 附加到 result。
- urlObject.pathname 的值附加到 result。
否则，如果 urlObject.pathname 不是 undefined 且不是字符串，则抛出 Error。
如果 urlObject.search 属性为 undefined 且 urlObject.query 属性是 Object，则字面字符串 ? 附加到 result，后跟调用 querystring 模块的 stringify() 方法的输出，传递 urlObject.query 的值。
否则，如果 urlObject.search 是字符串：
- 如果 urlObject.search 的值_不以_ ASCII 问号（?）字符开头，则字面字符串 ? 附加到 result。
- urlObject.search 的值附加到 result。
否则，如果 urlObject.search 不是 undefined 且不是字符串，则抛出 Error。
如果 urlObject.hash 属性是字符串：
- 如果 urlObject.hash 的值_不以_ ASCII 哈希（#）字符开头，则字面字符串 # 附加到 result。
- urlObject.hash 的值附加到 result。
否则，如果 urlObject.hash 属性不是 undefined 且不是字符串，则抛出 Error。
返回 result。

自动迁移可用（来源）。

npx codemod@latest @nodejs/node-url-to-whatwg-url

url.format

History

url.format(urlString): void

稳定性：0 - 已弃用：请改用 WHATWG URL API。

Attributes

urlString:<string>

一个将传递给 url.parse() 然后格式化的字符串。

url.format(urlString) 是 url.format(url.parse(urlString)) 的简写。

因为它在内部调用了已弃用的 url.parse()，所以向 url.format() 传递字符串参数本身也是已弃用的。

可以通过构造一个新的 URL 对象并调用 url.toString() 来使用 WHATWG URL API 执行 URL 字符串的规范化。

import { URL } from 'node:url';

const unformatted = 'http://[fe80:0:0:0:0:0:0:1]:/a/b?a=b#abc';
const formatted = new URL(unformatted).toString();

console.log(formatted); // 打印：http://[fe80::1]/a/b?a=b#abc

JavaScript

url.parse

History

url.parse(urlString, parseQueryString?, slashesDenoteHost?): void

稳定性：0 - 已弃用：请改用 WHATWG URL API。

Attributes

urlString:<string>

要解析的 URL 字符串。

parseQueryString:<boolean>

如果为 true ， query 属性将始终设置为由 querystring 模块的 parse() 方法返回的对象。如果为 false ，返回的 URL 对象上的 query 属性将是未解析、未解码的字符串。 默认值： false 。

slashesDenoteHost:<boolean>

如果为 true ，字面字符串 // 之后的第一个令牌且在前一个 / 之前将被解释为 host 。例如，给定 //foo/bar ，结果将是 {host: 'foo', pathname: '/bar'} 而不是 {pathname: '//foo/bar'} 。 默认值： false 。

url.parse() 方法接受 URL 字符串，解析它，并返回 URL 对象。

如果 urlString 不是字符串，则抛出 TypeError。

如果 auth 属性存在但无法解码，则抛出 URIError。

url.parse() 使用一种宽松、非标准的算法来解析 URL 字符串。它容易出现安全问题，例如主机名欺骗以及用户名和密码的处理不正确。不要与不可信的输入一起使用。不会为 url.parse() 漏洞发布 CVE。请改用 WHATWG URL API，例如：

function getURL(req) {
  const proto = req.headers['x-forwarded-proto'] || 'https';
  const host = req.headers['x-forwarded-host'] || req.headers.host || 'example.com';
  return new URL(`${proto}://${host}${req.url || '/'}`);
}

JavaScript

上面的示例假设格式正确的头是从反向代理转发到你的 Node.js 服务器的。如果你没有使用反向代理，你应该使用下面的示例：

function getURL(req) {
  return new URL(`https://example.com${req.url || '/'}`);
}

JavaScript

自动迁移可用（来源）。

npx codemod@latest @nodejs/node-url-to-whatwg-url

url.resolve

History

url.resolve(from, to): void

稳定性：0 - 已弃用：请改用 WHATWG URL API。

Attributes

from:<string>

如果 to 是相对 URL，则使用的基础 URL。

to:<string>

要解析的目标 URL。

url.resolve() 方法以类似于 Web 浏览器解析锚标签的方式，相对于基础 URL 解析目标 URL。

const url = require('node:url');
url.resolve('/one/two/three', 'four');         // '/one/two/four'
url.resolve('http://example.com/', '/one');    // 'http://example.com/one'
url.resolve('http://example.com/one', '/two'); // 'http://example.com/two'

JavaScript

因为它在内部调用了已弃用的 url.parse()，所以 url.resolve() 本身也是已弃用的。

要使用 WHATWG URL API 实现相同的结果：

function resolve(from, to) {
  const resolvedUrl = new URL(to, new URL(from, 'resolve://'));
  if (resolvedUrl.protocol === 'resolve:') {
    // `from` 是一个相对 URL。
    const { pathname, search, hash } = resolvedUrl;
    return pathname + search + hash;
  }
  return resolvedUrl.toString();
}

resolve('/one/two/three', 'four');         // '/one/two/four'
resolve('http://example.com/', '/one');    // 'http://example.com/one'
resolve('http://example.com/one', '/two'); // 'http://example.com/two'

JavaScript

URL 中的百分号编码

URL 只允许包含特定范围的字符。任何超出该范围的字符都必须进行编码。如何编码这些字符以及编码哪些字符完全取决于字符在 URL 结构中的位置。

遗留 API

在遗留 API 中，空格（' '）和以下字符将在 URL 对象的属性中自动被转义：

< > " ` \r \n \t { } | \ ^ '

例如，ASCII 空格字符（' '）被编码为 %20。ASCII 正斜杠（/）字符被编码为 %3C。

WHATWG API

WHATWG URL 标准在选择编码字符方面比遗留 API 使用了更具选择性和更细粒度的方法。

WHATWG 算法定义了四个“百分号编码集”，用于描述必须进行了百分号编码的字符范围：

C0 控制百分号编码集 包括范围在 U+0000 到 U+001F（含）之间的代码点以及所有大于 U+007E（~）的代码点。
片段百分号编码集 包括 C0 控制百分号编码集 以及代码点 U+0020 SPACE、U+0022 (")、U+003C (<)、U+003E (>) 和 U+0060 (`)。
路径百分号编码集 包括 C0 控制百分号编码集 以及代码点 U+0020 SPACE、U+0022 (")、U+0023 (#)、U+003C (<)、U+003E (>)、U+003F (?)、U+0060 (`)、U+007B ({) 和 U+007D (})。
用户信息编码集 包括 路径百分号编码集 以及代码点 U+002F (/)、U+003A (:)、U+003B (;)、U+003D (=)、U+0040 (@)、U+005B ([) 到 U+005E(^) 以及 U+007C (|)。

用户信息百分号编码集 仅用于编码 URL 内的用户名和密码。路径百分号编码集 用于大多数 URL 的路径。片段百分号编码集 用于 URL 片段。C0 控制百分号编码集 用于主机和路径在某些特定条件下，以及所有其他情况。

当非 ASCII 字符出现在主机名中时，主机名使用 Punycode 算法进行编码。但请注意，主机名可能包含既 Punycode 编码又百分号编码的字符：

const myURL = new URL('https://%CF%80.example.com/foo');
console.log(myURL.href);
// 输出 https://xn--1xa.example.com/foo
console.log(myURL.origin);
// 输出 https://xn--1xa.example.com

JavaScript