arraypress/wp-url-utils 问题修复 & 功能扩展

解决BUG、新增功能、兼容多环境部署,快速响应你的开发需求

邮箱:yvsm@zunyunkeji.com | QQ:316430983 | 微信:yvsm316

arraypress/wp-url-utils

Composer 安装命令:

composer require arraypress/wp-url-utils

包简介

A lightweight WordPress library for comprehensive URL manipulation, validation, and connectivity checking with bulk operations support.

README 文档

README

Comprehensive URL manipulation, validation, connectivity checking, platform detection, and tracking parameter removal for WordPress. Clean APIs for single URLs, bulk operations, HTTP status checking, and privacy-focused URL cleaning.

Installation

composer require arraypress/wp-url-utils

Usage

Single URL Operations

use ArrayPress\URLUtils\URL;

// Validation and checks
$valid = URL::is_valid( 'https://example.com' );
$external = URL::is_external( 'https://google.com' );
$https = URL::is_https( 'https://example.com' );

// URL parts
$domain = URL::get_domain( 'https://example.com/path' ); // example.com
$path = URL::get_path( 'https://example.com/path' ); // /path
$extension = URL::get_extension( 'https://example.com/file.pdf' ); // pdf

// Scheme manipulation (Enhanced!)
$https_url = URL::to_https( 'example.com' );          // https://example.com
$http_url = URL::to_http( 'example.com' );            // http://example.com
$custom_scheme = URL::add_scheme( 'example.com', 'https' ); // https://example.com

// URL manipulation
$relative = URL::make_relative( 'https://mysite.com/page' ); // /page
$with_params = URL::add_params( $url, [ 'utm_source' => 'email' ] );

// File type detection
$is_image = URL::is_image( 'photo.jpg' );
$is_video = URL::is_video( 'movie.mp4', [ 'mp4', 'avi' ] ); // custom extensions

// Platform detection (NEW!)
$is_youtube = URL::is_video_platform( 'https://youtube.com/watch?v=abc123' );
$is_spotify = URL::is_audio_platform( 'https://open.spotify.com/track/123' );
$is_social = URL::is_social_platform( 'https://facebook.com/page' );

// UTM tracking
$tracked = URL::add_utm( $url, [
    'source' => 'newsletter',
    'campaign' => 'summer'
]);

$smart_utm = URL::add_utm_smart( $url, [ 'campaign' => 'promo' ] );
// Auto-detects WordPress context (admin screen, template, etc.)

// Sanitization
$clean = URL::sanitize( $_POST['website'] );
$html_safe = URL::sanitize_for_html( $user_url );

// Current page
$current = URL::current();
$clean_current = URL::current_clean(); // without query params

Advanced Platform Detection

// Enhanced media type detection
function get_media_type( $url ) {
    if ( URL::is_video( $url ) || URL::is_video_platform( $url ) ) {
        return 'video';  // Works for both .mp4 files AND YouTube/TikTok links
    }
    
    if ( URL::is_audio( $url ) || URL::is_audio_platform( $url ) ) {
        return 'audio';  // Works for both .mp3 files AND Spotify/podcast links
    }
    
    if ( URL::is_image( $url ) ) {
        return 'image';
    }
    
    if ( URL::is_social_platform( $url ) ) {
        return 'social';
    }
    
    return 'other';
}

// Content categorization
function categorize_urls( $urls ) {
    $categorized = [
        'video' => [],
        'audio' => [],
        'social' => [],
        'other' => []
    ];
    
    foreach ( $urls as $url ) {
        if ( URL::is_video_platform( $url ) ) {
            $categorized['video'][] = $url;
        } elseif ( URL::is_audio_platform( $url ) ) {
            $categorized['audio'][] = $url;
        } elseif ( URL::is_social_platform( $url ) ) {
            $categorized['social'][] = $url;
        } else {
            $categorized['other'][] = $url;
        }
    }
    
    return $categorized;
}

Bulk URL Operations

use ArrayPress\URLUtils\URLs;

$url_list = [
    'https://example.com',
    'http://test.com',
    'invalid-url',
    'https://mysite.com/page'
];

// Filtering
$valid_urls = URLs::filter_valid( $url_list );
$external = URLs::filter_by_location( $url_list, 'external' );
$https_only = URLs::filter_by_protocol( $url_list, 'https' );
$images = URLs::filter_by_type( $url_list, 'image' );

// Conversion
$all_https = URLs::to_https( $url_list );
$all_relative = URLs::make_relative( $url_list );

// Analysis
$domains = URLs::get_domains( $url_list );
$found_urls = URLs::extract( $text_content );
$unique = URLs::remove_duplicates( $url_list );

// Sanitization
$clean_urls = URLs::sanitize( $url_list );
$html_safe_urls = URLs::sanitize_for_html( $url_list );

URL Tracking Parameter Removal

use ArrayPress\URLUtils\Cleaner;

// Remove all tracking parameters (300+ supported)
$dirty_url = 'https://example.com/page?utm_source=facebook&fbclid=abc123&gclid=def456&product_id=789';
$clean_url = Cleaner::strip( $dirty_url );
// Result: https://example.com/page?product_id=789

// Bulk cleaning
$urls = [
    'https://shop.com/product?gclid=abc123&category=shoes',
    'https://blog.com/post?fbclid=def456&author=john&twclid=xyz'
];
$clean_urls = Cleaner::strip_multiple( $urls );

// Complete sanitization pipeline (validate, clean, deduplicate)
$messy_urls = [
    'https://example.com?utm_source=test&fbclid=123',
    'invalid-url',
    'https://example.com',  // duplicate after cleaning
    'https://shop.com?gclid=456&category=shoes'
];
$sanitized = Cleaner::sanitize( $messy_urls );
// Result: ['https://example.com', 'https://shop.com?category=shoes']

// Custom parameters and whitelisting
$custom_clean = Cleaner::strip( $url, ['my_tracker', 'internal_ref'] );
$keep_important = Cleaner::strip( $url, [], ['page', 'category', 'search'] );

// Check for tracking parameters
if ( Cleaner::has_tracking( $url ) ) {
    echo 'This URL contains tracking parameters';
}

HTTP Connectivity Checking

use ArrayPress\URLUtils\Checker;

// Single URL checks
$reachable = Checker::is_reachable( 'https://example.com' );
$status = Checker::get_status_code( 'https://example.com' ); // 200, 404, etc.
$final_url = Checker::get_final_url( 'https://bit.ly/short' ); // follows redirects
$downloadable = Checker::is_downloadable( 'https://example.com/file.pdf' );

// Comprehensive info
$info = Checker::get_info( 'https://example.com' );
// Returns: reachable, status_code, final_url, content_type, error

// Bulk checking
$results = Checker::check_multiple( $url_list );
$working_urls = Checker::filter_reachable( $url_list );

// Configuration
Checker::set_default_timeout( 15 ); // seconds

Common Use Cases

Enhanced scheme handling:

// Now handles domain-only URLs!
$secure_urls = [
    URL::to_https( 'example.com' ),           // https://example.com
    URL::to_http( 'secure-site.com' ),       // http://secure-site.com
    URL::add_scheme( 'api.site.com', 'https' ), // https://api.site.com
];

Smart content processing:

// Categorize URLs by platform type
function process_content_urls( $content ) {
    $urls = URLs::extract( $content );
    
    foreach ( $urls as $url ) {
        if ( URL::is_video_platform( $url ) ) {
            // Handle video embeds
            $content = add_video_wrapper( $content, $url );
        } elseif ( URL::is_audio_platform( $url ) ) {
            // Handle audio players
            $content = add_audio_player( $content, $url );
        } elseif ( URL::is_social_platform( $url ) ) {
            // Handle social embeds
            $content = add_social_embed( $content, $url );
        }
        
        // Clean tracking parameters from all URLs
        $clean_url = Cleaner::strip( $url );
        $content = str_replace( $url, $clean_url, $content );
    }
    
    return $content;
}

Privacy-focused URL sharing:

// Clean URLs before sharing to protect user privacy
$share_url = Cleaner::strip( $_POST['url'] );
$safe_url = URL::sanitize_for_html( $share_url );

Content management:

// Clean tracking from imported content
function clean_imported_content( $content ) {
    $urls = URLs::extract( $content );
    foreach ( $urls as $url ) {
        $clean_url = Cleaner::strip( $url );
        $content = str_replace( $url, $clean_url, $content );
    }
    return $content;
}

Form validation with cleaning:

$website = Cleaner::strip( $_POST['website'] );
$website = URL::sanitize( $website );
if ( ! URL::is_valid( $website ) ) {
    $errors[] = 'Invalid website URL';
}

Link checking:

$broken_links = [];
foreach ( $links as $link ) {
    if ( ! Checker::is_reachable( $link, 5 ) ) {
        $broken_links[] = $link;
    }
}

Content processing:

$found_urls = URLs::extract( $post_content );
$external_urls = URLs::filter_by_location( $found_urls, 'external' );
$clean_urls = Cleaner::strip_multiple( $external_urls );
$safe_urls = URLs::sanitize_for_html( $clean_urls );

UTM campaign tracking:

$campaign_url = URL::add_utm_smart( 'https://mysite.com/sale', [
    'source' => 'email',
    'campaign' => 'black-friday'
]);
// Automatically adds medium based on WordPress context

WordPress Integration Examples:

// Categorize and clean URLs when saving posts
function process_post_urls( $content ) {
    $urls = URLs::extract( $content );
    
    foreach ( $urls as $url ) {
        // Clean tracking parameters
        $clean_url = Cleaner::strip( $url );
        
        // Ensure HTTPS for external links
        if ( URL::is_external( $clean_url ) ) {
            $clean_url = URL::to_https( $clean_url );
        }
        
        // Replace in content
        $content = str_replace( $url, $clean_url, $content );
    }
    
    return $content;
}
add_filter( 'content_save_pre', 'process_post_urls' );

// Auto-embed platform content
function auto_embed_platforms( $content ) {
    $urls = URLs::extract( $content );
    
    foreach ( $urls as $url ) {
        if ( URL::is_video_platform( $url ) && ! URL::supports_oembed( $url ) ) {
            // Custom video embed logic
        } elseif ( URL::is_audio_platform( $url ) ) {
            // Custom audio player logic
        }
    }
    
    return $content;
}
add_filter( 'the_content', 'auto_embed_platforms' );

// Enhanced URL processing pipeline
$processed_url = URL::sanitize_for_html(
    URL::to_https(
        Cleaner::strip( $original_url )
    )
);

Platform Coverage

Video Platforms (30+ supported)

Major: YouTube, Vimeo, TikTok, Instagram, Facebook Watch
Streaming: Twitch, Dailymotion, Rumble, BitChute
Business: Wistia, JW Player, Brightcove, Vidyard, Loom
International: Youku, Bilibili (Chinese), VK, OK.ru (Russian)
Other: Twitter videos, Reddit videos, Archive.org

Audio Platforms (25+ supported)

Music: Spotify, Apple Music, YouTube Music, Deezer, Tidal, SoundCloud
Podcasts: Apple Podcasts, Google Podcasts, Anchor, Overcast, Stitcher
Radio: TuneIn, iHeart Radio, Pandora
International: NetEase Music, QQ Music (Chinese)
Other: Bandcamp, AudioMack, Last.fm, Mixcloud

Social Media Platforms (40+ supported)

Major: Facebook, Twitter/X, Instagram, LinkedIn, TikTok, Snapchat
Messaging: WhatsApp, Telegram, Discord, WeChat, LINE
Communities: Reddit, Pinterest, Tumblr, Mastodon, Threads
Professional: Behance, Dribbble, Stack Overflow
International: VK, Weibo, Xiaohongshu (Chinese)
Other: Nextdoor, Goodreads, Flickr, Meetup

Tracking Parameter Coverage

The Cleaner class removes 300+ tracking parameters from:

Major Platforms:

  • Google (Analytics, Ads, DoubleClick)
  • Facebook, Instagram, Twitter/X, TikTok, LinkedIn, Snapchat
  • Amazon, eBay, AliExpress, Walmart, Best Buy
  • YouTube, Spotify, Netflix, Twitch

Marketing Tools:

  • Email marketing (Mailchimp, HubSpot, Klaviyo)
  • Analytics (Matomo, Piwik, Adobe)
  • Affiliate networks (Impact Radius, Commission Junction)
  • Chinese platforms (Taobao, Bilibili, Xiaohongshu)

News & Media:

  • NY Times, Forbes, Reuters, TechCrunch, BBC, CNN

International:

  • Yandex, Seznam, German/French/Czech sites

All Methods

URL Class

  • is_valid(), is_external(), is_https(), is_same_domain()
  • get_domain(), get_path(), get_query(), get_extension()
  • add_params(), remove_params(), add_scheme(), to_https(), to_http(), make_relative()
  • is_image(), is_video(), is_audio() (with custom extensions)
  • is_video_platform(), is_audio_platform(), is_social_platform()NEW
  • add_utm(), add_utm_smart(), sanitize(), current()

URLs Class

  • validate(), filter_valid(), filter_invalid()
  • filter_by_location(), filter_by_protocol(), filter_by_type()
  • to_https(), make_relative(), get_domains()
  • extract(), remove_duplicates(), sanitize()

Cleaner Class ⭐ NEW

  • strip() - Remove tracking parameters from single URL
  • strip_multiple() - Bulk tracking parameter removal
  • sanitize() - Complete pipeline: validate, clean, deduplicate
  • has_tracking() - Check if URL contains tracking parameters
  • add_params() - Extend tracking parameter list
  • get_params() - View all tracked parameters

Checker Class

  • is_reachable(), get_status_code(), get_final_url()
  • is_downloadable(), get_info()
  • check_multiple(), filter_reachable()
  • set_default_timeout()

Requirements

  • PHP 7.4+
  • WordPress 5.0+

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

This project is licensed under the GPL-2.0-or-later License.

Support

统计信息

  • 总下载量: 7
  • 月度下载量: 0
  • 日度下载量: 0
  • 收藏数: 0
  • 点击次数: 1
  • 依赖项目数: 0
  • 推荐数: 0

GitHub 信息

  • Stars: 0
  • Watchers: 0
  • Forks: 0
  • 开发语言: PHP

其他信息

  • 授权协议: GPL-2.0-or-later
  • 更新时间: 2025-09-02