Reverse proxying with Apache and mod_proxy_html

I've been fighting to get some reverse proxy things working today at work. Some Python application servers that speak HTTP live on servers with private IP addresses behind the firewall, but they need to be reachable from the outside world via an HTTPS portal that does authentication checking with mod_authnz_ldap. In short, https://example.com/app1/ needs to go to http://app1:8888/. I figured out much of what is below with the help of http://www.apachetutor.org/admin/reverseproxies.

Apache's mod_proxy seemed like it would be simple enough to use, and 2 lines of config file changes later the first page was working. However, redirects from the app servers were sending clients to internal addresses, which didn't work, and absolute URLs in HTML from the app servers needed to be changed to include the /app1/ prefix used on the externally facing server. Enter mod_proxy_html.
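For reference, a two-line mod_proxy setup of this kind usually looks something like the sketch below. It is not the exact config from the portal: the /app1/ path and the app1:8888 backend are taken from the example above, it assumes mod_proxy and mod_proxy_http are loaded, and whether the original two lines included ProxyPassReverse is an assumption.

```apache
# Sketch only: forward /app1/ on the public server to the internal app server.
ProxyPass        /app1/ http://app1:8888/
# Rewrite Location headers in backend redirects that point at http://app1:8888/
# so that clients are sent back to /app1/ on the proxy.
ProxyPassReverse /app1/ http://app1:8888/
```

ProxyPassReverse only adjusts response headers such as Location. It does not touch links inside HTML bodies, and it will not catch redirects that name the backend under a different hostname than the one configured here.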

mod_proxy_html is a third party module that allows content modification including replacing link addresses with different addresses. I downloaded and installed it on the proxy server but it wasn't working. Turning up debugging with

LogLevel debug
ProxyHTMLLogVerbose On

gave me the following message: "No links configured: nothing for proxy-html filter to do". Google had only one result for it: mod_proxy_html.c, the source code for mod_proxy_html with the error message in it! It turns out that much of the documentation for mod_proxy_html is out of date: in mod_proxy_html 3.0 the link tag definitions were removed from the code and must be included in the configuration. Had I looked at the config file provided with the download (instead of the one I'd been writing from howtos), this wouldn't have happened, but it's surprising Google hasn't indexed anyone else running into this! The fix was to include the following in my config:

ProxyHTMLLinks  a               href
ProxyHTMLLinks  area            href
ProxyHTMLLinks  link            href
ProxyHTMLLinks  img             src longdesc usemap
ProxyHTMLLinks  object          classid codebase data usemap
ProxyHTMLLinks  q               cite
ProxyHTMLLinks  blockquote      cite
ProxyHTMLLinks  ins             cite
ProxyHTMLLinks  del             cite
ProxyHTMLLinks  form            action
ProxyHTMLLinks  input           src usemap
ProxyHTMLLinks  head            profile
ProxyHTMLLinks  base            href
ProxyHTMLLinks  script          src for
ProxyHTMLLinks  iframe          src

ProxyHTMLEvents onclick ondblclick onmousedown onmouseup \
                onmouseover onmousemove onmouseout onkeypress \
                onkeydown onkeyup onfocus onblur onload \
                onunload onsubmit onreset onselect onchange
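The directives above only tell the filter which attributes contain links. The filter still has to be enabled and given URL mappings. A minimal sketch of that wiring, assuming the /app1/ path and app1:8888 backend from the example at the top and that mod_headers is loaded, might look like this (the exact maps you need depend on what URLs the backend emits, and rule order matters, so check the mod_proxy_html documentation):

```apache
# Sketch only: apply the proxy-html filter to one proxied application.
<Location /app1/>
    ProxyPass        http://app1:8888/
    ProxyPassReverse http://app1:8888/
    # Run responses through mod_proxy_html.
    SetOutputFilter  proxy-html
    # Map absolute links to the backend onto the public path.
    ProxyHTMLURLMap  http://app1:8888 /app1
    # Map root-relative links so they live under /app1/.
    ProxyHTMLURLMap  / /app1/
    # Strip Accept-Encoding from the request sent to the backend so it
    # replies uncompressed and the filter sees plain HTML.
    RequestHeader    unset Accept-Encoding
</Location>
```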

An Apache restart later, and HTML links were getting rewritten. Neat! On to the next problem: the app servers in question have lots of hardcoded absolute URLs, many of them in CSS and JS files. The mod_proxy_html technical guide offers an initial solution to this, using a regular expression like:

ProxyHTMLURLMap url\(http://internal.example.com([^\)]*)\) url(http://proxy.example.com$1) Rihe

However, this only works on inline CSS, because mod_proxy_html only processes HTML content types and not the text/css that CSS files are sent as. A workaround is to set the PROXY_HTML_FORCE environment variable, but in addition to forcing mod_proxy_html to look at CSS files, this forces it to process image files and so on, which uses up too much CPU for our use case. Doh!
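One alternative that was not tried here is to leave mod_proxy_html out of the picture for stylesheets and do a plain text substitution with mod_substitute (available from Apache 2.2.7). The sketch below is an assumption-laden illustration, not a tested fix: it assumes stylesheet URLs end in .css, that mod_substitute and mod_headers are loaded, and it reuses the placeholder hostnames from the regular expression above.

```apache
# Sketch only: rewrite backend URLs in CSS files with mod_substitute.
<LocationMatch "^/app1/.*\.css$">
    # Run only stylesheet responses through the substitution filter.
    SetOutputFilter SUBSTITUTE
    # n = treat the pattern as a fixed string, i = case-insensitive match.
    Substitute "s|http://internal.example.com|http://proxy.example.com|ni"
    # Ask the backend for uncompressed responses so the filter sees plain text.
    RequestHeader unset Accept-Encoding
</LocationMatch>
```

Because the filter is limited by URL pattern, images and other binary responses are not processed, which avoids the CPU cost described above.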

Setting up each application server as a vhost instead is a lot simpler (the 2 lines of config I started with here are enough). While it's less than ideal, we have wildcard SSL certificates, so having https://app1.example.com/ isn't the end of the world and doesn't require any additional IP addresses.
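A vhost-per-application setup along those lines could be sketched as follows. The hostname follows the https://app1.example.com/ example above, the certificate paths are placeholders, and the LDAP authentication from the original portal is left out. On Apache 2.2 a matching NameVirtualHost *:443 line is also needed.

```apache
# Sketch only: one name-based virtual host per application server.
<VirtualHost *:443>
    ServerName app1.example.com

    # TLS with the wildcard certificate; both paths are placeholders.
    SSLEngine on
    SSLCertificateFile    /path/to/wildcard.example.com.crt
    SSLCertificateKeyFile /path/to/wildcard.example.com.key

    # The app sits at the root of its own hostname, so root-relative
    # URLs resolve without any path rewriting.
    ProxyPass        / http://app1:8888/
    ProxyPassReverse / http://app1:8888/
</VirtualHost>
```

Since the public path and the backend path are now identical, root-relative URLs such as /static/site.css resolve without mod_proxy_html. Absolute URLs that name the internal host would still need attention.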

The above text was really

The above text was really helpful and did the trick for me as well. Thanks a lot! However, I did not really get the last part. What problem is solved by setting up each application server as a vhost? Can you explain this a bit more?

Hans

Glad it helped you! Instead

Glad it helped you! Instead of doing http://internal.example.com/app1 and /app2, doing separate vhosts on the proxy server as: http://internal-app1.example.com/ and http://internal-app2.example.com/ allows everything to run through the proxy with minimal configuration overhead.

Any chance you've run into

Any chance you've run into the thing where mod_proxy_html translates "&" to "&amp;", causing badly formatted 3rd party products to display a bunch of &nbsp; instead of spaces?

nope. Haven't seen this one,

nope. Haven't seen this one, but I did end up doing vhosts per app instead of using mod_proxy_html so I haven't really played with it much :)

...and further travels found

...and further travels found the answer, but it isn't pretty.

In mod_proxy_html.c, I commented out the "case '&' : FLUSH ; ap_fputs(ctx->f->next, ctx->bb, "&amp;") ; break ;" line and rebuilt the module.

Ugh, I've totally run into

Ugh, I've totally run into that & -> &amp; problem. It's making this horrid Dot Net Nuke site I'm trying to tuck away render the odd page like garbage. In my case, these "&" were part of "&nbsp;" entities, so now I've got &nbsp; showing up in a few places and some other probably related issues. /cry

I guess commenting out the line in the code is a solution, but I'm not set up to build mod_proxy_html and don't really want to go down that path. But I want to use the damned thing.

Thanks for posting your

Thanks for posting your experiences with mod_proxy_html. This was a big help to me. I will try and pay it forward.

You rock, thanks. It is a bit

You rock, thanks. It is a bit odd that all these mod_proxy_html tutorials don't mention that "small tidbit". Perhaps because Apache 2.2 ships with it by default? (I was working on Apache 2.0)

Hello all, I have 2 machines,

Hello all,
I have 2 machines: one is an Apache proxy server and the other is a file server that lets me access my files online. The file server sits behind the proxy. I can reach the file server through the proxy and the login page loads fine. However, the software I am using to serve my files (Serv-U FTP) relies on some CSS and JavaScript files and cookies, and for some reason (I am sure it's because of my proxy configuration) these scripts or images are not loading correctly, which prevents me from accessing the server to get any files. I would like someone who is more familiar with proxies, and specifically Apache, to show me where I am going wrong. Just to clarify: I am using this third-party software (Serv-U FTP) in HTTP mode only, with all of its FTP functionality disabled. Below I have attached my httpd.conf configuration file for your review. Please help, as I am not very familiar with Apache yet.

Maybe someone can download the trial version of the Serv-U FTP software, simulate this situation, and tell me why it's not running right?

Proxy running on IP 192.168.1.16 port # 80
Fileserver running on IP 192.168.1.9 port # 8000

Thank you all in advance !!!


#
# This is the main Apache HTTP server configuration file. It contains the
# configuration directives that give the server its instructions.
# See <URL:http://httpd.apache.org/docs/2.2> for detailed information.
# In particular, see
# <URL:http://httpd.apache.org/docs/2.2/mod/directives.html>
# for a discussion of each configuration directive.
#
# Do NOT simply read the instructions in here without understanding
# what they do. They're here only as hints or reminders. If you are unsure
# consult the online docs. You have been warned.
#
# Configuration and logfile names: If the filenames you specify for many
# of the server's control files begin with "/" (or "drive:/" for Win32), the
# server will use that explicit path. If the filenames do *not* begin
# with "/", the value of ServerRoot is prepended -- so "logs/foo.log"
# with ServerRoot set to "C:/Program Files/Apache Software Foundation/Apache2.2" will be interpreted by the
# server as "C:/Program Files/Apache Software Foundation/Apache2.2/logs/foo.log".
#
# NOTE: Where filenames are specified, you must use forward slashes
# instead of backslashes (e.g., "c:/apache" instead of "c:\apache").
# If a drive letter is omitted, the drive on which httpd.exe is located
# will be used by default. It is recommended that you always supply
# an explicit drive letter in absolute paths to avoid confusion.

#
# ServerRoot: The top of the directory tree under which the server's
# configuration, error, and log files are kept.
#
# Do not add a slash at the end of the directory path. If you point
# ServerRoot at a non-local disk, be sure to point the LockFile directive
# at a local disk. If you wish to share the same ServerRoot for multiple
# httpd daemons, you will need to change at least LockFile and PidFile.
#
ServerRoot "C:/Program Files/Apache Software Foundation/Apache2.2"

#
# Listen: Allows you to bind Apache to specific IP addresses and/or
# ports, instead of the default. See also the <VirtualHost>
# directive.
#
# Change this to Listen on specific IP addresses as shown below to
# prevent Apache from glomming onto all bound IP addresses.
#
Listen 192.168.1.16:80
#Listen 9000

#
# Dynamic Shared Object (DSO) Support
#
# To be able to use the functionality of a module which was built as a DSO you
# have to place corresponding `LoadModule' lines at this location so the
# directives contained in it are actually available _before_ they are used.
# Statically compiled modules (those listed by `httpd -l') do not need
# to be loaded here.
#
# Example:
# LoadModule foo_module modules/mod_foo.so
#
LoadModule actions_module modules/mod_actions.so
LoadModule alias_module modules/mod_alias.so
LoadModule asis_module modules/mod_asis.so
LoadModule auth_basic_module modules/mod_auth_basic.so
#LoadModule auth_digest_module modules/mod_auth_digest.so
#LoadModule authn_alias_module modules/mod_authn_alias.so
#LoadModule authn_anon_module modules/mod_authn_anon.so
#LoadModule authn_dbd_module modules/mod_authn_dbd.so
#LoadModule authn_dbm_module modules/mod_authn_dbm.so
LoadModule authn_default_module modules/mod_authn_default.so
LoadModule authn_file_module modules/mod_authn_file.so
#LoadModule authnz_ldap_module modules/mod_authnz_ldap.so
#LoadModule authz_dbm_module modules/mod_authz_dbm.so
LoadModule authz_default_module modules/mod_authz_default.so
LoadModule authz_groupfile_module modules/mod_authz_groupfile.so
LoadModule authz_host_module modules/mod_authz_host.so
#LoadModule authz_owner_module modules/mod_authz_owner.so
LoadModule authz_user_module modules/mod_authz_user.so
LoadModule autoindex_module modules/mod_autoindex.so
#LoadModule cache_module modules/mod_cache.so
#LoadModule cern_meta_module modules/mod_cern_meta.so
LoadModule cgi_module modules/mod_cgi.so
#LoadModule charset_lite_module modules/mod_charset_lite.so
#LoadModule dav_module modules/mod_dav.so
#LoadModule dav_fs_module modules/mod_dav_fs.so
#LoadModule dav_lock_module modules/mod_dav_lock.so
#LoadModule dbd_module modules/mod_dbd.so
LoadModule deflate_module modules/mod_deflate.so
LoadModule dir_module modules/mod_dir.so
#LoadModule disk_cache_module modules/mod_disk_cache.so
#LoadModule dumpio_module modules/mod_dumpio.so
LoadModule env_module modules/mod_env.so
#LoadModule expires_module modules/mod_expires.so
#LoadModule ext_filter_module modules/mod_ext_filter.so
#LoadModule file_cache_module modules/mod_file_cache.so
#LoadModule filter_module modules/mod_filter.so
LoadModule headers_module modules/mod_headers.so
#LoadModule ident_module modules/mod_ident.so
#LoadModule imagemap_module modules/mod_imagemap.so
LoadModule include_module modules/mod_include.so
#LoadModule info_module modules/mod_info.so
LoadModule isapi_module modules/mod_isapi.so
#LoadModule ldap_module modules/mod_ldap.so
#LoadModule logio_module modules/mod_logio.so
LoadModule log_config_module modules/mod_log_config.so
#LoadModule log_forensic_module modules/mod_log_forensic.so
#LoadModule mem_cache_module modules/mod_mem_cache.so
LoadModule mime_module modules/mod_mime.so
#LoadModule mime_magic_module modules/mod_mime_magic.so
LoadModule negotiation_module modules/mod_negotiation.so
LoadModule proxy_module modules/mod_proxy.so
#LoadModule proxy_ajp_module modules/mod_proxy_ajp.so
#LoadModule proxy_balancer_module modules/mod_proxy_balancer.so
#LoadModule proxy_connect_module modules/mod_proxy_connect.so
#LoadModule proxy_ftp_module modules/mod_proxy_ftp.so
LoadModule proxy_http_module modules/mod_proxy_http.so
LoadModule rewrite_module modules/mod_rewrite.so
LoadModule setenvif_module modules/mod_setenvif.so
LoadModule speling_module modules/mod_speling.so
#LoadModule ssl_module modules/mod_ssl.so
#LoadModule status_module modules/mod_status.so
#LoadModule substitute_module modules/mod_substitute.so
#LoadModule unique_id_module modules/mod_unique_id.so
#LoadModule userdir_module modules/mod_userdir.so
#LoadModule usertrack_module modules/mod_usertrack.so
#LoadModule version_module modules/mod_version.so
#LoadModule vhost_alias_module modules/mod_vhost_alias.so
LoadFile "C:\Program Files\Apache Software Foundation\Apache2.2\bin\iconv.dll"
LoadFile "C:\Program Files\Apache Software Foundation\Apache2.2\lib\zlib.dll"
LoadFile "C:\Program Files\Apache Software Foundation\Apache2.2\bin\libxml2.dll"
LoadModule proxy_html_module modules/mod_proxy_html.so

ProxyRequests Off

Order deny,allow
Allow from all

#
# If you wish httpd to run as a different user or group, you must run
# httpd as root initially and it will switch.
#
# User/Group: The name (or #number) of the user/group to run httpd as.
# It is usually good practice to create a dedicated user and group for
# running httpd, as with most system services.
#
User daemon
Group daemon

# 'Main' server configuration
#
# The directives in this section set up the values used by the 'main'
# server, which responds to any requests that aren't handled by a
# <VirtualHost> definition. These values also provide defaults for
# any <VirtualHost> containers you may define later in the file.
#
# All of these directives may appear inside <VirtualHost> containers,
# in which case these default settings will be overridden for the
# virtual host being defined.
#

#
# ServerAdmin: Your address, where problems with the server should be
# e-mailed. This address appears on some server-generated pages, such
# as error documents. e.g. admin@your-domain.com
#
ServerAdmin Admin@xxxxxxxxxxxxxxxxxx.com

#
# ServerName gives the name and port that the server uses to identify itself.
# This can often be determined automatically, but we recommend you specify
# it explicitly to prevent problems during startup.
#
# If your host doesn't have a registered DNS name, enter its IP address here.
#
ServerName www.xxxxxxxxxxxxxxxxxx.com:80

#
# DocumentRoot: The directory out of which you will serve your
# documents. By default, all requests are taken from this directory, but
# symbolic links and aliases may be used to point to other locations.
#
#**********************************************************************************DocumentRoot "C:/Root"

#
# Each directory to which Apache has access can be configured with respect
# to which services and features are allowed and/or disabled in that
# directory (and its subdirectories).
#
# First, we configure the "default" to be a very restrictive set of
# features.
#

<Directory />
    Options FollowSymLinks
    AllowOverride None
    Order deny,allow
    Deny from all
</Directory>

#
# Note that from this point forward you must specifically allow
# particular features to be enabled - so if something's not working as
# you might expect, make sure that you have specifically enabled it
# below.
#

#
# This should be changed to whatever you set DocumentRoot to.
#

#
# Possible values for the Options directive are "None", "All",
# or any combination of:
# Indexes Includes FollowSymLinks SymLinksifOwnerMatch ExecCGI MultiViews
#
# Note that "MultiViews" must be named *explicitly* --- "Options All"
# doesn't give it to you.
#
# The Options directive is both complicated and important. Please see
# http://httpd.apache.org/docs/2.2/mod/core.html#options
# for more information.
#
Options Indexes FollowSymLinks

#
# AllowOverride controls what directives may be placed in .htaccess files.
# It can be "All", "None", or any combination of the keywords:
# Options FileInfo AuthConfig Limit
#
AllowOverride None

#
# Controls who can get stuff from this server.
#
Order allow,deny
Allow from all

#
# DirectoryIndex: sets the file that Apache will serve if a directory
# is requested.
#

DirectoryIndex index.html

#
# The following lines prevent .htaccess and .htpasswd files from being
# viewed by Web clients.
#

<FilesMatch "^\.ht">
    Order allow,deny
    Deny from all
    Satisfy All
</FilesMatch>

#
# ErrorLog: The location of the error log file.
# If you do not specify an ErrorLog directive within a <VirtualHost>
# container, error messages relating to that virtual host will be
# logged here. If you *do* define an error logfile for a <VirtualHost>
# container, that host's errors will be logged there and not here.
#
ErrorLog "logs/error.log"

#
# LogLevel: Control the number of messages logged to the error_log.
# Possible values include: debug, info, notice, warn, error, crit,
# alert, emerg.
#
LogLevel warn

#
# The following directives define some format nicknames for use with
# a CustomLog directive (see below).
#
LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\"" combined
LogFormat "%h %l %u %t \"%r\" %>s %b" common

# You need to enable mod_logio.c to use %I and %O
LogFormat "%h %l %u %t \"%r\" %>s %b \"%{Referer}i\" \"%{User-Agent}i\" %I %O" combinedio

#
# The location and format of the access logfile (Common Logfile Format).
# If you do not define any access logfiles within a <VirtualHost>
# container, they will be logged here. Contrariwise, if you *do*
# define per-<VirtualHost> access logfiles, transactions will be
# logged therein and *not* in this file.
#
CustomLog "logs/access.log" common

#
# If you prefer a logfile with access, agent, and referer information
# (Combined Logfile Format) you can use the following directive.
#
#CustomLog "logs/access.log" combined

#
# Redirect: Allows you to tell clients about documents that used to
# exist in your server's namespace, but do not anymore. The client
# will make a new request for the document at its new location.
# Example:
# Redirect permanent /foo http://www.xxxxxxxxxxxxxxxxxx.com/bar

#
# Alias: Maps web paths into filesystem paths and is used to
# access content that does not live under the DocumentRoot.
# Example:
# Alias /webpath /full/filesystem/path
#
# If you include a trailing / on /webpath then the server will
# require it to be present in the URL. You will also likely
# need to provide a <Directory> section to allow access to
# the filesystem path.

#
# ScriptAlias: This controls which directories contain server scripts.
# ScriptAliases are essentially the same as Aliases, except that
# documents in the target directory are treated as applications and
# run by the server when requested rather than as documents sent to the
# client. The same rules about trailing "/" apply to ScriptAlias
# directives as to Alias.
#
ScriptAlias /cgi-bin/ "C:/Program Files/Apache Software Foundation/Apache2.2/cgi-bin/"

#
# ScriptSock: On threaded servers, designate the path to the UNIX
# socket used to communicate with the CGI daemon of mod_cgid.
#
#Scriptsock logs/cgisock

#
# "C:/Program Files/Apache Software Foundation/Apache2.2/cgi-bin" should be changed to whatever your ScriptAliased
# CGI directory exists, if you have that configured.
#

<Directory "C:/Program Files/Apache Software Foundation/Apache2.2/cgi-bin">
    AllowOverride None
    Options None
    Order allow,deny
    Allow from all
</Directory>

#
# DefaultType: the default MIME type the server will use for a document
# if it cannot otherwise determine one, such as from filename extensions.
# If your server contains mostly text or HTML documents, "text/plain" is
# a good value. If most of your content is binary, such as applications
# or images, you may want to use "application/octet-stream" instead to
# keep browsers from trying to display binary files as though they are
# text.
#
DefaultType text/plain

#
# TypesConfig points to the file containing the list of mappings from
# filename extension to MIME-type.
#
TypesConfig conf/mime.types

#
# AddType allows you to add to or override the MIME configuration
# file specified in TypesConfig for specific file types.
#
#AddType application/x-gzip .tgz
#
# AddEncoding allows you to have certain browsers uncompress
# information on the fly. Note: Not all browsers support this.
#
#AddEncoding x-compress .Z
#AddEncoding x-gzip .gz .tgz
#
# If the AddEncoding directives above are commented-out, then you
# probably should define those extensions to indicate media types:
#
AddType application/x-compress .Z
AddType application/x-gzip .gz .tgz

#
# AddHandler allows you to map certain file extensions to "handlers":
# actions unrelated to filetype. These can be either built into the server
# or added with the Action directive (see below)
#
# To use CGI scripts outside of ScriptAliased directories:
# (You will also need to add "ExecCGI" to the "Options" directive.)
#
AddHandler cgi-script .cgi

# For type maps (negotiated resources):
#AddHandler type-map var

#
# Filters allow you to process content before it is sent to the client.
#
# To parse .shtml files for server-side includes (SSI):
# (You will also need to add "Includes" to the "Options" directive.)
#
#AddType text/html .shtml
#AddOutputFilter INCLUDES .shtml

#
# The mod_mime_magic module allows the server to use various hints from the
# contents of the file itself to determine its type. The MIMEMagicFile
# directive tells the module where the hint definitions are located.
#
#MIMEMagicFile conf/magic

#
# Customizable error responses come in three flavors:
# 1) plain text 2) local redirects 3) external redirects
#
# Some examples:
#ErrorDocument 500 "The server made a boo boo."
#ErrorDocument 404 /missing.html
#ErrorDocument 404 "/cgi-bin/missing_handler.pl"
#ErrorDocument 402 http://www.xxxxxxxxxxxxxxxxxx.com/subscription_info.html
#

#
# EnableMMAP and EnableSendfile: On systems that support it,
# memory-mapping or the sendfile syscall is used to deliver
# files. This usually improves server performance, but must
# be turned off when serving from networked-mounted
# filesystems or if support for these functions is otherwise
# broken on your system.
#
EnableMMAP off
EnableSendfile off

# Supplemental configuration
#
# The configuration files in the conf/extra/ directory can be
# included to add extra features or to modify the default configuration of
# the server, or you may simply copy their contents here and change as
# necessary.

# Server-pool management (MPM specific)
Include conf/extra/httpd-mpm.conf

# Multi-language error messages
#Include conf/extra/httpd-multilang-errordoc.conf

# Fancy directory listings
#Include conf/extra/httpd-autoindex.conf

# Language settings
#Include conf/extra/httpd-languages.conf

# User home directories
#Include conf/extra/httpd-userdir.conf

# Real-time info on requests and configuration
#Include conf/extra/httpd-info.conf

Options ExecCGI

#DeflateCompressionLevel 4

# Virtual hosts
#*******************************************************xxxxxxxxxxxxxxxxxx V HOST****************************************************
NameVirtualHost 192.168.1.16

ServerName xxxxxxxxxxxxxxxxxx.com
#ServerAlias xxxxxxxxxxxxxxxxxx.com http://xxxxxxxxxxxxxxxxxx.com
DocumentRoot "c:/root/MAINNNNNNNNNNNNNNNNNN"

RewriteEngine on
RewriteCond %{HTTP_HOST} ^xxxxxxxxxxxxxxxxxx\.com
RewriteRule ^(.*)$ http://www.xxxxxxxxxxxxxxxxxx.com$1 [R=permanent,L]

#CacheDirLength 4
#CacheDirLevels 5
#CacheMaxFileSize 10000000
#CacheMinFileSize 1
#CacheRoot c:/Root\Cache

#SetOutputFilter DEFLATE
#SetEnvIfNoCase Request_URI \.(?:gif|jpe?g|png|jpg)$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.(?:exe|t?gz|zip|bz2|sit|rar)$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.pdf$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.avi$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mov$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mp3$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mp4$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.rm$ no-gzip dont-vary

#speed server by turning off ETags
#Header unset ETag
#FileETag None

#speed Server by compressing .js files
#
#
#SetOutputFilter DEFLATE
#
#

#speed server by turning off headers
#
#Header unset Last-Modified
#

#speed Server by adding future expiration time
#
#Header set Cache-Control "public"
#Header set Expires "Thu, 15 Apr 2010 20:00:00 GMT"
#

#Handler is for Mdaemon Access
#AddHandler type-map isapi-isa.dll
#AddHandler type-map Worldclient.dll

Options ExecCGI
#Options FollowSymLinks
#AllowOverrdie None
Order Allow,Deny
Allow from All

# This secures the Proxy from becoming used as a third party Proxy server
ProxyRequests off
ProxyPreserveHost On
#SSLProxyEngine on

RedirectMatch /download /download/

ProxyPass /download http://192.168.1.9:8000
ProxyPass /Common/Scripts http://192.168.1.9:8000/Common/Scripts
ProxyPass /Common/Style/ http://192.168.1.9:8000/Common/Style/
ProxyPass /Common/Images http://192.168.1.9:8000/Common/Images

#AddType text/javascript .js
#AddType text/css .css

Order allow,deny
Allow from all
Options FollowSymlinks
ProxyHTMLExtended Off
#AddType text/javascript .js
#AddType text/css .css
SetOutputFilter DEFLATE
SetEnvIfNoCase Request_URI \.(?:gif|jpe?g|png|jpg)$ no-gzip dont-vary
SetEnvIfNoCase Request_URI \.(?:js|css)$ no-gzip dont-vary
SetOutputFilter proxy-html
ProxyPass http://192.168.1.9:8000
ProxyPassReverse /

RedirectMatch /Common/Style/ /download/Common/Style/
ProxyHTMLURLMap / /download/
#ProxyHTMLURLMap /download /download/
ProxyHTMLURLMap /download/Common/Style /Common/Style
ProxyHTMLURLMap /download/Common/Scripts /Common/Scripts
RequestHeader unset Accept-Encoding

#CacheDirLength 4
#CacheDirLevels 5
#CacheMaxFileSize 10000000
#CacheMinFileSize 1
#CacheRoot c:/Root\Cache

SetOutputFilter DEFLATE
SetEnvIfNoCase Request_URI \.(?:gif|jpe?g|png|jpg)$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.(?:exe|t?gz|zip|bz2|sit|rar)$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.pdf$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.avi$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mov$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mp3$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.mp4$ no-gzip dont-vary
#SetEnvIfNoCase Request_URI \.rm$ no-gzip dont-vary

#speed server by turning off ETags
#Header unset ETag
#FileETag None

#speed Server by compressing .js files
#
#
#SetOutputFilter DEFLATE
#
#

#speed server by turning off headers
#
#Header unset Last-Modified
#

#speed Server by adding future expiration time
#
#Header set Cache-Control "public"
#Header set Expires "Thu, 15 Apr 2010 20:00:00 GMT"
#

Options ExecCGI
#AllowOverrdie None
#Order Allow, Deny
#Allow from All

#*******************************************************yyyyyyyyyyyyyyyyyyyyy V HOST*****************************************************

ServerName www.yyyyyyyyyyyyyyyyyyyyy.com
ServerAlias yyyyyyyyyyyyyyyyyyyyy.com http://yyyyyyyyyyyyyyyyyyyyy.com
DocumentRoot "c:/root/SECONDDDDDDDDDDDDDDDDDDD"

#Disable Directory Browsing "-Indexes" "+Indexes" for allowing

Options -Indexes
#AllowOverrdie None
#Order Allow, Deny
#Allow from All

RewriteEngine on
RewriteCond %{HTTP_HOST} ^yyyyyyyyyyyyyyyyyyyyy\.com
RewriteRule ^(.*)$ http://www.yyyyyyyyyyyyyyyyyyyyy.com$1 [R=permanent,L]

ProxyRequests off
ProxyPreserveHost On
#SSLProxyEngine on

#*******************************************************TEST V HOST*****************************************************

ServerName www.xxxxxxxxxxxxxxxxxx.com
ServerAlias xxxxxxxxxxxxxxxxxx.com http://xxxxxxxxxxxxxxxxxx.com
DocumentRoot "c:/root/Test"

Include conf/extra/httpd-vhosts.conf

# Local access to the Apache HTTP Server Manual
Include conf/extra/httpd-manual.conf

# Distributed authoring and versioning (WebDAV)
#Include conf/extra/httpd-dav.conf

# Various default settings
#Include conf/extra/httpd-default.conf

# Secure (SSL/TLS) connections
#Include conf/extra/httpd-ssl.conf

# mod_proxy_html
Include conf/extra/proxy_html.conf
#
# Note: The following must be present to support
# starting without SSL on platforms with no /dev/random equivalent
# but a statically compiled-in mod_ssl.
#

SSLRandomSeed startup builtin
SSLRandomSeed connect builtin

Thankyou thankyou thankyou.

Thankyou thankyou thankyou. Been banging my head against a wall for 3 days with this one !

Documentation for

Documentation for mod_proxy_html is dire. I did attempt to document the installation procedure here: http://outsidethe.net/2009/11/17/building-and-installing-mod_proxy_html-and-mod_xml2enc

I agree that this issue needs a great deal more visibility. I've literally spent a day staring at "No links configured: nothing for proxy-html filter to do" messages in my logs and cursing the world.

I'll add a link back to this article from mine to try and help the next poor sod.

Sam.

Your page seems to be the

Your page seems to be the only one that is up to date.

Thank you.
Uwe

Hi, Thanks for useful post.

Hi,

Thanks for useful post. I find that once I start using ProxyHTMLLinks, my HTML document type changes from :

TO :

Even though I have tried specifying "ProxyHTMLDocType XHTML [Legacy]"

The moment I remove the ProxyHTMLLinks, the DocType is back to Transitional.

Any help would be much appreciated! Thanks

The tags were removed from my

The tags were removed from my previous post:

From : Transitional doctype
To : Strict doctype

Hi, Does anyone have a

Hi,

Does anyone have a solution to this issue with reverse proxy:

1- www. mydomain.com/app is set to pull content from www. otherdomain.com
2- but www. otherdomain.com redirects to www. otherdomain.com/other
3- and because of the redirection, the address bar changes to www. otherdomain.com/other - instead of staying www. mydomain.com/app

How do I handle redirection for a reverse proxy? I'd like redirections to be processed under the original request path.

Thanks.

Thanks a lot for this post. I

Thanks a lot for this post. I had missed the ProxyHTMLLinks requirements too...

This was useful for setting a reverse proxy to a PIWIK system.

And now when I searched for

And now when I searched for this problem, your post came up at the top of Google :-) Way to go and thanks for the solution.

I'm having the same problem,

I'm having the same problem, but I can't even get logging to work. Without logging information, I can't even start to work out why it's not working.

GREAT post, I lost 2 days

GREAT post, I lost 2 days fighting this module, until I found your page and in 20 minutes I was done!
Thanks!

hi ckdake, thanks for this,

hi ckdake, thanks for this, worked just like charm :)

Thank you so much for this.

Thank you so much for this.

I am working on reverse proxy

I am working on a reverse proxy for the backend WAS portal. I am having a hard time translating content-based text in the page. I need to change some tags like

/test/myportal/! into /myportal/test/myportal/!

Is there any way to achieve this? I am doing this in my conf file:

ProxyHTMLURLMap (^(.*wcm_url")\/)test $1myportal/upsers [R,s,x]

kavitha

Helped me ALOT, thanks

Helped me ALOT, thanks

Thank you ckdake. I saw all

Thank you ckdake. I saw all those ProxyHTMLLinks lines in the mod_proxy config example, but it just didn't click. Makes for a nice Friday afternoon.

Many thanks for article.

Many thanks for article.