Skip to main content

Adding support for path_info to Tomcat

By default Apache Tomcat does not come configured to handle the path_info server variable information commonly found in SES (search engine safe) compatible URLs.  An SES URL might look like:  www.domain.tld/index.cfm/user/123.  Combining Tomcat as the servlet container with other technologies like Railo and Mura, this missing feature can become problematic and frustrating (redundancy provided for emphasis).

A scenario

The path_info data is traditionally available to CFML applications a la the CGI.path_info variable.  Applications often refer to this variable to resolve URLs to specific actions or details of the application.  One such example is Mura.  The CMS (content management system) has the ability to host several sites from a single instance of the application.  While most sites are likely set up to be accessed from their own domain names, in some cases, the sites will be accessed through an identifier in the URL.  For example, assume the case of a news organization hosting a separate site for each news category.  Perhaps like the following:
  • news.domain.tld/business
  • news.domain.tld/news
  • news.domain.tld/sports
  • news.domain.tld/world
Using Mura as the content manager for these sites, an article in the sports section might have a URL that looks like:  http://news.domain.tld/sports/index.cfm/new-stadium-announced.  By default, Railo would throw an Error 404 because it can't match that URL to a file in the application.  What Mura wants to do is:
  1. Run the CFML file index.cfm
  2. And read CGI.path_info which is /new-stadium-announced
The remainder of this article is meant to be a three-step reference to setting-up Tomcat in conjunction with Apache httpd in such a way that it won't drop the path_info aspect of a URL.

The fix

STEP 1:  Enable the following modules in Apache

File:  APACHE/conf/http.conf

Uncomment the following lines to enable the necessary modules.
  • LoadModule proxy_module modules/mod_proxy.so
  • LoadModule proxy_ajp_module modules/mod_proxy_ajp.so
  • LoadModule rewrite_module modules/mod_rewrite.so

STEP 2:  Add the proxy and rewrite lines

File:  APACHE/conf/extra/http-vhosts.conf
 

NOTE:  The lines of interest are those below the comment.

<VirtualHost *:80>
 ServerAdmin administrator@domain.tld
 ServerName server.domain.tld
 DocumentRoot "/path/to/server.domain.tld"
 ErrorLog "/path/to/logs/server.domain.tld-error.log"
 CustomLog "/path/to/logs/server.domain.tld-access.log" common
 JkMount /*.cfm mainWorker
 
 # The lines from here to the bottom of the block need to be in the directive.
 ProxyPreserveHost On
 ProxyPassReverse / ajp://server.domain.tld:8009/
 
 RewriteEngine On
 
 RewriteRule ^(.+\.cf[cm])(/.*)?$ ajp://%{HTTP_HOST}:8009$1$2 [P]
</VirtualHost>


STEP 3:  Add a servlet mapping

File:  /path/to/server.domain.tld/CONTEXT/WEB-INF/web.xml

NOTE:  Fight the urge to use wildcards (*) to keep the servlet mappings to a single entry.  The way Tomcat handles wildcards means that a new mapping will be necessary for URLs that include a sub-directory or use a script file other than index.cfm.  A couple of examples have been included for this purpose.

<!-- Servlet handlers to support PATH_INFO (one per directory and CFML file) -->
<servlet-mapping>
 <servlet-name>CFMLServlet</servlet-name>
 <url-pattern>/index.cfm/*</url-pattern>
</servlet-mapping>


<!-- Example with an alternate file in the URL (/application.cfm/some/info). -->
<servlet-mapping>
 <servlet-name>CFMLServlet</servlet-name>
 <url-pattern>/application.cfm/*</url-pattern>
</servlet-mapping>


<!-- Example with a sub-folder in the URL (/application/index.cfm/some/info). -->
<servlet-mapping>
 <servlet-name>CFMLServlet</servlet-name>
 <url-pattern>/application/index.cfm/*</url-pattern>
</servlet-mapping>


Credit goes to Jamie Krug for the work he did on his blog article describing his set-up that use these steps as well.

URL Rewrite Goodies for Apache, Tomcat, Railo and Mura CMS
http://jamiekrug.com/blog/index.cfm/2009/5/22/url-rewrite-goodies-for-apache-tomcat-railo-and-mura-cms

Comments

Popular posts from this blog

Remove control of Chrome being managed by organization on personal devices

Chrome may indicate that it's being managed by a user's organization. This warning is provided by the Chrome Policies feature of the browser. To know if an instance of Chrome is managed by an organization, there will be an entry at the bottom of the browser’s hamburger menu (three dot menu), on the right side of the browser window that reads, "Managed by your organization." This is likely due to an entry in the Chrome Policies listing, which can be found by loading this page in the browser: chrome://policy The policies listed in this section are stored in the computer's file system in one of the following locations as JSON files. /usr/share/chromium/policies/managed /usr/share/chromium/policies/recommended Remove the offending JSON files, and click the Reload policies button. The Managed by your organization entry in the browser menu should be gone. A notice like this on instances of Chrome for work, school, library, or other devices that belong to an organizati...

Allow Windows authentication using SQL Server driver with DBeaver

DBeaver will allow Microsoft Windows single sign on access when connecting to Microsoft SQL Server using the SQL Server driver (rather than jTDS ).  From the driver properties settings, set the integratedSecurity flag to true . Open the Connection configuration panel and choose the Driver properties section. Set the integratedSecurity flag to true . A subtle, but important step is to not provide username and password credentials to the connection.

Edit CUPS Configuration File To Re-assign Network Address

The printers.conf file can be edited to change the network address a printer uses.  This can be useful to fix situations in which the printer in question has a new IP address, but the local system is trying to use the previous address. Shutdown the CUPS server Change the network address Restart the CUPS server sudo systemctl stop cups sudo nano /etc/cups/printers.conf sudo systemctl start cups NOTE The editor used in the example is nano for the sake of those who may be less comfortable in the command-line.  With nano, once the change has been made, use Control + O to save the changes, and then Control + X to quit the editor. Ideally, this process would not be necessary.  Instead, once a printer is added, it will always be reachable at the address it was assigned when it was added to the system.  In practice, things like power outages, or breaks in network connectivity, may be enough for the DHCP server to issue a new IP address. A tip when making the address cha...