URL-pathname [class]
:super pathname
:slots server port protocol
-
- extends pathname to allow URL notation.
url-pathname name [function]
-
-
instantiates url-pathname class object from url string or url-pathname class object.
escape-url url &optional (ss *standard-output*) (queryp t) [function]
-
-
writes percent-escaped url to stream ss (default: *standard-output*).
If queryp is T, then Space in url is encoded to +, otherwise escaped as Space.
This option is convenient for sending url query to server with separation.
escaped-url-string-from-namestring url-string &optional (queryp t) [function]
-
-
returns result of escape-url as string.
unescape-url url &optional (ss *standard-output*) (queryp t) [function]
-
-
unescapes percent-escaped url and writes unescaped url to stream ss.
unescaped-url-string-from-namestring url-string &optional (queryp t) [function]
-
-
returns result of unescape-url as string.
read-http url &key (timeout 10) (retry 5) [function]
-
- makes a socket connection to the designated url, and
read the html document.
The result is a list of tags and plain strings.
HTML tags are converted as lists consisting of the tag-name
and argument lists.
For example, the following html document,
results in the following list.
Note that tags are represented as lists, in which
the directive is represented as a symbol
followed by symbols or strings.
Whether an argument is represented as symbol or string
reflects how the original argument is described.
<body bgcolor=#ffa080>
<h1> EusLisp Title</h1>
<li> item1 </li>
<a href="http://www.etl.go.jp/~matsui/eus/euslisp.html"> euslisp</a>
</body>
("HTTP/1.1 200 OK"
"Date: Sun, 21 May 2000 11:47:00 GMT"
"Server: Apache/1.3.9 (Unix)"
"Last-Modified: Sun, 21 May 2000 11:19:35 GMT"
"ETag: \"4f014-c7-3927c647\""
"Accept-Ranges: bytes"
"Content-Length: 199"
"Content-Type: text/html"
(head) (title) " Toshihiro Matsui on t570" (/title) (/head)
(body bgcolor |#FFA080|)
(h1) " Title Line" (/h1)
(li) " item1 " (/li)
(a href "http://www.etl.go.jp/~matsui/eus/euslisp.html")
" euslisp"
(/a)
(/body))
extract-html tag html-list [function]
-
-
returns a list of strings (and tags) sandwitched by tag and /tag.
remove-html-tags html-list [function]
-
-
removes tags from the html-list leaving only texts (strings).
2015-08-17