I. Executive Summary
In the past 2 years, the the Internet has become a major focus
for the the community. Although the Internet has been around
for over 15 years, it was primarily a research network used by
engineers and scientists at leading universities, government
agencies, and military branches.
Since 1990, when the Bush administration mandated that "The Internet"
be a fully self-supporting infrastructure, it has been a rapidly growing
network which has grown exponentially from less than 1 million users
in 1990 to nearly 64 million users by 1995. Secondary markets such as
interactive access known as "Shell Accounts" and access through the
World Wide Web (approximately 1 million independently operated
servers) have experienced similar exponential growth curves. The most
recent wave of growth is in the "Commercial Internet User". These are
people who pay for services beyond basic connectivity. There are
currently about 2 million "Commerce Users".
This paper will chart out a strategy for competing effectively in this
rapidly growing market in such a way as to be able to take advantage of
this and subsequent waves of markets.
II. Why is the Internet important to S&P
More than anything else, the Internet represents Communication,
Powerful Relationships, and Opportunity. This is true for
users, information providers, access providers, and carriers.
The internet is a carefully balanced set of agreements or
de-facto "standards" established and maintained by a series of
public specifications known as Requests For Comment (RFCs), and
Freely Distributable Reference Source Code (FDRSC). Although
most of this code was written for variants of the UNIX operating
system (once FDRSC itself and avialable in FDRSC variants),
several commercial vendors have ported this code to Windows and
OS/2. Currently this "unsupported" software is supported by
over 200,000 engineers and MIS professionals. It is very
popular with Value Added Resellers, Consulting Services, and
Large corporations because they don't have to "build from
scratch", yet they can diagnose and correct problems more
quickly than "vendors". Because these corrections are
contributed back to the internet, support is generally quite
good
Currently, there are an estimated 60-80 million E-Mail users and
an estimated 10 million users of the World Wide Web (the Web).
There are over 4.5 million registered "Web Pages", each
averaging about 25,000 hits (uses) per week. Quality Commercial
content can go as high as 25,000 hits per day. The average web
server serves 10-20 pages/second.
The Web provides an easy to use interface that requires a minimal
amount of expertize to use. Most commercial Web Browsers are very easy
to set up. Dial-up users need to provide a phone number and 2
"addresses". Each address is 4 numbers between 1 and 256, for example
192.131.34.7 this information is provided by the access provider.
As an advertizing media, the Web provides content providers the
opportunity to provide advertizing to highly qualified
customers.
A. What if we ignore the Internet
The internet is growing at a rate of nearly 20 percent/month.
There are a number of metrics being used to estimate the
approximate number of users using the different types of access,
but nearly all indicators are that people are connecting at an
an exponential growth rate that exceeds 10%/month. In some
segments (Web users) the growth rate exceeds 20%/month.
There is already a significant presence on the internet. The
economics in terms of band-width efficiency and low switching
and link costs (largely due to FDRSC) has caused the internet to
displace traditional on-line "self contained services". Lexus
and Nexus are now on the Internet, as is Dow Jones. There are
several government databases (Edgar) which are provided in a
"raw" format in delayed databases. The on-line services are
also now providing access to the web and other internet services
and finding themselves able to leverage outside data.
Since MCI first started carrying the internet traffic in 1991,
the combination of large amounts of research data, interactive
communication (e-mail, bulletin boards called "news groups", and
real-time Internet Relay Chat (computer conference calling) have
lured users into first supplementing their on-line service
(America OnLine, Prodigy, or Compuserve) and eventually subscribing
to a more direct full service direct access provider.
On of the more significant factors is the integration of
corporate enterprise networks and the Internet. Historically,
the internet was primarily corporate lines which were "donated"
in exchange for access to research and development information.
Today, fire-walls and routers provide a means of providing
secure access using protected common carriers. The invisible
"commercial internet" is also growing at exponential rates.
The internet is to telecommunications what the automobile and highway
system was to transportation. While we might wish for the relative
security of the Railroads, or attempt get security through Mass Transit
(Busses and Taxicabs), the consumer is willing to take a calculated risk
in exchange for the ability to drive from his home to any destination
he chooses and return, on his schedule. The internet provides the same
level of risk and convenience. As a result, the internet consumer can
choose from an extremely competitive range of products which are
more convenient than point-to-point dial-up services (Compuserve, AMERICA ONLINE,
or Prodigy) or dedicated link services (Rueters, Quotron, A.D.P.).
Trying to ignore internet is like trying to ignore the train
that is coming down the tracks. It is better to be standing on
the platform than between those tracks. Companies who have resisted the
Internet have lost a significant market share.
B. How it fits with an overall online access strategy
The Internet is rapidly becoming commercialized and is now
accessible by nearly anyone in the world. There are several
different protocols and services which facilitate access.
Bulletins and clip files can be sent via e-mail, real-time feeds
can be sent using IRC. Finally, we can provide high-quality
services with uniform interfaces whether provided at a national
site, or in the local corporations. For example, customers with
high demand could receive a Comstock feed and have the same Web
interface on their local system.
Providing Internet access would provide access to all Prodigy,
Compuserve, America Online, Delphi, and Direct Access internet customers.
Proprietary strategies could give us only a small segment of the
market at best.
By adopting the open standards of the Internet, we can also provide
high-quality low-cost "enterprise servers" which can combine Comstock,
internet (for retransmission of lost sattellite records) and Local Area Network access
for high-performance "Web Servers" within local corporate fire-walls.
C. What are our real expectations
There are several sources of revenue available. Web pages are
now earning as much as $7000/line when they are relatively busy.
The important factor on the internet, especially WEB advertizing
is that the reference can be placed such that very highly
qualified customers are actually intentionally requesting the
advertizing with the intent of making a purchase decision
within moments of starting to browse.
The information consumer market (people willing to pay for content as
well as access) is also growing exponentially. It is not unusual to
have an average of 200-600 simultaneous users on a popular site. For
several types of information (news, financial data, classified
searchable advertizing...) they are willing to pay s much as $1.50/page
(Meade), or as much as $100/month (Dow Jones). Generally there is a
wider acceptance of "bulk purchasing" - 500 pages/month for $5.00/month
(the equivalent of a copy of Byte magazine). Again, this figure was
for a local Newspaper offering it's news and some secondary databases.
Even at a mere penny per page, it is likely that we would be capable of
earning $60,000/day at only 60 pages/second. A 5 segment stock report might
actually be 40-50 different pages.
III. Significant Issues
If we wish to be on the internet, there are some issues we need
to manage. In order to reach the widest range of customers, we need to
manage compatibility. In order to protect the value of our content, we
need to manage security. We also need to provide convenient mechanisms
to receive payments for our content. We also need to provide ways to
get the customer hooked on our content and have them return regularly
by providing "free samples" while not giving away everything.
A. Compatibility.
We must first be aware of the Open Standards. This is
the key to wide-spread user acceptance. Unlike vendor controlled
environments where "revolutionary" releases are shipped once or twice per
year, the Internet is a more "evolutionary" with innovations being
available as much as a few years prior to general acceptance by the
entire user community. It is important to be aware of these
innovations, even to provide demonstrations, but it is risky to
implement features that exclude clients. The customer who is
greeted with a message like "The server refuses to accept your
connection" will become a regular customer of the competition.
B. Security and Copyright
There are two sides to the security issue. One is protecting
our data from unauthorized access and modification, the other is
overcoming the misinformation that has been proliferated lately.
The WEB uses a communications protocol called the HyperText
Transfer Protocol (HTTP) which provides much of the session
layer management. The Web also uses an application protocol or
"language" called the HyperText Markup Language (HTML). The
latest browsers, including Netscape and Mosaic 2.0+ provide
security features such as the "Secure Sockets Protocol" (SSP)
and the "Secure Hypertext Transfer Protocol" (SHTTP). The SSP
is patented and has must be licensed for commercial use. The
SHTTP is available under FDRSC which assures a wider acceptance
and reasonable licensing costs.
The secondary issue is that of copyright. The individual user
who posts a single story once/month probably does more to
advertize our product and service (they need to be guided
through an explanation of attribution). The big concern is the
user who routinely clips documents and starts passing them
around in news-groups. Generally, those people should site the
URL instead of quoting the full article. A 2 paragraph summary,
or the first 2 paragraphs should be reasonable.
C. Commercial -- how do we get paid
Until recently, commerce on the net was a bit challenging. The
risk of having your credit card and related information
intercepted by someone who would use your card to buy their
computer was too high. Recently, relaxation of restrictions on
RSA and DES encryption have reduced the risk of casual
interception (something like looking over your shoulder
electronically). Several companies including First Virtual
Holdings, Digicash, NetCash, and First Boston offer "push-button
checks". The set-up fees range from $10 (First Virtual), to
$2500 (Visa), and are likely to go up rapidly as leaders emerge.
They collect commissions ranging from 39 cents to 6 percent.
There are several charging strategies. The simplest, from a technology
standpoint would be the "charge per hit" or the "bulk rate
charge per hit". The concern here is that each user might have
to contract with 2000 providers and each provider has to deal
with 2 million customers. There are several companies which
provide a "One fee buys thousands of servers".
These systems let you delegate authorization to them and at the end of
the month, you tally up the number of accesses you submitted and
collect a percentage based on that proportion (1 to 3 cents/page).
Users who are information hungry will be very likely to use this
service. Companies that provide this include the NewsShare
corporation, the Author's registry and the Copyright Clearing house.
In addition, many ISPs are including royalty charges to one or more of
these services. Examples of this include NewsShare, Copyright Clearing
House, Folio, and the Author's registry.
D. Content strategy -- how much free
There are several strategies for managing free vs commercial. A
simple strategy is to combine the "puppy dog close" (try it out
a few weeks, if you don't like it, you pay nothing), and the
health club plan (get them on for a year plan so that they feel
the must come back). In addition, we may want to work out bulk
rates for access providers (1,000,000 pages/month at 2
cents/page). The unused or excess pages/month will be factored
into the next month's rate.
We may wish to become a payment recipient who supports several
copyright clearing houses. In effect, we would collect the payment
and the clearinghouse would credit our "pool" and allow the users we
put into the pool access to ours and other publishers information.
IV. Actual Recommendation
What follows are several recommendations, with alternates where
desired.
A. The Home Page
In addition to the usual copyright and "more about us", we should
also provide lines which point to search engines (Wais, Folio, Lycos...)
Another line should link to a Marketscope "Menu".
We should also include Payment options, first for payment directly to us
for private and consortium data. We should offer them "apply buttons"
for each of these cards. We should at least support First Virtual,
Digicash, and NetCash. The goal is to make it as easy as possible to
turn them into a paying customer.
All content pages should contain links back to the S&P home page.
Specifically a hierarchy like:
McGraw-Hill, Standard & Poor's, Product (Marketscope...),
Topical (Company profile information), and "previous".
The intent here is to guide customers who receive our pages as a
result of search hits back to company oriented product.
B. Product offering
- Content
- Presentation
- Pricing Model
The "low-budget" customers would be given access to a delayed (batch
update) feed on a very generic server. The "High end" customers would
receive real-time updates and rapid response times. Corporate
customers should have the option of a web-server interface to
Comstock/Marketscope data.
We can also point links to "premium" services and invite them to upgrade
their levels after a certain number of "Samples". After they've seen
10 consensus reports within a week, they know what it is and can
determine whether they consider it worth the extra payment.
In either case, advertizing is a definite possibility. The "random
spot" is generally disliked by internet users, on the other hand,
relevant information, such as a link to company home-pages or targeted
catalogues can be very worth-while. Often, corporate users search
content as buyers looking for suppliers. By the time they press
the "See advertizing" page, they are highly qualified and very
interested. Some of the most successful internet commerce servers are
little more than "Classified Advertizing".
We will probably want to create "Company Profile pages" which function
as a "table of contents" for a group of products, including prospectus
and advertizing "buttons".
- In house vs. outside development (system operator)
We want to leverage as much of the available technology as possible. It
is important to have source code, to know that that source code is
available to others, and to participate in the supporting news groups.
This will reduce the development effort and at the same time assure a
higher probability of rapid resolution of problems.
The internet is a global network. We need to be aware that just about
the time New York Closes, Hong Kong is opening. When England opens,
it 3:00 AM. It can be very useful to be able to get a response from
support group members in England, New Zealand, Australia, and
California. As good internet citizens, we should monitor these groups
and contribute where answers are available "off the cuff". Often, the
bug we fixed yesterday will be discovered by 3 others tomorrow.
Although it is possible to obtain fully bundled turnkey systems, they
are rarely as turnkey as they would appear. In addition the license
rates can run as high as $1 million dollars for software that runs on
hardware that costs less than $50,000. Using the modular software
approach including the Unix Platform, and TCP/IP
C. Other uses
Most of the focus has been on "The Web". The internet also provides
e-mail, news-groups (public forums, many technical and business
oriented), and archives (ability to search nearly 1 billion (mostly
text) documents in a matter of seconds) using "front-ends" which
interface to web browsers. We want to be able to serve these needs as
well.
The internet is a natural form of customer feedback, discussion groups,
and free-lance support. As the commerce market increases, there are
more business oriented forums as well.
We should use the "Standard" products. There are about 20 mail
interfaces to Windows, all but the "commercial" packages support
internet standards transparently.
There is also the possibility of providing encrypted feeds through
the internet, or using the internet to provide "retransmissions" for
Comstock Sattellite feeds. Both can provide access to the small "branch
banks" and "Certified Financial Consultant" who can benifit from the
combination of Feed and Internet products.
D. Development timetable
The actual development should be rather rapid. Availabilty of code and
support for Unix on platforms ranging from 386/SX PCs to 1000 Processor
engines can provide the ability to grow at the 10-20% rates. The
modular structure allows us to "tune" the network based on shifting
demands.
Prototype servers, and related engines can be configured in a few weeks.
I was able to load and start a web server for Ultrix in about 2 days.
A development infrastructure using BSDI UNIX or Linux can be set up
in about 2 weeks. The logistics of coordinating with network
administrators and managing the flow of traffic over the "fire-wall"
may add some additional delay.
We will also need to negotiate licenses with some companies. The "list
price" on a netscape server using a Sun/Solaris engine can run as
high as $300,000 per year. The hardware runs about $200,000. This
will serve about 4 million hits/day (3000 pages/minute).
The smaller, loosely coupled servers provide a very high degree of fault
tolerance as well. This technology has been used on Directory
Assistance Systems for about 25 years with down-times averaging 15
minutes/year.
A BSDI or Linux PC, configured with an NCSA server (Mosaic), runs about
$3000/year and serves about 200 pages/minute. Methods of delegating
work to other low-cost servers can increase this to over 800
pages/minute. This is sufficient to service a pair of T1 links.
The BSDI and Linux hosts are also used as the building blocks for
Routers, Fire-walls, Database servers, and File servers using freely
redistributable source code provided on CD-Roms with quarterly or
monthly updates.
We will also need to negotiate licenses for payment schemes and possibly
encryption.
Since much of our content is either fielded data or SGML tagged data, it
will be fairly easy to adapt it for use on the web. The Edgar content
can be converted to HTML using unix utility languages that can cut
development time by 80% compared to regular C development. Tom Gerhard
has already converted several European Marketscope Pages to HTML using
a utility package from SoftQuad.
In effect we can return a substantial NOI at 200 pages/second ($4 to
$10/second) on increments of $10,000/server. As resource intensive
functions are identified and specialty needs are singled out for
optimization, they can be migrated to performance specific platforms
such as Pyramid, VMS, or RS-6000.
The major development effort will be the development of Menus and
artwork. This can be done in house, or through free-lance designers
at $20-200/page. The interaction is similar to a game. We will need
someone who can design an interesting dialogue.
Consistent with the "Evolutionary" nature of the internet, we should
work toward getting the first "Easy" products (Marketscope, Edgar)
onto the web within 90 days. We can add the various components of
stock reports within 3 to 5 months. Many of the tools for rapid
prototyping and development are provided on the recommended platforms,
the remainder can be loaded.
The other major effort will be negotiations. There are ways to
structure systems so that "simultanious users" can be reduced to a
very small number (under 200 oracle licenses for example) while
serving several thousand "browsers". We do want to make sure that
we do not find ourselves irrevocably committed to paying royalties
based on each individual user coming in through the "Pool" or through
through subscription. This could easily be a number in the millions.
E. McGraw-Hill coordination
This cooperation needs to occur at several levels. First, there are the
development logistics. There have been problems with the Heightstown
routers, fire-walls, and address/routing management. We need to have
coordination between the different systems such that we can add new
connections to our side within a few minutes, and can add accounts to
the outside within a few seconds.
The potential value of a McGraw-Hill "Pool" raises the possibility of
providing access to all McGraw-Hill publications on a "Bulk purchase
plan". Users could purchase "Books" of a few thousand pages (remember
web pages are very small) for $30-50 and be able to access those pages
repeatedly without extra charges. Supplementing them with dynamic
content such as S&P Marketscope and Daily News can generate a steady flow of
return business and new "book" purchases.
V. Resource requirements
We should begin with a prototyping system (Linux or Unix, basic service)
and a series of phased enhancements.
Prototype phase:
A. Pentium CPU as prototype server, Linux or BSDI OS. | $5000 |
B. Configure servers, routers, fire-walls. | 5 days. |
C. Httpd web server porting and configuration. | 5 days. |
D. Billing system configuration - | 10 days. |
E. Oracle and WAIS front-ends. | 5 days. |
F. Convert Marketscope menus to Web Pages | 10 days. |
G. Convert Marketscope forms to Web Pages - | 10 days. |
H. Royalty Negotiations | 10 days. |
Dependencies A:B B:CD C:FG D:H E:G
Conversion of marketscope forms may be automatable.
Phase 1 beta:
MarketScope Alert to HTML driver. | 10 Days. |
Royalty negotiations with 3rd parties. | 5 Days. |
Enable Secure HTML. | 3 Days. |
Blocking Strategy. | 2 Days. |
Enable "Feed Rate" Customers. | 3 Days. |
Start Billing. | 2 Days. |
Total | 25 Days. |
Phase 1 production:
Promotion. | 10 Days |
Inclusion on other databases (lycos, WAIS,...). | 3 Days |
Inclusion links on strategic web pages (Compuserve, America Online...) | |
Inclusion links to strategic web pages (Edger, MIT stats...) |
Total | 20 Days |
Phase 2:
Stock Reports text to HTML. | 10 Days |
Stock Reports Graphics to JPEG and GIF. | 10 Days |
Arrange advertizing (Initial). | 5 Days+ |
Arrange other content(Initial). | 5 Days+ |
Secure Sockets Distribution. | 5 Days |
Phase 3:
Edgar SGML to HTML parser. | 10 Days |
Edgar EDI to HTML parser. | 10 Days |
Access other content. | 5 Days |
Phase 4:
Based on traffic, Migrate critical functions to alternate platforms. | 20 Days |
Based on traffic & Royalty arrangements, optimise resource usage. | 20 Days |
Performance enhancements. | 20 Days |
To: tonym@ix.netcom.com
cc:
Subject: Latest proposal From: Rex Ballard Date: Thu, 29 Jun 1995 17:32:19 -0400
How the Web Was Won
Subject: Latest proposal
From: Rex Ballard
Date: Thu, 29 Jun 1995 17:32:19 -0400
--------
I. Executive Summary
In the past 2 years, the the Internet has become a major focus
for the the community. Although the Internet has been around
for over 15 years, it was primarily a research network used by
engineers and scientists at leading universities, government
agencies, and military branches.
II. Why is the Internet important to S&P
More than anything else, the Internet represents Communication,
Powerful Relationships, and Opportunity. This is true for
users, information providers, access providers, and carriers.
The internet is a carefully balanced set of agreements or
de-facto "standards" established and maintained by a series of
public specifications known as Requests For Comment (RFCs), and
Freely Distributable Reference Source Code (FDRSC). Although
most of this code was written for variants of the UNIX operating
system (once FDRSC itself and avialable in FDRSC variants),
several commercial vendors have ported this code to Windows and
OS/2. Currently this "unsupported" software is supported by
over 200,000 engineers and MIS professionals. It is very
popular with Value Added Resellers, Consulting Services, and
Large corporations because they don't have to "build from
scratch", yet they can diagnose and correct problems more
quickly than "vendors". Because these corrections are
contributed back to the internet, support is generally quite
good
Currently, there are an estimated 60-80 million E-Mail users and
an estimated 10 million users of the World Wide Web (the Web).
There are over 4.5 million registered "Web Pages", each
averaging about 25,000 hits (uses) per week. Quality Commercial
content can go as high as 25,000 hits per day. The Web provides
an easy to use interface that requires a minimal amount of
expertize to use. Most commercial Web Browsers are very easy to
set up. Dial-up users need to provide a phone number and 2
"addresses". Each address is 4 numbers between 1 and 256, for
example 192.131.34.7 this information is provided by the access
provider.
As an advertizing media, the Web provides content providers the
opportunity to provide advertizing to highly qualified
customers.
A. What if we ignore the Internet
The internet is growing at a rate of nearly 20 percent/month.
There are a number of metrics being used to estimate the
approximate number of users using the different types of access,
but nearly all indicators are that people are connecting at an
an exponential growth rate that exceeds 10%/month. In some
segments (Web users) the gowth rate exceeds 20%/month.
There is already a significant presence on the internet. The
economics in terms of band-width efficiency and low switching
and link costs (largely due to FDRSC) has caused the internet to
displace traditional on-line "self contained services". Lexus
and Nexus are now on the Internet, as is Dow Jones. There are
several government databases (Edgar) which are provided in a
"raw" format in delayed databases. The on-line services are
also now providing access to the web and other internet services
and finding themselves able to leverage outside data.
Since MCI first started carrying the internet traffic in 1991,
the combination of large amounts of research data, interactive
communication (e-mail, bulliten boards called "news groups", and
real-time Internet Relay Chat (computer conference calling) have
lured users into first suplimenting their on-line service
(America OnLine, Prodigy, or Compuserve) and eventually subscribing
to a more direct full service direct access provider.
On of the more significant factors is the integration of
corporate enterprize networks and the Internet. Historically,
the internet was primarily corporate lines which were "donated"
in exchange for access to research and development information.
Today, fire-walls and routers provide a means of providing
secure access using protected common carriers. The invisible
"commercial internet" is also growing at exponential rates.
Trying to ignore internet is like trying to ignore the train
that is coming down the tracks. It is better to be standing on
the platform than between those tracks. AT&T resisted the
Internet for years and lost a significant market share in the
process.
B. How it fits with an overall online access strategy
The Internet is rapidly becoming commercialized and is now
accessible by nearly anyone in the world. There are several
different protocols and services which facilitate access.
Bullitens and clip files can be sent via e-mail, real-time feeds
can be sent using IRC. Finally, we can provide high-quality
services with uniform interfaces whether provided at a national
site, or in the local corporations. For example, customers with
high demand could receive a ComStock feed and have the same Web
interface on their local system.
Providing Internet access would provide access to all Prodigy,
Compuserve, AOL, Delphi, and Direct Access internet customers.
Proprietary strategies could give us only a small segment of the
market at best.
By adopting the open standards of the Internet, we can also provide
high-quality low-cost "enterprise servers" which can combine comstock,
internet (for retransmission of lost sattelite records) and LAN access
for high-performance "Web Servers" within local corporate fire-walls.
C. What are our real expectations
There are several sources of revenue available. Web pages are
now earning as much as $7000/line when they are relatively busy.
The important factor on the internet, especially WEB advertizing
is that the reference can be placed such that very highly
qualified customers are actually intentionally requesting the
advertizing with the intent of making a purchase decision
within moments of starting to browse.
The information consumer market (people willing to pay for content as
well as access) is also growing exponentially. It is not unusual to
have an average of 200-600 simultaneous users on a popular site. For
several types of information (news, financial data, classified
searchable advertizing...) they are willing to pay s much as $1.50/page
(Meade), or as much as $100/month (Dow Jones). Generally there is a
wider acceptance of "bulk purchasing" - 500 pages/month for $5.00/month
(the equivalent of a copy of Byte magazine). Again, this figure was
for a local Newspaper offering it's news and some secondary databases.
Even at a mere penny per page, it is likely that we would be capable of
earning $60,000/day at only 60 pages/second. A 5 segment stock report might
actually be 40-50 different pages.
III. Significant Issues
If we wish to be on the internet, there are some issues we need
to manage. In order to reach the widest range of customers, we need to
manage compatibility. In order to protect the value of our content, we
need to manage security. We also need to provide convenient mechanisms
to receive payments for our content. We also need to provide ways to
get the customer hooked on our content and have them return regularly
by providing "free samples" while not giving away everything.
A. Compatibility.
We must first be aware of the Open Standards. This is
the key to wide-spread user acceptance. Unlike vendor controlled
environments where "revolutionary" releases are shipped once or twice per
year, the Internet is a more "evolutionary" with innovations being
available as much as a few years prior to general acceptance by the
entire user community. It is important to be aware of these
innovations, even to provide demonstrations, but it is risky to
implement features that exclude clients. The customer who is
greeted with a message like "The server refuses to accept your
connection" will become a regular customer of the competition.
B. Security and copyright
There are two sides to the security issue. One is protecting
our data from unauthorized access and modification, the other is
overcoming the misinformation that has been proliferated lately.
The WEB uses a communications protocol called the HyperText
Transfer Protocol (HTTP) which provides much of the session
layer management. The Web also uses an application protocol or
"language" called the HyperText Markup Language (HTML). The
latest browsers, including Netscape and Mosaic 2.0+ provide
security features such as the "Secure Sockets Protocol" (SSP)
and the "Secure Hypertext Transfer Protocol" (SHTTP). The SSP
is patented and has must be licensed for commercial use. The
SHTTP is available under FDRSC which assures a wider acceptance
and reasonable licensing costs.
The secondary issue is that of copyright. The individual user
who posts a single story once/month probably does more to
advertize our product and service (they need to be guided
through an explanation of attribution). The big concern is the
user who routinely clips documents and starts passing them
around in news-groups. Generally, those people should site the
URL instead of quoting the full article. A 2 paragraph summary,
or the first 2 paragraphs should be reasonable.
C. Commercial -- how do we get paid
Until recently, commerce on the net was a bit challenging. The
risk of having your credit card and related information
intercepted by someone who would use your card to buy their
computer was too high. Recently, relaxation of resrictions on
RSA and DES encryption have reduced the risk of casual
interception (something like looking over your shoulder
electronically). Several companies including First Virtual
Holdings, Digicash, NetCash, and First Boston offer "push-button
checks". The set-up fees range from $10 (First Virtual), to
$2500 (Visa), and are likely to go up rapidly as leaders emerge.
They collect commissions ranging from 39 cents to 6 percent.
There are several charging strategies. The simplest, from a technology
standpoint would be the "charge per hit" or the "bulk rate
charge per hit". The concern here is that each user might have
to contract with 2000 providers and each provider has to deal
with 2 million customers. There are several companies which
provide a "One fee buys thousands of servers".
These systems let you delegate authorization to them and at the end of
the month, you tally up the number of accesses you submitted and
collect a percentage based on that proportion (1 to 3 cents/page).
Users who are information hungry will be very likely to use this
service. Companies that provide this include the NewsShare
corporation, the Author's registry and the CopyRight Clearing house.
In addition, many ISPs are including royalty charges to one or more of
these services. Examples of this include NewsShare, Copyright Clearing
House, Folio, and the Author's registry.
C. Content strategy -- how much free
There are several strategies for managing free vs commercial. A
simple strategy is to combine the "puppy dog close" (try it out
a few weeks, if you don't like it, you pay nothing), and the
health club plan (get them on for a year plan so that they feel
the must come back). In addition, we may want to work out bulk
rates for access providers (1,000,000 pages/month at 2
cents/page). The unused or excess pages/month will be factored
into the next month's rate.
We may wish to become a payment recipent who supports several
copyright clearing houses. In effect, we would collect the payment
and the clearinghouse would credit our "pool" and allow the users we
put into the pool access to ours and other publishers information.
IV. Actual Recommendation
What follows are several reccomendations, with alternates where
desired.
A. The Home Page
In addition to the usual copyright and "more about us", we should
also provide lines which point to search engines (Wais, Folio, Lycos...)
Another line should link to a Marketscope "Menu".
We should also include Payment options, first for payment directly to us
for private and consortium data. We should offer them "apply buttons"
for each of these cards. We should at least support First Virtual,
Digicash, and NetCash. The goal is to make it as easy as possible to
turn them into a paying customer.
All content pages should contain links back to the S&P home page.
Specifically a hierarchy like:
McGraw-Hill, Standard & Poor's, Product (Marketscope...),
Topical (Company profile information), and "previous".
The intent here is to guide customers who recieve our pages as a
result of search hits back to company oriented product.
B. Product offering
- Content
- Presentation
- Pricing Model
The "low-budget" customers would be given access to a delayed (batch
update) feed on a very generic server. The "High end" customers would
receive real-time updates and rapid response times. Corporate
customers should have the option of a web-server interface to
Comstock/Marketscope data.
We can also point links to "premium" services and invite them to upgrade
their levels after a certain number of "Samples". After they've seen
10 consensus reports within a week, they know what it is and can
determine whether they consider it worth the extra payment.
In either case, advertizing is a definite possibility. The "random
spot" is generally disliked by internet users, on the other hand,
relevant information, such as a link to company home-pages or targeted
catalogues can be very worth-while. Often, corporate users search
content as buyers looking for suppliers. By the time they press
the "See advertizing" page, they are highly qualified and very
interested. Some of the most successful internet commerce servers are
little more than "Classified Advertizing".
We will probably want to create "Company Profile pages" which function
as a "table of contents" for a group of products, including prospectus
and advertizing "buttons".
- In house vs. outside development (system operator)
We want to leverage as much of the available technology as possible. It
is important to have source code, to know that that source code is
available to others, and to participate in the supporting news groups.
This will reduce the development effort and at the same time assure a
higher probability of rapid resolution of problems.
The internet is a global network. We need to be aware that just about
the time New York Closes, Hong Kong is opening. When England opens,
it 3:00 AM. It can be very useful to be able to get a response from
support group members in England, New Zealand, Australia, and
California. As good internet citizens, we should monitor these groups
and contribute where answers are available "off the cuff". Often, the
bug we fixed yesterday will be discovered by 3 others tomorrow.
C. Other uses
Most of the focus has been on "The Web". The internet also provides
e-mail, news-groups (public forums, many technical and business
oriented), and archives (ability to search nearly 1 billion (mostly
text) documents in a matter of seconds) using "front-ends" which
interface to web browsers. We want to be able to serve these needs as
well.
The internet is a natural form of customer feedback, discussion groups,
and free-lance support. As the commerce market increases, there are
more business oriented forums as well.
We should use the "Standard" products. There are about 20 mail
interfaces to Windows, all but the "commercial" packages support
internet standards transparently.
There is also the possibility of providing encrypted feeds through
the internet, or using the internet to provide "retransmissions" for
Comstock Sattellite feeds. Both can provide access to the small "branch
banks" and "Certified Financial Consultant" who can benifit from the
combination of Feed and Internet products.
D. Development timetable
The actual development should be rather rapid. Availablilty of code and
support for Unix on platforms ranging from 386/SX PCs to 1000 Processor
engines can provide the ability to grow at the 10-20% rates. The
modular structure allows us to "tune" the network based on shifting
demands.
Prototype servers, and related engines can be configured in a few weeks.
I was able to load and start a web server for Ultrix in about 2 days.
A development infrastructure using BSDI UNIX or Linux can be set up
in about 2 weeks. The logistics of coordinating with network
administrators and managing the flow of traffic over the "fire-wall"
may add some additional delay.
We will also need to negotiate licenses with some companies. The "list
price" on a netscape server using a Sun/Solaris engine can run as
high as $300,000 per year. The hardware runs about $100,000. This
will serve about 1 million hits/day (700 pages/minute).
The smaller, loosely coupled servers provide a very high degree of fault
tolerance as well. This technology has been used on Directory
Assistance Systems for about 25 years with down-times averaging 15
minutes/year.
A BSDI or Linux PC, configured with an NCSA server (Mosaic), runs about
$3000/year and serves about 200 pages/minute. Methods of delegating
work to other low-cost servers can increase this to over 800
pages/minute. This is sufficient to service a pair of T1 links.
The BSDI and Linux hosts are also used as the building blocks for
Routers, Fire-walls, Database servers, and File servers using freely
redistributable source code provided on CD-Roms with quarterly or
monthly updates.
We will also need to negotiate licenses for payment schemes and possibly
encryption.
Since much of are content is either fielded data or SGML tagged data, it
will be fairly easy to adapt it for use on the web. The Edgar content
can be converted to HTML using unix utility languages that can cut
development time by 80% compared to regular C development.
In effect we can return a substantial NOI at 200 pages/second ($4 to
$10/second) on increments of $10,000/server. As resource intensive
functions are identified and specialty needs are singled out for
optimization, they can be migrated to performance specific platforms
such as Pyramid, VMS, or RS-6000.
The major development effort will be the development of Menus and
artwork. This can be done in house, or through free-lance designers
at $20-200/page. The interaction is similar to a game. We will need
someone who can design an interesting dialogue.
Consistent with the "Evolutionary" nature of the internet, we should
work toward getting the first "Easy" products (market-scope, Edgar)
onto the web within 90 days. We can add the various componants of
stock reports within 3 to 5 months. Many of the tools for rapid
protoyping and development are provided on the reccomended platforms,
the remainder can be loaded.
The other major effort will be negotiations. There are ways to
structure systems so that "simultainious users" can be reduced to a
very small number (under 200 oracle licenses for example) while
serving several thousand "browsers". We do want to make sure that
we do not find ourselves irrevocably committed to paying royalties
based on each individual user coming in through the "Pool" or through
through subscription. This could easily be a number in the millions.
E. McGraw-Hill coordination
This cooperation needs to occur at several levels. First, there is the
development logistics. There have been problems with the Heightstown
routers, fire-walls, and address/routing management. We need to have
coordination between the different systems such that we can add new
connections to our side within a few minutes, and can add accounts to
the outside within a few seconds.
The potential value of a McGraw-Hill "Pool" raises the possibility of
providing access to all McGraw-Hill publications on a "Bulk purchase
plan". Users could purchase "Books" of a few thousand pages (remember
web pages are very small) for $30-50 and be able to access those pages
repeatedly without extra charges. Supplementing them with dynamic
content such as S&P and PR News-Wire can manage a steady flow of
return business and new "book" purchases.
V. Resource requirements
We should begin with a prototyping system (Linux or Unix, basic service)
and a series of phased enhancments.
Prototype phase:
A. Pentium CPU as prototype server, Linux or BSDI OS. $5000
B. Configure servers, routers, fire-walls. 2 Staff Weeks
C. Httpd web server porting and configuration. 3 days.
D. Billing system configuration - 5 days.
E. Oracle and WAIS front-ends.
F. Convert Marketscope menus to Web Pages - 4 hours/page.
G. Convert Marketscope forms to Web Pages - 4 hours/page.
H. Royalty Negotiations.
Dependencies A:B B:CD C:FG D:H E:G
Conversion of marketscope forms may be automatable.
Phase 1 beta:
Stock Reports text to HTML.
Stock Reports Graphics to JPEG and GIF.
Royalty negotiations with 3rd parties.
Start Billing.
Phase 1 production:
Promotion.
Inclusion on other databases.
Inclusion links on strategic web pages.
Phase 2 beta:
Edgar SGML to HTML parser.
Edgar EDI to HTML parser.
MarketScope Alert to HTML driver.
To: "Annie Abrams"
cc: "CHARLES HUMPHRIES" , "REX BALLARD" ,
reb
Subject: Re: nj tmlp project From: Rex Ballard Date: Thu, 29 Jun 1995 17:32:19 -0400
How the Web Was Won
Subject: Re: nj tmlp project
From: Rex Ballard
Date: Thu, 29 Jun 1995 17:32:19 -0400
In-reply-to: Your message of "Wed, 24 May 95 14:01:00."
<199505241940.PAA29571@dialup.oar.net>
--------
Annie,
Let me know if you get this.
Rex
From owner-online-news@marketplace.com Fri Apr 28 22:29:29 1995
Received: from marketplace.com by cnj.digex.net with SMTP id AA16462
(5.67b8/IDA-1.5 for ); Fri, 28 Apr 1995 22:29:26 -0400
Received: (from majordom@localhost) by marketplace.com (8.6.12/8.6.12) id SAA00282 for online-news-outgoing; Fri, 28 Apr 1995 18:23:50 -0600
Received: from everest.pinn.net (everest.pinn.net [198.252.201.1]) by marketplace.com (8.6.12/8.6.12) with ESMTP id SAA00276 for ; Fri, 28 Apr 1995 18:23:44 -0600
Received: from docdon (ppp-c-19.pinn.net [198.252.201.19]) by everest.pinn.net (8.6.9/8.6.4) with SMTP id UAA29458; Fri, 28 Apr 1995 20:23:47 -0500
Message-Id: <199504290123.UAA29458@everest.pinn.net>
X-Sender: docdon@mail.pinn.net
X-Mailer: Windows Eudora Version 1.4.4
Mime-Version: 1.0
Content-Type: text/plain; charset="us-ascii"
To: amandell , online-news@marketplace.com