Peacock Data

At our original website you will be able to read about and purchase our quality data management services ranging from list scrubbing to custom programming and beyond. You will not only find the customary array, but also one-of-a-kind services only available from Peacock Data. Our specialty is our trademarked merge/purge services. Go there >>

Peacock Data 2

At our new sister website we offer a full line of database products crafted with the same quality found in our well-known data management service. There you will find unique packages relating to ZIP Codes, United States Census demographics, GeoCoding, names and nicknames, gender coding and more—and the list is growing. Go there >>

Archive for February, 2010

Feb
22

Key Dates for Census 2010 Data

Posted by: Peacock Data | Comments (0)

The U.S. Census Bureau begins sending out census forms next month. And because so many of you utilize our U.S. Census and GeoCoding database services and products, we know there is a great deal of interest in when the 2010 information will be made available. To keep you informed we put together a key dates list for this year’s U.S. Census Bureau activities as well as the availability of Census 2010 data and updated GeoCoding information. Below the date list is an interesting fact sheet provided to us by the U.S. Census Bureau. Dates are approximate except those required by law.

Census 2010 Key Dates and Peacock Data Release Schedule (data release dates are in bold)

Date Description
March 2010 Census forms are mailed or delivered to households
April 2010 Census takers visit households that did not return a form by mail
December 2010 By law, the Census Bureau delivers population information to the President for apportionment
December 2010 Peacock Data releases new GeoCoding data to prepare customers for Census 2010 data releases the following year
January 2011 Peacock Data releases initial Census 2010 data
March 2011 By law, the Census Bureau completes delivery of redistricting data to states
June 2011 Peacock Data releases initial Summary File 1 (SF1) 100% counts
December 2011 Peacock Data releases initial Summary File 2 (SF2) 100% counts


U.S. Census Bureau Fact Sheet

What is the U.S. census?

Every 10 years, the government reports the number of people who live in the United States by conducting a count called the census. This count is required by the U.S. Constitution.

Why is the U.S. census count necessary?

Census data are used to determine the number of representatives your state receives in the U.S.
Congress, as well as your county’s representation in the state legislature. Government agencies
use the data to make funding decisions for more than $300 billion each year, including:

  • Title 1 allocations
  • College grant and loan programs
  • Public transportation
  • Road and community improvements
  • Public health services and hospitals
  • Neighborhood improvements
  • Senior services

How is the 2010 Census taken?

  • Census questionnaires are given to everyone living in the United States, Puerto Rico, Guam, American Samoa, the Commonwealth of the Northern Mariana Islands, and the U.S. Virgin Islands.
  • The information is collected in two ways: by a questionnaire that is sent to every home, and through confidentiality-bound census workers who travel door-to-door.

Who should be counted?

Everyone! All children, babies, and adults who live in a household should be counted, regardless of
nationality, citizenship status, race, age, or gender.

Why are some people reluctant to be counted?

The U.S. Census Bureau believes these are the most common deterrents to census participation:

  • Privacy: Some people are reluctant to give the government personal information.
  • Confidentiality: Some people worry that the information they provide could be used against them. However, census information is completely confidential. It is never shared with other government agencies, including the IRS, any office of immigration, or the FBI. Sharing census data is a federal offense.
  • Immigration and citizenship concerns: People may not want to draw attention to themselves. However, every person in every home should be counted as part of the census.
Comments (0)

One of the most exasperating things about processing telephone data is all the junk that often gets added next to the numbers. Little notes like “cell”, “parent’s phone”, “call before 8 p.m.” and “disconnected” can wreak havoc when the information is processed with telephone update services or sent through merge/purge, as well as when utilized in-house.

While here at we clean up telephone fields before processing, many service provides do not. Having the extra information included is particularly destructive when trying to verify telephone numbers or running a reverse append. Often these numbers are flagged as invalid and not properly processed.

Merge/purge can also be harmfully affected if telephone information is used in the match criteria and matches are missed due to the “database pollution”.

The little notes can be a problem in-house as well. They show up when printing telephone lists and when supplying data to your call center. And they can also cause issues when performing queries on the tables.

If notes about telephone numbers are necessary, a separate field should be provided and utilized by the end user. And if the problem persists, database managers can limit the size of the telephone fields so there is not enough room for the notes.

Most importantly, lessons in database etiquette, including a list of do’s and don’ts, should be included in training for anyone who accesses the database tables.

A company or organization’s data resources are among its most important assets. Following a few simple rules when accessing them can greatly help maintain their value and extend their effectiveness.

Be kind—don’t pollute!

Comments (0)

An alternative structure for is to have one record per name with the variations in fields next to it. This tutorial explains how to do it.

Matching and merging names can be tricky. How do you relate William Smith with Bill Smith? The pdNickname database can be utilized to match names that are dissimilar because one has a given first name while another has a nickname or other variation, or vice versa.

Out of the box pdNickname is structured to allow immediate compatibility with the greatest number of database systems as well as to make it easy to become familiar with.

The nickname database is setup with two names per record. The first name field contains the names you are looking up, and in the second is a variation for each name—nickname, diminutive, given name, variant, etc. The same name can be listed several times in the first field, each time with a different variation. (See Figure 1.)

FIGURE 1: PDNICKNAME OUT OF THE BOX

If the names compared are Alexander Jones and Alex Jones, all names matching Alexander (NAME-A) are scanned until a variation is found that matches Alex (NAME-B). This works well, but there are other ways of organizing pdNickname that could work even better for you. In fact, we have restructured the table for utilization in our own services.

An alternative structure is to have one record per name and the variations in fields next to it. It is not practical to have separate fields for each variation, which can range from one to over two hundred. So what we do is have two Memo fields (also known as Long Text), one for close variations (relflag = "1") and the other for more distant variations (relflag = "2"), with the string of variations separated by delimiters for easier matching. (See Figure 2.)

FIGURE 2: PDNICKNAME RESTRUCTURED

Note: when browsing a table, normally you cannot see the content of a Memo or Long Text field because the database keeps it in a separate file. For this screenshot we have made the content visible.

Structured this way, when your program finds a match for NAME-A, it then determines if NAME-B can be found in variation field one or variation field two. This can be faster because you only access one record in each search request. The code sample below is an example in Visual FoxPro that illustrates this. Of course other programs use different commands and syntax to achieve the same outcome.

* CODE SAMPLE

*- this Visual FoxPro function receives as parameters
*- the two first names being compared - it returns a
*- variable indicating what matches are found - this
*- function is based on the restructuring of the
*- pdNickname database described in this tutorial

FUNCTION pdNickname
LPARAMETERS cNameA, cNameB
LOCAL nMatch

IF NOT USED("nicknames")
    USE nicknames ALIAS nicknames IN 0
ENDIF

cNameA = PADR(UPPER(ALLTRIM(cNameA)),25," ")
cNameB = "/"+UPPER(ALLTRIM(cNameB))+"/"

nMatch = 0
IF SEEK (cNameA, "nicknames", "name")
    DO CASE
    CASE OCCURS(cNameB, nicknames.variations) > 0
        nMatch = 1
    CASE OCCURS(cNameB, nicknames.var2) > 0
        nMatch = 2
    ENDCASE
ENDIF

RETURN nMatch

pdNickname, like all our Database Products, are structured to satisfy most users from the start. But there are many ways to integrate the databases into your system. It is up to you to determine what works best for you. Do not be afraid to experiment.

Comments (0)
Feb
01

Start at the Finish Line

Posted by: Peacock Data | Comments (0)

When planning a new data management projects, results will be better, costs lower and headaches lessened if you consider what your objectives are before you begin. Always start by thinking about what you want to do with your data both now and down the road.

The worst kind of database system is one put together piecemeal as new demands arise. At some point it becomes more of a hindrance that a help. Many become monsters that seem to have a life of their own.

Before you design your databases, tables and user interfaces and decide on purchases, consider all the kinds of data you want to track and the best and most resourceful way of doing so. But to do this you need to gather some information first.

Talk to the end users who will utilize your data. Find out what they need and how they will be using it. Just as important, determine what they would like to do in the future and what has frustrated them most about data in the past.

Also talk to those who will be entering information and the techs who will be working directly with the database system. Find out what will make them more efficient and what has previously held them back.

Finally talk to the vendors who will be processing your data and supplying equipment and third-party lists. Ask them what you can do to help them achieve the best results for you and reduce costs without sacrificing quality.

All of this will affect what tables you design, what fields they will contain, what relationships there will be between them and how end-users will access the information. It will also affect what equipment and lists you acquire, when you buy them and who you hire to make it all work.

Your new data management project should not be planned until after gathering the insight needed to establish what the end results should be. Once you have seen the view from the finish line, you will be much better equipped to create a database system that will get you where you want to be days, months and years from now.

Comments (0)
Feb
01

Product Price Changes

Posted by: Peacock Data | Comments (0)

Good news! We are pleased to announce that as of today, February 1, 2010, there are new prices on most of our Database Products and Developer Licenses—and they are all down. Reductions range from $5 to as much as $1,210. See the chart below for a complete listing of changes:

PRICE ADJUSTMENTS

Product Old Price New Price
pdZIP – 5-digit ZIP Code Database $35 $25
pdCensus2000 – Census Demographics Database $250 $195
pdGeoTiger – Address Range & ZIP+4 GeoCoding Databases $500 $395
pdNickname – Name & Nickname Database $500 $495
pdCountry – Country Database $35 $25
pdSuite Master Collection – All database products in one package $1,100 $990
Developer Licenses Old Price New Price
pdZIP – Developer License $200 $100
pdCensus2000 – Developer License $500 $195
pdGeoTiger – Developer License $1,000 $395
pdGender – Developer License $700 $350
pdNickName – Developer License $1,000 $495
pdCountry – Developer License $70 $25
pdSuite Master Collection – Developer License $2,200 $990
Categories : Announcements
Comments (0)

Services

Products