Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Free Learning
Arrow right icon
Arrow up icon
GO TO TOP
Hands-On SAS for Data Analysis

You're reading from   Hands-On SAS for Data Analysis A practical guide to performing effective queries, data visualization, and reporting techniques

Arrow left icon
Product type Paperback
Published in Sep 2019
Publisher Packt
ISBN-13 9781788839822
Length 346 pages
Edition 1st Edition
Languages
Tools
Arrow right icon
Author (1):
Arrow left icon
Harish Gulati Harish Gulati
Author Profile Icon Harish Gulati
Harish Gulati
Arrow right icon
View More author details
Toc

Table of Contents (17) Chapters Close

Preface 1. Section 1: SAS Basics
2. Introduction to SAS Programming FREE CHAPTER 3. Data Manipulation and Transformation 4. Section 2: Merging, Optimizing, and Descriptive Statistics
5. Combining, Indexing, Encryption, and Compression Techniques Simplified 6. Power of Statistics, Reporting, Transforming Procedures, and Functions 7. Section 3: Advanced Programming
8. Advanced Programming Techniques - SAS Macros 9. Powerful Functions, Options, and Automatic Variables Simplified 10. Section 4: SQL in SAS
11. Advanced Programming Techniques Using PROC SQL 12. Deep Dive into PROC SQL 13. Section 5: Data Visualization and Reporting
14. Data Visualization 15. Reporting and Output Delivery System 16. Other Books You May Enjoy

Indexing

We broached the subject of creating an index file in previous chapters. In this section, we will describe how the indexes are stored and retrieved. The index file consists of entries that are organized hierarchically and connected by pointers, all of which are maintained by SAS. The lowest level in the index file hierarchy consists of entries that represent each distinct value for an indexed variable, in ascending value order. Each entry contains the following information:

  • A distinct value
  • One or more unique record identifiers (referred to as a RID) that identify each observation that contains the value

If we created an index for the City variable using the AC dataset in the mismatch dataset, we would have an index file with entries such as the following:

Value

RID

Adelaide

1

Copenhagen

2

Hong Kong

3, 4, 5, 6

hong Kong

7, 8, 9

Let...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Banner background image