Lect 10a - Wrapping Up

Synopsis:

Professional developers are distinguished from students and academics by the size of the projects they build. Industrial-scale projects are often composed of many packages, sometimes a few thousand.

To be effective, professionals need mental catalogs of interesting ways large systems are structured, and the ability to build parts of a chosen structure so they are correct, readable, composable, and fast.

CSE681-OnLine : Software Modeling and Analysis (SMA), the first course in this sequence, focuses on system structures.
CSE687-OnLine : Object Oriented Design (OOD), this course, is concerned with building elegant, reliable packages.

In this last lecture, we merge those two points of view by examining our Sample Project #4's design and implementation. At the end, we survey the topics we've studied, and help you find resources you may need later.

Readings and Activities for the Week:

Complete Project #4 and submit.
Pat yourself on the back - Cheers!

Glossary of Terms

Large System:
A system composed of many packages, often multiple processes, that may use a uniform communication protocol and mechanisms for sharing resources. Successfully implementing a large system requires a strong concept, solid infrastructure, a team of well-trained developers, and an adhered-to process for development, often modeled on Agile Development ideas. The largest modern software system, the web, was built on top of the internet, and grows incrementally. That works because it's based on effective protocols and is decentralized.
Client-Server:
Client-Server systems have a server for sharing resources among clients, supporting concurrent access by clients, and usually communicating via HTTP in a "request-wait for reply" style.
Federation:
A set of servers and clients that communicate asynchronously, often using HTTP-style messaging. Each server has a dedicated role that combines with the roles of the other servers to deliver a large, multi-faceted service to federation clients.
Message-Passing Communication:
An internet-based service for transporting messages from sender to receiver. Messages are often HTTP-style constructs with send and receive addresses for routing requests and the resulting replies, a command for specifying the requested action, and an optional body. The body carries information the receiver needs to execute the requested task, or carries back the contents of the reply.
Tools for Building Large Systems
Building large systems requires tools for creating products and tools for managing work items as they migrate from initial concept to finished product. These usually include:
- Software Development IDEs
  
  Visual Studio, Eclipse, and Netbeans are commonly used for Windows and Linux development.
- Code Repositories
  
  While developing large systems, the product code base grows to become - you guessed it - large. To keep track of all those pieces, and their several versions, we depend on smart storage that understands package ownership and versioning. It is also very useful if the storage mechanism knows about package dependency relationships. That's where code repositories become essential.
  The Project #4 Sample for CSE681-OnLine is a prototype for a code repository based on dependency relationships. It saves package dependency information, entered by the user, in metadata files associated with each controlled repository item.
- Test Harnesses
  There are several types of testing needed for development of large systems:
  - Construction Tests
  - Unit Tests
  - Integration
  - Regression
  - Performance and Stress
  - Qualification
  All but the first two work on very large software baselines, and require significant automation. That's what a test harness provides.
- Storage and Disclosure Tools
  
  Schedules, work descriptions, specifications, and test documents all need to be saved and disclosed to any project developer on demand. We often use SQL or NoSQL databases for this purpose.
- Code Analysis Tools
  
  The Code Analyzer - Sample Project #4 is a good example. It examines source code in a directory tree, analyzing packages, namespaces, classes, and functions, for measures of size and complexity. It also reports the number of lines in each package.
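The HTTP-style message described in the Message-Passing Communication entry above might be sketched in C++ roughly as follows. This is a minimal illustration, not the course's messaging code: the Message struct, its "name:value" text format, and the makeReply helper are all assumptions made for this example.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Hypothetical HTTP-style message: "name:value" attribute lines,
// a blank line, then an optional body.
struct Message {
    std::string to;       // receive address, e.g. "localhost:8080"
    std::string from;     // send address, used to route the reply
    std::string command;  // requested action
    std::string body;     // optional payload for the request or reply

    // Serialize to attribute lines, a blank line, and the body.
    std::string toString() const {
        std::ostringstream out;
        out << "to:" << to << "\n"
            << "from:" << from << "\n"
            << "command:" << command << "\n\n"
            << body;
        return out.str();
    }

    // Parse text produced by toString(); unknown attributes are ignored.
    static Message fromString(const std::string& text) {
        Message msg;
        const std::size_t headerEnd = text.find("\n\n");
        if (headerEnd != std::string::npos) msg.body = text.substr(headerEnd + 2);
        std::istringstream headers(text.substr(0, headerEnd));
        std::string line;
        while (std::getline(headers, line)) {
            const std::size_t colon = line.find(':');  // first colon ends the name
            if (colon == std::string::npos) continue;
            const std::string name = line.substr(0, colon);
            const std::string value = line.substr(colon + 1);
            if (name == "to") msg.to = value;
            else if (name == "from") msg.from = value;
            else if (name == "command") msg.command = value;
        }
        return msg;
    }
};

// A reply travels back along the request's route, so the addresses are swapped.
Message makeReply(const Message& request, const std::string& replyBody) {
    assert(!request.from.empty());  // a reply needs a return address
    Message reply;
    reply.to = request.from;
    reply.from = request.to;
    reply.command = request.command;
    reply.body = replyBody;
    return reply;
}
```

Keeping both addresses in every message is what lets a receiver answer asynchronously: it needs no open connection to the sender, only the from attribute.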

Projects:

Sample Project #1, Code folder
Lexical scanner - Tokenizer and SemiExpression token collector
Sample Project #2, Code folder
Rule-based parser - Parser contains rules which each contain actions invoked if semiExp matches rule
Sample Project #3, Code folder
Parser with Abstract Syntax Tree - AST is a container for analysis information, built during analysis, used for display
Sample Project #4, Code folder
Code Analyzer - analyzes code metrics, SLOCs, and shows AST contents, uses GUI

Software Systems are Structured with Classes and Packages (SMA meets OOD):

The Code Analyzer we used for the Project #4 Sample is composed of more than a dozen packages with a total of 10,616 lines of code. While not "large", it is an industrial scale project, with some interesting structures, supported by both application-side and solution-side packages. We will look at its code and its package, class, and activity diagrams to see how an interesting structure can be implemented with packages:

Project #4 Requirements, Project #4 Sample code

The Visual Code Analyzer application consists of two processes:

Code Analyzer accepts the path to code to be analyzed, and a set of analysis attributes, on its command line. It then sweeps through the directory tree rooted at the specified path, and analyzes all the types of files specified in the command line attributes. It has several modes of display, e.g., Code Metrics, Abstract Syntax Tree contents, or Source Lines of Code.
The analyzer writes a log file at the root of the analysis path, so users can elect to view that later.
Visual Code Analyzer builds the Code Analyzer's complex command line, supporting browsing for the analysis path and setting the types of analysis display.

This structure turns out to be very effective. We can elect to use the Code Analyzer directly, perhaps in a script. Most users will use the Visual Code Analyzer GUI. That opens with the last set of attributes selected, but allows users to selectively change those.
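The GUI's job of assembling a command line for the analyzer process can be sketched as below. Everything here is hypothetical: the AnalysisOptions fields, the option switches (/metrics, /ast, /slocs), and the argument order are placeholders chosen for illustration, and the sample project's actual command line may look quite different.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical set of attributes a user might select in the GUI.
struct AnalysisOptions {
    std::string path;                   // root of the directory tree to analyze
    std::vector<std::string> patterns;  // file types to analyze, e.g. "*.h"
    bool showMetrics = false;           // display code metrics
    bool showAst = false;               // display Abstract Syntax Tree contents
    bool showSlocs = false;             // display source lines of code
};

// Wrap an argument in quotes if it contains spaces, so it stays one argument.
std::string quoteIfNeeded(const std::string& arg) {
    if (arg.find(' ') == std::string::npos) return arg;
    return "\"" + arg + "\"";
}

// Build the command line: executable, path, file patterns, then display switches.
// The switch names are placeholders, not the analyzer's real options.
std::string buildCommandLine(const std::string& exe, const AnalysisOptions& opts) {
    assert(!opts.path.empty());  // an analysis path is required
    std::string cmd = quoteIfNeeded(exe) + " " + quoteIfNeeded(opts.path);
    for (const auto& pattern : opts.patterns) cmd += " " + pattern;
    if (opts.showMetrics) cmd += " /metrics";
    if (opts.showAst) cmd += " /ast";
    if (opts.showSlocs) cmd += " /slocs";
    return cmd;
}
```

Because the result is an ordinary string, the same command line works whether the GUI launches the analyzer or a user pastes it into a script.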


Visual Code Analyzer Packages

Visual Code Analyzer Classes

Visual Code Analyzer Output

The package diagram is relatively simple, showing us the major parts of the application and how they relate to each other. This simplicity is a useful abstraction, but it hides a lot of important detail.

The class diagram makes it clear that there is some significant code complexity in this application. That has to be managed by dividing the implementation into a number of relatively small and manageable classes. When we do that, we have to think carefully about ownership and communication.

For example, the ConfigureParser instance creates and owns almost all the application's low-level parts, e.g., Parser, Semiexp, Tokenizer, all the derived Rules, all the derived Actions, and the Repository. Each time it creates a derived action it passes, to the action constructor, a reference to the Repository. That means that the actions can all access the Abstract Syntax Tree and Scope Stack, which they need to do to carry out their tasks.
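A much-simplified sketch of that ownership pattern follows. The class names echo the ones in the text, but the bodies are stand-ins written for this illustration: the real Repository, actions, and ConfigureParser have richer interfaces, and the PushScope, AddToAst, and handle names are assumptions.

```cpp
#include <cassert>
#include <memory>
#include <string>
#include <vector>

// Stand-in for the Repository: holds the shared analysis state.
struct Repository {
    std::vector<std::string> scopeStack;  // stand-in for the Scope Stack
    std::vector<std::string> ast;         // stand-in for the Abstract Syntax Tree
};

// Base for all actions (simplified).
class IAction {
public:
    virtual ~IAction() = default;
    virtual void doAction(const std::string& semiExp) = 0;
};

// Hypothetical action: pushes a new scope onto the shared Scope Stack.
class PushScope : public IAction {
public:
    explicit PushScope(Repository& repo) : repo_(repo) {}
    void doAction(const std::string& semiExp) override {
        repo_.scopeStack.push_back(semiExp);
    }
private:
    Repository& repo_;  // non-owning reference passed in by ConfigureParser
};

// Hypothetical action: records the current scope in the shared tree.
class AddToAst : public IAction {
public:
    explicit AddToAst(Repository& repo) : repo_(repo) {}
    void doAction(const std::string&) override {
        assert(!repo_.scopeStack.empty());  // relies on state written by PushScope
        repo_.ast.push_back(repo_.scopeStack.back());
    }
private:
    Repository& repo_;
};

// Creates and owns the parts; hands each action a reference to the Repository.
class ConfigureParser {
public:
    ConfigureParser() {
        actions_.push_back(std::unique_ptr<IAction>(new PushScope(repo_)));
        actions_.push_back(std::unique_ptr<IAction>(new AddToAst(repo_)));
    }
    // Actions hold references into this object, so copying is disabled.
    ConfigureParser(const ConfigureParser&) = delete;
    ConfigureParser& operator=(const ConfigureParser&) = delete;

    // Simplification: apply every action in order (no rule matching here).
    void handle(const std::string& semiExp) {
        for (auto& action : actions_) action->doAction(semiExp);
    }
    const Repository& repository() const { return repo_; }
private:
    Repository repo_;  // declared first, so it is destroyed after the actions
    std::vector<std::unique_ptr<IAction>> actions_;
};
```

Because each action holds only a reference, there is exactly one Repository, and its lifetime is tied to the ConfigureParser instance that owns it.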

The ownership relationships are clearly shown in the class diagram, by means of the composition, aggregation, and using connectors. Some classes, like GrammarHelpers, are not owned by any other part. This class has all static methods, so no instance is created. The derived rules simply use it by calling its functions preceded by the class name.
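As a small illustration of that using relationship, here is a helper class with only static methods and a rule that calls it through the class name. The method isControlKeyword and the rule FunctionDefinitionRule are hypothetical, and the rule omits the base class a real derived rule would have.

```cpp
#include <cassert>
#include <string>

// Hypothetical stand-in for a helper class like GrammarHelpers.
class GrammarHelpers {
public:
    GrammarHelpers() = delete;  // only static methods, so no instance is ever created

    // True if the token is a control keyword that can precede "(".
    static bool isControlKeyword(const std::string& token) {
        return token == "if" || token == "for" || token == "while" ||
               token == "switch" || token == "catch";
    }
};

// Hypothetical rule: uses the helper without owning or constructing it.
class FunctionDefinitionRule {
public:
    // True when the token before "(" could be a function name.
    bool doTest(const std::string& nameToken) const {
        assert(!nameToken.empty());  // callers pass a real token
        return !GrammarHelpers::isControlKeyword(nameToken);
    }
};
```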

Three classes, StringHelper, Converter - both part of the Utilities package - and Logger are used by most of the classes in the application. It would be counter-productive to show all those associations directly. That would make the diagram so dense with association lines that it would be almost unreadable. So we simply show them with no associations.

In this course, CSE687-OnLine, we've focused on techniques for building individual packages so they are flexible and robust. We also need the structuring ideas discussed in CSE681-Online, to help bind all the packages we build into a coherent whole.

Course Review:

A summary of course topics, with links to many of the most important details.

Course Take-aways:

The most important things we covered in this course are:

Structure and style for packages:

Packages are the fundamental units of Software Systems. We need to know how to build them well.
Package Structure Matters and Blocking Queue example.
Syntax and structure of the C++ Language:

C++ is a very effective and expressive language for building packages. We need to know its guiding principles and have an approximate model of the C++ compiler and what it generates in our developer heads.
ADT Lecture, Templates Lecture and Class Relationships Lecture
Prepackaged Frameworks for building software systems:

There is a rich collection of resources in the C++ ecosystem, and also some surprising omissions. We need to know how to interoperate with code from other languages to use their resources as well.
Standard Library Lecture
Custom Libraries to support native code operations:

Many of the things that C++ elected not to supply we can build without too much effort, e.g., Sockets, XmlDocument, and CppProperties. Sockets Lecture and our Repository Code.
Practical project experiences, using all of the above:
Four industrial style projects that progressively build a useful code analysis tool:
- Sample Project #1
  Lexical scanner
- Sample Project #2
  Rule-based parser
- Sample Project #3
  Parser with Abstract Syntax Tree
- Sample Project #4
  Code Analyzer
These projects used most of the C++ features we've discussed, and gave you the opportunity to apply some interesting design ideas.
