Updated: August 24, 2015
Ladies and gentlemen, I am most proud to announce the imminent publication of my latest and greatest tech book, Problem-solving in High Performance Computing: A Situational Awareness Approach with Linux. Published by Morgan Kaufmann, Elsevier, it will hit the markets in roughly mid-late September 2015.
For the past 17 months, I have worked hard behind the scenes compiling this book, and putting in pretty much everything I know about large-scale computing and problem solving. The end result is about 320 pages of deep Linux troubleshooting, statistical engineering, and best practices. For anyone who even remotely considers using Linux for work, this should definitely intrigue you.
Press stuff
If you're interested, here are some more details from the backcover:
Problem-solving in High Performance Computing is focused on understanding giant computing grids as cohesive
systems. Unlike other titles on general problem solving or system administration, this book offers a cohesive
approach to problem solving in complex, layered environments. It highlights the difference between standalone
system troubleshooting and complex problem solving in large, mission critical environments, and addresses the
pitfalls of information overload, micro and macro symptoms in problem solving, and offers highly modular,
extensible, scalable tools and methods for managing problems in large computing ecosystems.
The authors offer perspective gained from years of developing Intel-based systems that lead the industry in the
number of hosts, software tools and licenses used in chip design. They cover the entire stack, from identifying
the problem, understanding and reproducing it; to solving it, using a methodical, top-down approach, while
handling the constraints of a global computing grid. Throughout, the book offers unique, real-life examples
that emphasize the magnitude and operational complexity of high performance computer systems.
Provides insider perspectives on challenges in high performance environments with thousands of servers,
millions of cores, distributed data centers, and petabytes of shared data.
Covers analysis, troubleshooting, and system optimization, from initial diagnostics to deep dives into kernel
crash dumps.
Presents macro principles that appeal to a wide range of users and various real-life, complex problems.
Includes examples from 24/7 mission-critical environments with specific HPC operational constraints.
|
Book details:Publisher: Morgan Kaufmann, 1st edition Language: English Trim size: 7.5" x 9.2" (19.1 x 23.4 cm) Color: Black & White on White paper Paperback: 320 pages ISBN-13: 0128010193 ISBN-10: 978-0128010198 Availability: Now |
Call for reviews
I already have several professional reviewers lined up, but I would like to extend the call to the wider community of bloggers, Linux users and experts, and academics to take a look at my work. I can provide free copies, but in return, I kindly expect thorough and honest reviews.
Now, the ideal target audience includes professional system administrators, power users and developers, engineers, and solutions architects. If you are willing to take on this book, it would be awesome.
Ideally, you would have a reasonably high profile website or journal, decent experience working with Linux, and be willing to commit your final opinion to words, both on Amazon and your own personal corner of the Web. Not a must, but do take this into consideration.
The book is going to be hugely expensive, but I have no control over that, so this might be a nice addition to your proverbial tech drawer. If you are interested, please drop me an email, titled Problem solving, and we can discuss things further. Please introduce yourself, your site, and any other details. I can also provide the introduction section and the first chapter as press samples.
Needless to say, it will be a great honor for me to have people read and enjoy my book. The principles are the same - no-nonsense, step-by-step approach, like all my other guides and tutorials.
Conclusion
There you go. I dare say this is my best work yet. It really covers everything you may encounter when trying to figure out a problem in Linux. Starting with: Do you even have a problem? Then, the debugging gets more and more serious, we delve deeper and deeper, then swoop back up and analyze the whole situation. I honestly believe the book should serve you well in all your troubleshooting escapades, no matter what you do.
In a way, the book is a culmination of almost 10 years of Dedoimedo, plus my professional career, and it's a genuine attempt to provide a methodical approach to problem solving, something the Internet never quite gave me. If I may be cheesy for a moment, having you, my readers, on my side, helping, providing feedback and encouragement, was a crucial factor that kept me going. To wit, this book is dedicated to you. No joke. Check Page V.
Last but not the least, this could be a good way for you to support Dedoimedo. The book is immensely expensive, and I will try to organize discounts and freebies and whatnot, but anyhow, that's how it is. There. I sincerely hope you enjoy it.
Cheers.