Senior Manager, Resiliency Engineering
McLean, VA, USA | Capital One
Functions:IT / Information Technology
Job Description:50 people have viewed this job
We are actively seeking highly creative and intellectually curious Technology Professionals with a deep technical expertise in networking and network security to join our Network Resiliency and Assurance team. This is an opportunity for a Senior Technical Lead to be a part of a high-performance team which is responsible for ensuring the overall resiliency of the Capital One network.The role will include performing resiliency and chaos testing for the network, security and telephony within Capital One. Ensuring operational excellence of new designs and builds for Capital One’s large scale enterprise network environments. Streamlining and optimizing the performance of the existing network, security and telephony platforms. Engage with Capital One’s internal and external audit resources to ensure our compliance requirements are met. Foster the use of expert level troubleshooting skills by the team. Our network and network security environments play a major role in protecting our company, so ensuring optimal performance of the environment is always critical.
On any given day you will:
Serve as Resiliency team’s Technical Lead to ensure resiliency of the Capital One network environment(s).
Perform and document resiliency and chaos testing.
Ensuring Resiliency within the Network Fabric in support of Capital One’s cloud strategy.
Consult on the development of the enterprise support strategy and assist with lifecycle management efforts for platforms and technologies inclusive of Wide Area Network Carrier, Routers, Switches, Wireless, DNS and on some occasions Firewall, Cyber Defense and Proxy services.
Diligently manage various platform environment(s) to ensure they are resilient and operating at optimal levels and drive manufacturer support engagements as required.
Drive major incident and problem resolution for incidents related to resiliency, consult with other technology support groups as needed including Operational Advance Support and Engineering Teams as part of problem resolution efforts.
Assist with the implementation of new platforms, code versions, and features to ensuring resiliency requirements.
Assess security vulnerabilities, identify mitigation plans and successfully implement them.
Consult on the creation and/or maintenance of standards documentation, operational design documentation, templates, topology diagrams and workflow documents.
Participate in technology integration efforts with engineering and other support teams to ensure new designs and implementation are resilient and meet the high standard of operational excellence for Capital One.
Provide direct support of audit and ad hoc consulting engagements.
Ensure compliance with departmental and enterprise configuration standards.
Serve as a mentor and technical resource for various team(s) and provide training to associates through one-on-one or group technical discussions.
This position will require some planned and ad hoc travel.
Successful candidates will possess:
Significantly demonstrable experience supporting large-scale enterprise networks.
Able to quickly understand complex routing solutions including routing protocol such has EBGP/IBGP, ISIS, OSPF, and EIGRP.
High level of understanding of Proxies, Firewalls (PaloAlto/CheckPoint), DNS, and F5 LTM/GTMs
CCIE level knowledge or better for network platforms including wireless solutions.
Experience with SD-WAN solutions.
Working knowledge of Cloud network inter-connects deployed in a Regional Hub design.
Skilled in scripting and automation.
Enterprise experience in leading network incident management activities relative to resiliency.
Strong analytical, problem solving, and organizational skills with a high degree of attention to detail.
Ability to work both independently as well as part of a geographically dispersed yet integrated team.
Ability to balance multiple priorities in a fast-paced, highly collaborative, frequently changing, and sometimes ambiguous environment.
Expert level knowledge of network management and packet capture tools.
Familiarity with Agile / DevOps delivery model, industry standard network management tools, and common application traffic flow patterns in multi-tiered applications.
A solid understanding of what comprises a scalable, robust and supportable design.
A bias toward action, along with an internal drive for continuous improvement.
Excellent written, oral and interpersonal communications skills to:
Discuss complex technical issues with technicians, engineers, and vendors.
Assemble and clearly present technical information in a business-like manner to non-technical personnel.
Lead and facilitate communications with people in immediate department, other departments, and external third parties.
Inform and influence senior leaders and peers.
This position is an operational role. As such, periodic late-night work and participation in an on-call rotation will be required. At times the late-night work may come with minimal notice.
At least 6 years’ professional experience in an operational role or a technical leadership role supporting an enterprise network infrastructure that is geographically distributed
At least 6 years’ experience with Cisco routing and switching technologies
At least 2 years’ AWS, Google or Azure cloud provider experience
At least 2 years of experience developing scripts for network automation
Master’s degree in Information Technology or Information Security.
4+ years’ experience with demonstrated technical proficiency in Infoblox DNS/DHCP, BIND, Microsoft DNS
4+ years’ experience with demonstrated technical proficiency in Blue Coat Proxy servers or 4+ years’ experience with demonstrated technical proficiency in Aruba wireless access points, controllers and Clearpass.
Already a member? Sign In