Gene ECH74115_2078 details

Gene Information

Locus tag: ECH74115_2078
Symbol: narZ
ID: 6972037
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1972686
End bp: 1976426
Gene Length: 3741 bp
Protein Length: 1246 aa
Translation table: 11
GC content: 55%
IMG OID: 643385983
Product: nitrate reductase 2, alpha subunit
Protein accession: YP_002270472
Protein GI: 209398888
COG category: [C] Energy production and conversion
COG ID: [COG5013] Nitrate reductase alpha subunit
TIGRFAM ID: [TIGR01580] respiratory nitrate reductase, alpha subunit
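
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the record; it assumes that Start bp and End bp are 1-based inclusive coordinates and that Gene Length includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
START_BP = 1972686
END_BP = 1976426

length = gene_length(START_BP, END_BP)
print(length)                  # 3741, matching "Gene Length"
print(protein_length(length))  # 1246, matching "Protein Length"
```

Under these assumptions, both derived values agree with the Gene Length and Protein Length reported in the record.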


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 65
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTAAAC TTTTGGATCG CTTTCGCTAC TTCAAACAAA AGGGCGATAC CTTTGCCGAT 
GGTCACGGAC AGGTGATGCA TAGCAACCGC GACTGGGAGG ACAGCTATCG CCAGCGTTGG
CAGTTCGACA AAATCGTGCG TTCCACCCAC GGTGTTAACT GTACAGGCTC CTGTAGCTGG
AAAATCTACG TTAAAAATGG TCTGGTGACC TGGGAAATCC AACAGACCGA CTACCCGCGC
ACTCGCCCTG ACCTGCCCAA TCATGAACCT CGCGGCTGCC CGCGTGGCGC AAGTTACTCC
TGGTATCTTT ACAGCGCTAA CCGCCTGAAA TACCCGCTCA TTCGTAAACG ACTGATTGAA
CTGTGGCGCG AAGCCCTCAA ACAACACAGC GATCCGGTAC TGGCGTGGGC ATCGATTATG
AACGATCCGC AAAAGAGCCT GAGCTACAAA CAAGTGCGTG GGCGCGGCGG GTTTATCCGC
TCCAACTGGC CGGAACTAAA CCAGCTGATT GCCGCCGCTA ACGTCTGGAC TATCAAAACC
TACGGCCCGG ATCGCGTTGC CGGTTTCTCG CCGATCCCGG CGATGTCGAT GGTTTCTTAC
GCCGCCGGAA CACGTTATCT GTCACTATTA GGTGGCACCT GTTTAAGCTT CTACGACTGG
TATTGCGACC TGCCGCCCGC CTCGCCGATG ACCTGGGGCG AGCAAACCGA CGTACCGGAA
TCTGCCGACT GGTATAACTC CAGTTACATC ATCGCGTGGG GGTCTAACGT ACCGCAGACA
CGTACGCCGG ACGCCCATTT CTTTACCGAA GTACGCTACA AAGGCACTAA AACCATCGCC
ATTACCCCTG ACTACTCTGA AGTGGCCAAA TTGTGCGACC AGTGGCTGGC ACCAAAACAA
GGCACTGATA GCGCCCTGGC GATGGCAATG GGCCATGTGA TTTTAAAAGA GTTTCATCTC
GATAATCCCA GCGACTACTT TATCAACTAC TGCCGCCGCT ATAGCGACAT GCCGATGCTG
GTAATGCTGG AACCTCGCGA CGATGGTAGC TACGTACCCG GGCGCATGAT CCGCGCGTCT
GACCTGGTAG ATGGACTAGG CGAAAGCAAT AATCCGCAGT GGAAAACCGT GGCGGTTAAT
ACCGCAGGTG AATTGGTAGT GCCGAATGGT TCGATTGGTT TCCGCTGGGG AGAAAAAGGC
AAATGGAATC TGGAATCCAT TGCCGCCGGT AAGGAAACCG AATTGTCGTT AACCCTGCTC
GGTCAACATG ACGCCGTTGC AGGCGTGGCT TTCCCCTACT TTGGCGGCAT CGAAAATCCG
CATTTTCGCA GCGTCAAACA CAATCCTGTC CTGGTGCGTC AATTACCCGT TAAAAACCTG
ACATTAGCCG GTGGCAGCAC CTGTCCAGTG GTCAGCGTTT ATGATTTGGT ACTGGCGAAT
TACGGCCTCG ATCGCGGGCT GGAAGATGAA AACAGCGCGA AAGATTACGC TGAAATCAAA
CCGTACACCC CAGCCTGGGG TGAGCAAATT ACCGGCGTGC CGCGCCAGTA TATTGAAACC
ATCGCTCGTG AATTTGCCGA TACTGCCCAT AAAACGCATG GGCGCTCGAT GATTATCCTC
GGCGCAGGTG TTAACCACTG GTATCACATG GACATGAACT ACCGGGGAAT GATCAATATG
CTGATCTTCT GCGGTTGTGT CGGACAAAGC GGTGGCGGCT GGGCACACTA TGTCGGTCAG
GAAAAACTGC GCCCGCAAAC CGGCTGGTTG CCGCTGGCCT TTGCGCTCGA CTGGAACCGA
CCACCGCGCC AGATGAACAG CACCTCGTTT TTCTACAATC ATTCCAGCCA ATGGCGCTAT
GAAAAAGTCT CTGCGCAGGA GTTACTTTCA CCGCTTGCCG ATGCCAGTAA GTACAGCGGT
CATCTGATTG ATTTTAACGT TCGCGCCGAA CGCATGGGCT GGCTACCTTC TGCGCCGCAG
TTGGGGCGTA ACCCGCTCGG GATTAAAGCT GAAGCCGACA AGGCCGGATT ATCCCCCACA
GAATTTACCG CCCAGGCGCT GAAATCGGGC GATTTACGTA TGGCCTGCGA ACAACCAGAT
AGCGGCAGCA ATCATCCGCG TAATTTGTTT GTCTGGCGTT CTAACCTGCT TGGCTCCTCC
GGCAAAGGCC ACGAGTATAT GCAGAAGTAT CTGCTGGGGA CCGAAAGCGG GATTCAGGGC
GAGGAACTCG GTGCCAGCGA CGGGATCAAA CCGGAAGAAG TCGAGTGGCA AACTGCGGCG
ATTGAAGGCA AGCTCGACCT GCTGGTGACG CTCGACTTCC GCATGTCCAG TACCTGCCTG
TTCTCCGATA TCGTTCTGCC CACCGCCACC TGGTATGAAA AAGACGATAT GAACACCTCG
GATATGCATC CATTTATTCA TCCGCTTTCT GCGGCGGTCG ATCCGGCCTG GGAATCACGC
AGCGACTGGG AAATCTACAA AGGTATTGCC AAAGCATTTT CGCAAGTGTG CGTGGGCCAT
CTTGGCAAAG AAACCGACGT GGTATTACAA CCACTGCTGC ATGACTCTCC GGCAGAGCTC
TCACAGCCGT GTGAAGTGCT CGACTGGCGC AAAGGCGAAT GCGATCTGAG CCCGGGTAAA
ACCGCGCCGA ATATTGTGGC GGTGGAGCGC GACTACCCTG CTACGTATGA ACGCTTTACC
TCGCTCGGGC CATTGATGGA CAAACTTGGC AACGGCGGTA AAGGGATTTC GTGGAATACG
CAGGATGAAA TCGATTTCCT CGGTAAACTC AATTACACCA AGCGTGATGG CCCAGCGCAG
GGGCGTCCGC TGATTGACAC CGCCATTGAC GCTTCAGAAG TGATTCTGGC ACTGGCACCA
GAAACCAACG GTCATGTTGC AGTTAAAGCG TGGCAGGCGC TGGGCGAGAT CACCGGACGC
GAACATACCC ATCTGGCGCT GCACAAAGAG GACGAGAAGA TTCGCTTTCG CGATATTCAG
GCGCAGCCGC GTAAAATTAT CTCCAGCCCC ACATGGTCTG GTCTGGAAAG CGATCACGTC
TCCTATAATG CGGGATACAC CAACGTTCAT GAGTTAATTC CGTGGCGCAC GCTGTCGGGA
CGCCAGCAGC TCTATCAGGA TCATCCGTGG ATGCGTGCTT TTGGTGAAAG CCTGGTGGCA
TATCGCCCGC CTATCGACAC CCGTAGCGTC AGTGAGATGC GCCAGATCCC GCCAAACGGC
TTCCCGGAAA AAGCACTTAA CTTCCTGACG CCGCACCAGA AATGGGGCAT TCACTCAACC
TACAGTGAAA ACCTGCTAAT GCTGACGCTC TCTCGCGGTG GACCGATTGT CTGGATCAGC
GAAACCGATG CCCGTGAACT AACCATTGTC GATAACGACT GGGTGGAAGT GTTTAACGCC
AATGGCGCGC TGACGGCCCG CGCGGTGGTC AGCCAACGTG TACCGCCGGG TATGACCATG
ATGTATCACG CTCAGGAACG CATTATGAAT ATTCCTGGTT CGGAAGTAAC TGGCATGCGC
GGCGGTATTC ATAACTCGGT CACCCGCATT TGCCCGAAAC CAACGCATAT GATTGGCGGT
TACGCGCAGC TGGCCTGGGG CTTTAACTAC TACGGCACCG TCGGCTCGAA CCGCGACGAA
TTCATCATGA TCCGCAAGAT GAAGAACGTT AACTGGCTGG ATGATGAAGG TCGCGATCAG
GTACAGGAGG CGAAAAAATG A
 
Protein sequence
MSKLLDRFRY FKQKGDTFAD GHGQVMHSNR DWEDSYRQRW QFDKIVRSTH GVNCTGSCSW 
KIYVKNGLVT WEIQQTDYPR TRPDLPNHEP RGCPRGASYS WYLYSANRLK YPLIRKRLIE
LWREALKQHS DPVLAWASIM NDPQKSLSYK QVRGRGGFIR SNWPELNQLI AAANVWTIKT
YGPDRVAGFS PIPAMSMVSY AAGTRYLSLL GGTCLSFYDW YCDLPPASPM TWGEQTDVPE
SADWYNSSYI IAWGSNVPQT RTPDAHFFTE VRYKGTKTIA ITPDYSEVAK LCDQWLAPKQ
GTDSALAMAM GHVILKEFHL DNPSDYFINY CRRYSDMPML VMLEPRDDGS YVPGRMIRAS
DLVDGLGESN NPQWKTVAVN TAGELVVPNG SIGFRWGEKG KWNLESIAAG KETELSLTLL
GQHDAVAGVA FPYFGGIENP HFRSVKHNPV LVRQLPVKNL TLAGGSTCPV VSVYDLVLAN
YGLDRGLEDE NSAKDYAEIK PYTPAWGEQI TGVPRQYIET IAREFADTAH KTHGRSMIIL
GAGVNHWYHM DMNYRGMINM LIFCGCVGQS GGGWAHYVGQ EKLRPQTGWL PLAFALDWNR
PPRQMNSTSF FYNHSSQWRY EKVSAQELLS PLADASKYSG HLIDFNVRAE RMGWLPSAPQ
LGRNPLGIKA EADKAGLSPT EFTAQALKSG DLRMACEQPD SGSNHPRNLF VWRSNLLGSS
GKGHEYMQKY LLGTESGIQG EELGASDGIK PEEVEWQTAA IEGKLDLLVT LDFRMSSTCL
FSDIVLPTAT WYEKDDMNTS DMHPFIHPLS AAVDPAWESR SDWEIYKGIA KAFSQVCVGH
LGKETDVVLQ PLLHDSPAEL SQPCEVLDWR KGECDLSPGK TAPNIVAVER DYPATYERFT
SLGPLMDKLG NGGKGISWNT QDEIDFLGKL NYTKRDGPAQ GRPLIDTAID ASEVILALAP
ETNGHVAVKA WQALGEITGR EHTHLALHKE DEKIRFRDIQ AQPRKIISSP TWSGLESDHV
SYNAGYTNVH ELIPWRTLSG RQQLYQDHPW MRAFGESLVA YRPPIDTRSV SEMRQIPPNG
FPEKALNFLT PHQKWGIHST YSENLLMLTL SRGGPIVWIS ETDARELTIV DNDWVEVFNA
NGALTARAVV SQRVPPGMTM MYHAQERIMN IPGSEVTGMR GGIHNSVTRI CPKPTHMIGG
YAQLAWGFNY YGTVGSNRDE FIMIRKMKNV NWLDDEGRDQ VQEAKK
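
The gene and protein sequences listed above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch, not part of the record. It assumes the standard codon-to-amino-acid assignments (translation table 11 uses the same assignments and differs only in permitted start codons), removes the 10-character block spacing before processing, and uses hypothetical helper names (`clean`, `translate`, `gc_percent`).

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence, copied from the listing above.
first_row = "ATGAGTAAAC TTTTGGATCG CTTTCGCTAC TTCAAACAAA AGGGCGATAC CTTTGCCGAT"
last_row = "GTACAGGAGG CGAAAAAATG A"

print(translate(first_row))  # MSKLLDRFRYFKQKGDTFAD
print(translate(last_row))   # VQEAKK
```

The two printed fragments match the start and end of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would extend the same check to the complete protein and to the reported GC content.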