Gene ECH74115_1708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_1708 
SymbolnarG 
ID6971407 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp1646583 
End bp1650326 
Gene Length3744 bp 
Protein Length1247 aa 
Translation table11 
GC content55% 
IMG OID643385665 
Productnitrate reductase 1, alpha subunit 
Protein accessionYP_002270159 
Protein GI209399352 
COG category[C] Energy production and conversion 
COG ID[COG5013] Nitrate reductase alpha subunit 
TIGRFAM ID[TIGR01580] respiratory nitrate reductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0699868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAAAT TCCTGGACCG GTTTCGCTAC TTCAAACAGA AGGGTGAAAC CTTTGCCGAT 
GGGCATGGCC AGCTTCTCAA TACCAACCGT GACTGGGAGG ATGGATATCG CCAGCGTTGG
CAGCATGACA AAATCGTCCG CTCTACCCAC GGGGTAAACT GCACCGGCTC CTGCAGCTGG
AAAATCTACG TCAAAAACGG TCTGGTCACC TGGGAAACCC AGCAGACTGA CTATCCGCGT
ACCCGTCCGG ATCTGCCAAA CCATGAACCT CGCGGCTGCC CGCGCGGTGC CAGCTACTCC
TGGTATCTTT ACAGTGCCAA CCGCCTGAAA TACCCGATGA TGCGCAAACG CCTGATGAAA
ATGTGGCGTG AAGCGAAGGC GCTGCATAGC GATCCGGTTG AGGCATGGGC TTCTATCATT
GAAGACGCCG ATAAAGCGAA AAGCTTTAAG CAGGCGCGTG GACGCGGTGG TTTTGTTCGT
TCTTCCTGGC AGGAGGTGAA CGAACTGATC GCCGCATCTA ACGTTTACAC CATCAAAAAC
TACGGCCCGG ACCGTGTTGC TGGTTTCTCG CCAATTCCGG CAATGTCGAT GGTTTCTTAC
GCATCGGGTG CACGCTATCT CTCGCTGATT GGCGGTACTT GCTTAAGCTT CTACGACTGG
TACTGCGACC TGCCTCCTGC GTCTCCGCAA ACCTGGGGCG AGCAAACTGA CGTACCGGAA
TCTGCTGACT GGTACAACTC CAGCTACATC ATCGCCTGGG GTTCAAACGT GCCGCAGACG
CGTACTCCAG ATGCTCACTT CTTTACTGAA GTGCGTTACA AAGGCACCAA AACTGTTGCC
GTCACACCAG ACTACGCTGA AATCGCCAAA CTGTGCGATC TGTGGCTGGC ACCGAAACAG
GGCACCGATG CGGCAATGGC GCTGGCGATG GGCCACGTAA TGCTGCGTGA ATTCCATCTC
GACAACCCAA GCCAGTATTT CACCGACTAT GTGCGTCGCT ACACCGACAT GCCGATGCTG
GTGATGCTGG AAGAACGCGA CGGTTACTAC GCTGCAGGTC GTATGCTGCG CGCTGCTGAT
CTGGTTGATG CGCTGGGCCA GGAAAACAAT CCGGAATGGA AAACTGTCGC CTTTAATACC
AATGGCGAAA TGGTTGCGCC GAACGGTTCT ATTGGCTTCC GCTGGGGCGA GAAGGGCAAA
TGGAATCTTG AGCAGCGCGA CGGCAAAACT GGCGAAGAAA CCGAGCTGCA ACTGAGCCTG
CTGGGTAGCC AGGATGAGAT CGCTGAGGTA GGCTTCCCGT ACTTTGGTGG CGACGGCACG
GAACACTTCA ACAAAGTGGA ACTGGAAAAC GTGCTGCTGC ACAAACTGCC GGTGAAACGC
CTGCAACTGG CTGATGGCAG CACCGCCCTG GTGACCACCG TTTATGATCT GACGCTGGCA
AACTACGGTC TGGAACGTGG CCTGAACGAC GTTAACTGTG CAACCAGCTA TGACGATGTG
AAAGCTTATA CCCCGGCCTG GGCCGAGCAG ATTACCGGCG TTTCTCGCAG CCAGATTATT
CGCATCGCCC GTGAATTTGC CGATAACGCT GATAAAACGC ACGGTCGTTC GATGATTATC
GTCGGTGCGG GGCTGAACCA CTGGTATCAC CTCGATATGA ACTATCGTGG TCTGATCAAC
ATGCTGATTT TCTGCGGCTG TGTCGGTCAG AGCGGGGGCG GCTGGGCGCA CTATGTAGGT
CAGGAAAAAC TGCGTCCGCA AACCGGCTGG CAGCCGCTGG CGTTTGCCCT CGACTGGCAG
CGTCCGGCGC GTCACATGAA CAGCACCTCT TATTTCTATA ACCACTCCAG CCAGTGGCGT
TATGAAACCG TGACTGCGGA AGAACTGCTG TCGCCGATGG CGGATAAATC CCGCTATACC
GGACATCTGA TCGACTTTAA CGTCCGTGCA GAACGCATGG GCTGGCTGCC GTCTGCGCCG
CAGTTAGGCA CTAACCCGCT GACTATCGCT GGCGAAGCAG AAAAAGCCGG GATGAATCCG
GTGGACTATA CGGTGAAATC CCTGAAAGAG GGTTCTATCC GTTTCGCTGC GGAACAGCCG
GAAAACGGCA AAAACCACCC GCGTAACCTG TTTATCTGGC GTTCTAACCT GCTCGGTTCT
TCCGGTAAAG GTCATGAGTT TATGCTCAAG TACCTGCTGG GGACGGAGCA CGGTATCCAG
GGTAAAGATC TGGGGCAGCA GGGCGGCGTG AAGCCGGAAG AAGTGGACTG GCAGGACAAT
GGTCTGGAAG GCAAGCTGGA TCTGGTGGTT ACGCTGGACT TCCGTCTGTC GAGCACCTGT
CTCTATTCCG ACATCATTTT GCCGACGGCG ACCTGGTACG AAAAAGACGA CATGAATACT
TCGGATATGC ATCCGTTTAT TCACCCGCTG TCTGCGGCGG TCGATCCGGC CTGGGAAGCG
AAAAGCGACT GGGAAATCTA CAAAGCCATC GCGAAGAAAT TCTCCGAAGT GTGCGTCGGC
CATCTGGGTA AAGAAACCGA CATCGTCACG CTGCCTATCC AGCACGACTC TGCCGCTGAA
CTGGCGCAGC CGCTGGATGT GAAAGACTGG AAAAAAGGCG AATGCGACCT GATCCCAGGC
AAAACTGCAC CGCACATTAT GGTCGTAGAG CGCGATTATC CGGCAACTTA CGAACGCTTT
ACCTCTATCG GCCCGCTGAT GGAGAAAATC GGTAACGGCG GTAAAGGGAT TGCCTGGAAC
ACCCAGAGCG AGATGGATCT GCTGCGTAAG CTCAACTACA CCAAAGCGGA AGGTCCGGCG
AAAGGCCAGC CGATGCTGAA CACCGCAATT GATGCGGCAG AGATGATCCT GACACTGGCA
CCGGAAACCA ACGGTCAGGT AGCCGTGAAA GCCTGGGCTG CCCTGAGCGA ATTTACCGGT
CGTGACCATA CGCATCTGGC GCTGAATAAA GAAGACGAGA AGATCCGCTT CCGCGATATT
CAGGCACAGC CGCGCAAAAT TATCTCCAGC CCGACCTGGT CTGGTCTGGA AGATGAACAC
GTTTCTTACA ACGCCGGTTA CACCAACGTT CACGAGCTGA TCCCATGGCG TACGCTCTCT
GGTCGTCAGC AACTGTATCA GGATCACCAG TGGATGCGTG ATTTCGGTGA AAGCCTGCTG
GTTTATCGTC CGCCGATCGA CACCCGTTCG GTGAAAGAAG TGATAGGCCA GAAATCCAAC
GGCAACCCGG AAAAAGCGCT CAACTTCCTG ACGCCGCACC AGAAGTGGGG TATCCACTCC
ACCTACAGCG ACAACCTGCT GATGCTGACT TTAGGTCGCG GTGGTCCGGT GGTCTGGTTG
AGTGAAGCCG ATGCCAAAGA TCTGGGTATC GCCGATAACG ACTGGATTGA AGTATTCAAC
AGCAACGGTG CTCTGACTGC CCGTGCGGTT GTCAGCCAGC GTGTTCCGGC AGGGATGACC
ATGATGTACC ACGCGCAGGA ACGTATCGTT AACCTGCCTG GTTCGGAAAT TACCCAACAG
CGTGGTGGTA TCCATAACTC GGTCACCCGT ATCACGCCGA AACCGACGCA TATGATCGGC
GGCTATGCCC ATCTGGCATA CGGCTTTAAC TACTACGGCA CCGTAGGTTC TAACCGCGAT
GAGTTTGTTG TAGTGCGTAA GATGAAGAAC ATTGACTGGT TAGATGGCGA AGGCAATGAT
CAGGTACAGG AGAGCGTAAA ATGA
 
Protein sequence
MSKFLDRFRY FKQKGETFAD GHGQLLNTNR DWEDGYRQRW QHDKIVRSTH GVNCTGSCSW 
KIYVKNGLVT WETQQTDYPR TRPDLPNHEP RGCPRGASYS WYLYSANRLK YPMMRKRLMK
MWREAKALHS DPVEAWASII EDADKAKSFK QARGRGGFVR SSWQEVNELI AASNVYTIKN
YGPDRVAGFS PIPAMSMVSY ASGARYLSLI GGTCLSFYDW YCDLPPASPQ TWGEQTDVPE
SADWYNSSYI IAWGSNVPQT RTPDAHFFTE VRYKGTKTVA VTPDYAEIAK LCDLWLAPKQ
GTDAAMALAM GHVMLREFHL DNPSQYFTDY VRRYTDMPML VMLEERDGYY AAGRMLRAAD
LVDALGQENN PEWKTVAFNT NGEMVAPNGS IGFRWGEKGK WNLEQRDGKT GEETELQLSL
LGSQDEIAEV GFPYFGGDGT EHFNKVELEN VLLHKLPVKR LQLADGSTAL VTTVYDLTLA
NYGLERGLND VNCATSYDDV KAYTPAWAEQ ITGVSRSQII RIAREFADNA DKTHGRSMII
VGAGLNHWYH LDMNYRGLIN MLIFCGCVGQ SGGGWAHYVG QEKLRPQTGW QPLAFALDWQ
RPARHMNSTS YFYNHSSQWR YETVTAEELL SPMADKSRYT GHLIDFNVRA ERMGWLPSAP
QLGTNPLTIA GEAEKAGMNP VDYTVKSLKE GSIRFAAEQP ENGKNHPRNL FIWRSNLLGS
SGKGHEFMLK YLLGTEHGIQ GKDLGQQGGV KPEEVDWQDN GLEGKLDLVV TLDFRLSSTC
LYSDIILPTA TWYEKDDMNT SDMHPFIHPL SAAVDPAWEA KSDWEIYKAI AKKFSEVCVG
HLGKETDIVT LPIQHDSAAE LAQPLDVKDW KKGECDLIPG KTAPHIMVVE RDYPATYERF
TSIGPLMEKI GNGGKGIAWN TQSEMDLLRK LNYTKAEGPA KGQPMLNTAI DAAEMILTLA
PETNGQVAVK AWAALSEFTG RDHTHLALNK EDEKIRFRDI QAQPRKIISS PTWSGLEDEH
VSYNAGYTNV HELIPWRTLS GRQQLYQDHQ WMRDFGESLL VYRPPIDTRS VKEVIGQKSN
GNPEKALNFL TPHQKWGIHS TYSDNLLMLT LGRGGPVVWL SEADAKDLGI ADNDWIEVFN
SNGALTARAV VSQRVPAGMT MMYHAQERIV NLPGSEITQQ RGGIHNSVTR ITPKPTHMIG
GYAHLAYGFN YYGTVGSNRD EFVVVRKMKN IDWLDGEGND QVQESVK