Gene ECH74115_1056 details


Gene Information

Locus tag: ECH74115_1056
Symbol: dmsA
ID: 6967513
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 1075427
End bp: 1077871
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 53%
IMG OID: 643385068
Product: anaerobic dimethyl sulfoxide reductase, A subunit
Protein accession: YP_002269567
Protein GI: 209400103
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases
            [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
            [TIGR02166] anaerobic dimethyl sulfoxide reductase, A subunit, DmsA/YnfE family
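
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions are consistent with the listed numbers but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1075427
end_bp = 1077871
gene_length_bp = 2445
protein_length_aa = 814


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Codons in a CDS, assuming one terminal stop codon that is not translated."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Compare the derived values with the ones reported in the record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

If either comparison printed False for another record, the likely explanations would be a different coordinate convention or a partial CDS, rather than an error in the record itself.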


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 63
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAACGA AAATCCCTGA TGCGGTATTG GCTGCTGAGG TGAGTCGCCG TGGTTTGGTA 
AAAACGACAG CGATCGGCGG CCTGGCAATG GCCAGCAGCG CATTAACATT ACCTTTTAGT
CGGATTGCGC ACGCTGTCGA TAGCGCCATT CCAACAAAAT CAGGCGAAAA GGTTATCTGG
AGCGCCTGTA CAGTTAACTG TGGTAGTCGC TGCCCGCTAC GTATGCACGT CGTGGACGGT
GAAATCAAAT ATGTCGAAAC GGACAATACC GGCGATGACA ATTACGACGG CCTGCACCAG
GTTCGCGCCT GCCTGCGTGG GCGTTCCATG CGTCGCCGTG TCTACAATCC GGACCGCCTG
AAATATCCGA TGAAACGAGT CGGGGCGCGC GGTGAAGGCA AATTCGAGCG CATTAGCTGG
GAAGAAGCCT ACGACATCAT CGCGACCAAT ATGCAGCGCC TGATCAAAGA GTACGGCAAC
GAATCCATCT ATCTGAACTA TGGCACCGGT ACGCTGGGTG GCACCATGAC CCGCTCCTGG
CCGCCGGGAA ATACCCTGGT CGCGCGGCTG ATGAACTGCT GCGGCGGCTA TCTGAACCAT
TACGGCGACT ACTCCTCCGC GCAAATTGCG GAAGGTCTGA ACTATACCTA CGGCGGCTGG
GCAGATGGCA ACAGCCCGTC GGATATCGAA AACAGTAAGC TGGTAGTGCT GTTTGGTAAT
AACCCTGGCG AAACGCGAAT GAGTGGCGGT GGGGTGACTT ACTATCTTGA ACAGGCACGC
CAGAAATCTA ATGCCCGCAT GATCATCATC GATCCGCGCT ATACCGACAC CGGTGCCGGG
CGCGAAGATG AGTGGATCCC TATTCGTCCG GGAACAGATG CCGCACTGGT TAACGGTCTG
GCGTACGTCA TGATCACTGA AAACCTGGTG GATCAGGCAT TCCTCGATAA ATATTGCGTT
GGCTACGATG AGAAAACCCT GCCAGCCAGT GCGCCGAAAA ATGGCCACTA TAAAGCTTAT
ATTCTGGGTG AAGGGCCAGA TGGCGTGGCT AAAACACCGG AATGGGCCTC GCAAATCACC
GGTGTTCCGG CAGACAAAAT CATCAAACTG GCTCGTGAAA TCGGCAGTAC CAAACCGGCG
TTTATCAGCC AGGGATGGGG CCCGCAGCGT CACGCTAACG GTGAAATCGC AACCCGTGCT
ATCTCGATGC TGGCGATTCT GACCGGTAAC GTTGGTATTA ATGGAGGGAA CAGCGGCGCG
CGTGAAGGTT CATACAGTTT ACCGTTTGTC CGTATGCCGA CCTTGGAAAA CCCGATCCAG
ACCAGCATTT CGATGTTTAT GTGGACCGAT GCCATTGAAC GTGGCCCGGA AATGACGGCG
CTACGTGATG GTGTGCGCGG GAAAGATAAG CTGGATGTGC CGATCAAAAT GATCTGGAAC
TATGCCGGTA ACTGCCTGAT TAACCAGCAT TCTGAAATCA ACCGTACCCA TGAAATCCTT
CAGGATGATA AGAAGTGCGA GCTGATTGTG GTTATCGACT GCCACATGAC CTCATCGGCG
AAATATGCTG ACATCCTGCT GCCTGACTGC ACCGCTTCCG AACAGATGGA CTTTGCACTG
GATGCATCCT GCGGGAATAT GTCTTACGTG ATTTTCAACG ATCAGGTGAT TAAACCGCGC
TTTGAATGTA AGACCATCTA TGAAATGACC AGCGAACTGG CAAAACGTCT TGGCGTTGAG
CAACAGTTTA CTGAAGGCCG TACCCAGGAA GAGTGGATGC GGCATCTGTA TGCCCAGTCG
CGGGAAGCGA TTCCTGAACT GCCAACGTTT GAAGAGTTCC GCAAGCAGGG GATCTTTAAA
AAGCGCGACC CACAAGGGCA TCACGTTGCT TATAAAGCCT TCCGTGAAGA TCCGCAGGCA
AATCCACTGA CCACGCCATC GGGTAAAATT GAGATTTATT CGCAGGCGCT GGCTGACATT
GCCGCTACCT GGGAATTGCC AGAAGGCGAT GTGATCGATC CACTGCCGAT CTACACGCCG
GGCTTTGAAA GTTATCAGGA TCCGCTGAAC AAACAGTATC CGCTGCAGCT TACAGGTTTC
CACTATAAAT CTCGCGTTCA CTCAACTTAC GGCAACGTTG ATGTGCTGAA AGCGGCTTGC
CGTCAGGAAA TGTGGATCAA CCCGCTTGAT GCCCAAAAAC GCGGTATCCA CAACGGCGAT
AAAGTCAGGA TCTTTAACGA TCGTGGTGAG GTTCATATTG AGGCGAAAGT GACGCCACGA
ATGATGCCGG GTGTGGTCGC ACTGGGTGAA GGTGCCTGGT ATGACCCGGA TGCAAAACGT
GTCGATAAGG GTGGTTGTAT TAACGTACTG ACCACTCAAC GTCCGTCTCC TCTCGCTAAG
GGGAATCCGT CACATACAAA CCTTGTTCAG GTTGAAAAGG TGTAA
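
The gene sequence is listed in space-separated blocks of ten bases, so it has to be cleaned before any statistic such as the reported GC content can be recomputed. The following is a minimal sketch of that step. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the listing is used here, so its result is not expected to equal the 53% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a (possibly space-separated) sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence listing above (illustration only).
first_line = "ATGAAAACGA AAATCCCTGA TGCGGTATTG GCTGCTGAGG TGAGTCGCCG TGGTTTGGTA"
print(len(clean_sequence(first_line)), gc_percent(first_line))
```

Passing the full listing as one multi-line string gives the value for the whole gene, which can then be compared with the rounded figure in the Gene Information section.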
 
Protein sequence
MKTKIPDAVL AAEVSRRGLV KTTAIGGLAM ASSALTLPFS RIAHAVDSAI PTKSGEKVIW 
SACTVNCGSR CPLRMHVVDG EIKYVETDNT GDDNYDGLHQ VRACLRGRSM RRRVYNPDRL
KYPMKRVGAR GEGKFERISW EEAYDIIATN MQRLIKEYGN ESIYLNYGTG TLGGTMTRSW
PPGNTLVARL MNCCGGYLNH YGDYSSAQIA EGLNYTYGGW ADGNSPSDIE NSKLVVLFGN
NPGETRMSGG GVTYYLEQAR QKSNARMIII DPRYTDTGAG REDEWIPIRP GTDAALVNGL
AYVMITENLV DQAFLDKYCV GYDEKTLPAS APKNGHYKAY ILGEGPDGVA KTPEWASQIT
GVPADKIIKL AREIGSTKPA FISQGWGPQR HANGEIATRA ISMLAILTGN VGINGGNSGA
REGSYSLPFV RMPTLENPIQ TSISMFMWTD AIERGPEMTA LRDGVRGKDK LDVPIKMIWN
YAGNCLINQH SEINRTHEIL QDDKKCELIV VIDCHMTSSA KYADILLPDC TASEQMDFAL
DASCGNMSYV IFNDQVIKPR FECKTIYEMT SELAKRLGVE QQFTEGRTQE EWMRHLYAQS
REAIPELPTF EEFRKQGIFK KRDPQGHHVA YKAFREDPQA NPLTTPSGKI EIYSQALADI
AATWELPEGD VIDPLPIYTP GFESYQDPLN KQYPLQLTGF HYKSRVHSTY GNVDVLKAAC
RQEMWINPLD AQKRGIHNGD KVRIFNDRGE VHIEAKVTPR MMPGVVALGE GAWYDPDAKR
VDKGGCINVL TTQRPSPLAK GNPSHTNLVQ VEKV