Gene EcHS_A0999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0999 
SymboldmsA 
ID5594963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1001303 
End bp1003747 
Gene Length2445 bp 
Protein Length814 aa 
Translation table11 
GC content53% 
IMG OID640920170 
Productanaerobic dimethyl sulfoxide reductase, A subunit 
Protein accessionYP_001457735 
Protein GI157160417 
COG category[C] Energy production and conversion 
COG ID[COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing 
TIGRFAM ID[TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases
[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR02166] anaerobic dimethyl sulfoxide reductase, A subunit, DmsA/YnfE family 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACGA AAATCCCTGA TGCGGTATTG GCTGCTGAGG TGAGTCGCCG TGGTTTGGTA 
AAAACGACAG CGATCGGCGG CCTGGCAATG GCCAGCAGCG CATTAACATT ACCTTTTAGT
CGGATTGCGC ACGCTGTCGA TAGCGCCATT CCAACAAAAT CAGACGAAAA GGTTATCTGG
AGCGCCTGTA CAGTTAACTG TGGTAGTCGC TGCCCGCTAC GTATGCACGT CGTGGACGGT
GAAATCAAAT ATGTCGAAAC GGACAATACC GGCGATGACA ATTACGACGG CCTGCACCAG
GTTCGCGCCT GCCTGCGTGG GCGTTCCATG CGTCGCCGTG TCTACAATCC GGACCGCCTG
AAATATCCGA TGAAACGAGT CGGGGCGCGC GGTGAAGGCA AATTCGAGCG CATTAGCTGG
GAAGAAGCCT ACGACATCAT CGCGACCAAT ATGCAGCGCC TGATCAAAGA GTACGGCAAC
GAGTCTATCT ATCTGAACTA TGGCACCGGT ACGCTGGGCG GCACCATGAC CCGCTCCTGG
CCGCCGGGAA ATACCCTGGT CGCGCGGCTG ATGAACTGCT GCGGCGGCTA TCTGAACCAT
TACGGCGACT ACTCCTCCGC GCAAATTGCG GAAGGTTTGA ACTATACCTA CGGCGGCTGG
GCAGATGGCA ACAGCCCGTC GGATATCGAA AACAGTAAGC TGGTAGTGCT GTTTGGTAAT
AACCCTGGCG AAACGCGAAT GAGTGGCGGT GGGGTGACTT ACTATCTTGA ACAGGCACGC
CAGAAATCTA ATGCCCGCAT GATCATCATC GATCCGCGCT ATACCGACAC CGGTGCCGGG
CGCGAAGATG AGTGGATCCC TATTCGTCCG GGAACAGATG CCGCACTGGT TAACGGTCTG
GCGTACGTCA TGATCACTGA AAACCTGGTG GATCAGGCAT TCCTCGATAA ATATTGCGTT
GGCTACGATG AGAAAACCCT GCCAGCCAGT GCGCCGAAAA ATGGCCACTA TAAAGCTTAT
ATTCTGGGTG AAGGGCCAGA TGGCGTGGCT AAAACGCCGG AATGGGCCTC GCAAATCACT
GGTGTTCCGG CAGACAAAAT CATCAAATTG GCTCGTGAAA TCGGTAGTAC CAAACCGGCG
TTTATCAGCC AGGGATGGGG CCCGCAGCGT CACGCTAACG GTGAAATCGC AACCCGTGCT
ATCTCGATGC TGGCGATTCT GACCGGTAAC GTTGGTATTA ACGGAGGCAA CAGCGGCGCG
CGTGAAGGTT CATACAGCTT ACCGTTTGTC CGTATGCCGA CCTTGGAAAA CCCGATCCAG
ACCAGCATTT CGATGTTTAT GTGGACCGAT GCCATTGAAC GTGGCCCGGA AATGACGGCG
CTGCGTGATG GTGTACGCGG GAAAGATAAG CTGGATGTGC CGATCAAAAT GATCTGGAAC
TATGCCGGTA ACTGCCTGAT TAACCAGCAT TCTGAAATCA ACCGTACCCA TGAAATCCTT
CAGGATGATA AGAAGTGCGA GCTGATTGTG GTTATCGACT GCCACATGAC CTCATCGGCG
AAATATGCTG ACATCCTGCT GCCTGACTGC ACCGCTTCCG AACAGATGGA CTTTGCGCTG
GATGCATCCT GCGGGAATAT GTCTTACGTG ATTTTCAACG ATCAGGTGAT TAAACCGCGC
TTTGAATGTA AGACCATCTA TGAAATGACC AGCGAACTGG CAAAACGTCT TGGCGTTGAG
CAACAGTTTA CTGAAGGCCG TACCCAGGAA GAGTGGATGC GGCATCTGTA TGCCCAGTCG
CGGGAAGCGA TTCCTGAACT GCCAACGTTT GAAGAGTTCC GCAAGCAGGG GATCTTTAAA
AAGCGCGACC CACAAGGGCA TCACGTTGCT TATAAAGCCT TCCGTGAAGA TCCGCAGGCA
AACCCACTGA CTACGCCATC GGGCAAAATT GAGATTTATT CGCAGGCGCT GGCTGACATT
GCCGCTACCT GGGAATTGCC TGAAGGCGAT GTGATCGATC CACTGCCGAT CTACACGCCG
GGCTTTGAAA GTTATCAGGA TCCGCTGAAC AAACAGTATC CGCTGCAGCT TACAGGTTTC
CACTATAAAT CTCGCGTTCA CTCAACTTAC GGCAACGTTG ATGTGCTGAA AGCGGCTTGC
CGTCAGGAAA TGTGGATCAA CCCGCTTGAT GCCCAAAAAC GCGGTATCCA CAACGGCGAT
AAAGTCAGGA TCTTTAACGA TCGTGGTGAG GTTCATATTG AGGCGAAAGT GACGCCACGA
ATGATGCCGG GTGTGGTCGC ACTGGGTGAA GGTGCCTGGT ATGACCCGGA TGCAAAACGT
GTCGATAAGG GTGGTTGTAT TAACGTACTG ACCACTCAAC GTCCGTCTCC TCTCGCTAAG
GGGAATCCGT CACATACAAA CCTTGTTCAG GTTGAAAAGG TGTAA
 
Protein sequence
MKTKIPDAVL AAEVSRRGLV KTTAIGGLAM ASSALTLPFS RIAHAVDSAI PTKSDEKVIW 
SACTVNCGSR CPLRMHVVDG EIKYVETDNT GDDNYDGLHQ VRACLRGRSM RRRVYNPDRL
KYPMKRVGAR GEGKFERISW EEAYDIIATN MQRLIKEYGN ESIYLNYGTG TLGGTMTRSW
PPGNTLVARL MNCCGGYLNH YGDYSSAQIA EGLNYTYGGW ADGNSPSDIE NSKLVVLFGN
NPGETRMSGG GVTYYLEQAR QKSNARMIII DPRYTDTGAG REDEWIPIRP GTDAALVNGL
AYVMITENLV DQAFLDKYCV GYDEKTLPAS APKNGHYKAY ILGEGPDGVA KTPEWASQIT
GVPADKIIKL AREIGSTKPA FISQGWGPQR HANGEIATRA ISMLAILTGN VGINGGNSGA
REGSYSLPFV RMPTLENPIQ TSISMFMWTD AIERGPEMTA LRDGVRGKDK LDVPIKMIWN
YAGNCLINQH SEINRTHEIL QDDKKCELIV VIDCHMTSSA KYADILLPDC TASEQMDFAL
DASCGNMSYV IFNDQVIKPR FECKTIYEMT SELAKRLGVE QQFTEGRTQE EWMRHLYAQS
REAIPELPTF EEFRKQGIFK KRDPQGHHVA YKAFREDPQA NPLTTPSGKI EIYSQALADI
AATWELPEGD VIDPLPIYTP GFESYQDPLN KQYPLQLTGF HYKSRVHSTY GNVDVLKAAC
RQEMWINPLD AQKRGIHNGD KVRIFNDRGE VHIEAKVTPR MMPGVVALGE GAWYDPDAKR
VDKGGCINVL TTQRPSPLAK GNPSHTNLVQ VEKV