Gene ECD_00898 details


Gene Information

Locus tag: ECD_00898
Symbol: dmsA
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 946050
End bp: 948494
Gene Length: 2445 bp
Protein Length: 814 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: dimethyl sulfoxide reductase, anaerobic, subunit A
Protein accession: ACT42793
Protein GI: 253977123
COG category:
COG ID:
TIGRFAM ID:
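
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, that the gene is unspliced (as the record states), and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    protein_length = gene_length // 3 - 1
    return gene_length, protein_length


gene_len, prot_len = cds_lengths(946050, 948494)
print(gene_len, prot_len)  # 2445 814
```

Under these assumptions the result agrees with the Gene Length (2445 bp) and Protein Length (814 aa) fields of the record.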


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0778088
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAACGA AAATCCCTGA TGCGGTATTG GCTGCTGAGG TGAGTCGCCG TGGTTTGGTA 
AAAACGACAG CGATCGGCGG CCTGGCAATG GCCAGCAGCG CATTAACATT ACCTTTTAGT
CGGATTGCGC ACGCTGTCGA TAGCGCCATT CCAACAAAAT CAGACGAAAA GGTTATCTGG
AGCGCCTGTA CAGTTAACTG TGGTAGTCGC TGCCCGCTAC GTATGCACGT CGTGGACGGT
GAAATCAAAT ATGTCGAAAC GGACAATACC GGCGATGACA ATTACGACGG CCTGCACCAG
GTTCGCGCCT GCCTGCGTGG GCGTTCCATG CGTCGCCGTG TCTACAATCC GGACCGCCTG
AAATATCCGA TGAAACGAGT CGGGGCGCGC GGTGAAGGCA AATTCGAGCG CATTAGCTGG
GAAGAAGCCT ACGACATCAT CGCGACCAAT ATGCAGCGCC TGATCAAAGA GTACGGCAAC
GAGTCTATCT ATCTGAACTA TGGCACCGGT ACGCTGGGCG GCACCATGAC CCGCTCCTGG
CCGCCGGGAA ATACCCTGGT CGCGCGGCTG ATGAACTGCT GCGGCGGCTA TCTGAACCAT
TACGGCGACT ACTCCTCCGC GCAAATTGCG GAAGGTTTGA ACTATACCTA CGGCGGCTGG
GCAGATGGCA ACAGCCCGTC GGATATCGAA AACAGTAAGC TGGTAGTGCT GTTTGGTAAT
AACCCTGGCG AAACGCGAAT GAGTGGCGGT GGGGTGACTT ACTATCTTGA ACAGGCACGC
CAGAAATCTA ATGCCCGCAT GATCATCATC GATCCGCGCT ATACCGACAC CGGTGCCGGG
CGCGAAGATG AGTGGATCCC TATTCGTCCG GGAACAGATG CCGCACTGGT TAACGGTCTG
GCGTACGTCA TGATCACTGA AAACCTGGTG GATCAGGCAT TCCTCGATAA ATATTGCGTT
GGCTACGATG AGAAAACCCT GCCAGCCAGT GCGCCGAAAA ATGGCCACTA TAAAGCTTAT
ATTCTGGGTG AAGGGCCAGA TGGCGTGGCT AAAACGCCGG AATGGGCCTC GCAAATCACT
GGTGTTCCGG CAGACAAAAT CATCAAATTG GCTCGTGAAA TCGGTAGTAC CAAACCGGCG
TTTATCAGCC AGGGATGGGG CCCGCAGCGT CACGCTAACG GTGAAATCGC AACCCGTGCT
ATCTCGATGC TGGCGATTCT GACCGGTAAC GTTGGTATTA ACGGAGGCAA CAGCGGCGCG
CGTGAAGGTT CATACAGCTT ACCGTTTGTC CGTATGCCGA CCTTGGAAAA CCCGATCCAG
ACCAGCATTT CGATGTTTAT GTGGACCGAT GCCATTGAAC GTGGCCCGGA AATGACGGCG
CTGCGTGATG GTGTACGCGG GAAAGATAAG CTGGATGTGC CGATCAAAAT GATCTGGAAC
TATGCCGGTA ACTGCCTGAT TAACCAGCAT TCTGAAATCA ACCGTACCCA TGAAATCCTT
CAGGATGATA AGAAGTGCGA GCTGATTGTG GTTATCGACT GCCACATGAC CTCATCGGCG
AAATATGCTG ACATCCTGCT GCCTGACTGC ACCGCTTCCG AACAGATGGA CTTTGCGCTG
GATGCATCCT GCGGGAATAT GTCTTACGTG ATTTTCAACG ATCAGGTGAT TAAACCGCGC
TTTGAATGTA AGACCATCTA TGAAATGACC AGCGAACTGG CAAAACGTCT TGGCGTTGAG
CAACAGTTTA CTGAAGGCCG TACCCAGGAA GAGTGGATGC GGCATCTGTA TGCCCAGTCG
CGGGAAGCGA TTCCTGAACT GCCAACGTTT GAAGAGTTCC GCAAGCAGGG GATCTTTAAA
AAGCGCGACC CACAAGGGCA TCACGTTGCT TATAAAGCCT TCCGTGAAGA TCCGCAGGCA
AACCCACTGA CTACGCCATC GGGCAAAATT GAGATTTATT CGCAGGCGCT GGCTGACATT
GCCGCTACCT GGGAATTGCC TGAAGGCGAT GTGATCGATC CACTGCCGAT CTACACGCCG
GGCTTTGAAA GTTATCAGGA TCCGCTGAAC AAACAGTATC CGCTGCAGCT TACAGGTTTC
CACTATAAAT CTCGCGTTCA CTCAACTTAC GGCAACGTTG ATGTGCTGAA AGCGGCTTGC
CGTCAGGAAA TGTGGATCAA CCCGCTTGAT GCCCAAAAAC GCGGTATCCA CAACGGCGAT
AAAGTCAGGA TCTTTAACGA TCGTGGTGAG GTTCATATTG AGGCGAAAGT GACGCCACGA
ATGATGCCGG GTGTGGTCGC ACTGGGTGAA GGTGCCTGGT ATGACCCGGA TGCAAAACGT
GTCGATAAGG GTGGTTGTAT TAACGTACTG ACCACTCAAC GTCCGTCTCC TCTCGCTAAG
GGGAATCCGT CACATACAAA CCTTGTTCAG GTTGAAAAGG TGTAA
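
The gene sequence above is laid out in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured or analysed. The sketch below is an illustration only, not the database's own method. It joins the blocks and reports the length and GC fraction of whatever text is passed in. The function name `gc_fraction` is hypothetical, and the example input is just the first two blocks of the sequence rather than the full gene.

```python
def gc_fraction(formatted_sequence):
    """Return (length, GC fraction) for a whitespace-formatted DNA sequence."""
    # Remove the spaces and line breaks used for display.
    bases = "".join(formatted_sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return len(bases), gc_count / len(bases)


# First two ten-base blocks of the gene sequence, used as a short example.
length, gc = gc_fraction("ATGAAAACGA AAATCCCTGA")
print(length, round(gc, 2))  # 20 0.35
```

Passing the full gene sequence to the same function would give values to compare against the Gene Length and GC content fields of the record.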
 
Protein sequence
MKTKIPDAVL AAEVSRRGLV KTTAIGGLAM ASSALTLPFS RIAHAVDSAI PTKSDEKVIW 
SACTVNCGSR CPLRMHVVDG EIKYVETDNT GDDNYDGLHQ VRACLRGRSM RRRVYNPDRL
KYPMKRVGAR GEGKFERISW EEAYDIIATN MQRLIKEYGN ESIYLNYGTG TLGGTMTRSW
PPGNTLVARL MNCCGGYLNH YGDYSSAQIA EGLNYTYGGW ADGNSPSDIE NSKLVVLFGN
NPGETRMSGG GVTYYLEQAR QKSNARMIII DPRYTDTGAG REDEWIPIRP GTDAALVNGL
AYVMITENLV DQAFLDKYCV GYDEKTLPAS APKNGHYKAY ILGEGPDGVA KTPEWASQIT
GVPADKIIKL AREIGSTKPA FISQGWGPQR HANGEIATRA ISMLAILTGN VGINGGNSGA
REGSYSLPFV RMPTLENPIQ TSISMFMWTD AIERGPEMTA LRDGVRGKDK LDVPIKMIWN
YAGNCLINQH SEINRTHEIL QDDKKCELIV VIDCHMTSSA KYADILLPDC TASEQMDFAL
DASCGNMSYV IFNDQVIKPR FECKTIYEMT SELAKRLGVE QQFTEGRTQE EWMRHLYAQS
REAIPELPTF EEFRKQGIFK KRDPQGHHVA YKAFREDPQA NPLTTPSGKI EIYSQALADI
AATWELPEGD VIDPLPIYTP GFESYQDPLN KQYPLQLTGF HYKSRVHSTY GNVDVLKAAC
RQEMWINPLD AQKRGIHNGD KVRIFNDRGE VHIEAKVTPR MMPGVVALGE GAWYDPDAKR
VDKGGCINVL TTQRPSPLAK GNPSHTNLVQ VEKV