Gene EcSMS35_2127 details

Gene Information

Locus tag: EcSMS35_2127
Symbol: torA
ID: 6144206
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2134828
End bp: 2137374
Gene Length: 2547 bp
Protein Length: 848 aa
Translation table: 11
GC content: 55%
IMG OID: 641617003
Product: trimethylamine-N-oxide reductase
Protein accession: YP_001744178
Protein GI: 170684290
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID: [TIGR00509] molybdopterin guanine dinucleotide-containing S/N-oxide reductases
            [TIGR02164] trimethylamine-N-oxide reductase TorA
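
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 2134828, 2137374
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 2547
print(protein_length_aa(length))  # 848
```

Both results agree with the Gene Length and Protein Length fields listed in the record.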


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.600325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAATA ACGATCTCTT TCAGGCATCA CGTCGGCGTT TTCTGGCACA ACTCGGCGGC 
TTAACCGTCG CCGGGATGCT GGGGCCGTCA TTGTTAACGC CGCGCCGTGC GACTGCGGCG
CAAGCGGCGA CTGAGGCTGT CATCTCGAAA GAGGGCATTC TTACCGGGTC GCACTGGGGG
GCTATCCGCG CGACGGTGAA GGATGGTCGC TTTGTGGCGG CAAAACCGTT CGAACTGGAT
AAATATCCGT CGAAAATGAT TGCCGGATTG CCGGATCACG TACACAACGC GGCGCGTATT
CGTTACCCGA TGGTACGCGT GGACTGGCTA CGTAAGCGCC ATCTGAGCGA CACTTCCCAG
CGCGGTGATA ACCGTTTTGT GCGCGTGAGC TGGGATGAAG CCCTCGACAT GTTCTATGAA
GAACTGGAAC GCGTACAGAA AACTCACGGG CCGAGTGCCT TGCTGACCGC CAGTGGTTGG
CAATCGACGG GGATGTTCCA TAACGCTTCG GGGATGCTGG CGAAAGCTAT TGCCTTGCAT
GGTAATAGCG TTGGTACGGG CGGAGATTAC TCTACCGGTG CTGCGCAGGT GATCCTGCCG
CGCGTGGTTG GCTCAATGGA AGTGTATGAA CAGCAAACCT CCTGGCCGCT GGTATTGCAG
AACAGCAAAA CCATTGTGCT GTGGGGCTCT GATTTGCTGA AAAACCAGCA AGCGAACTGG
TGGTGCCCGG ATCACGATGT TTATGAATAT TACGCGCAGT TGAAAGCGAA AGTCGCCGCC
GGTGAAATTG AGGTCATCAG CATCGATCCG GTTGTCACAT CCACCCATGA GTATCTGGGG
CGCGAGCATG TGAAGCACAT TGCTGTTAAC CCGCAAACTG ACGTGCCACT GCAACTGGCG
CTGGCGTATA CGCTGTACAG TGAAAACCTG TACGACAAAA ACTTCCTCGC TAACTACTGT
GTGGGGTTTG AGCAGTTCCT GCCGTATCTG CTGGGTGAGA AAGACGGTCA GCCGAAAGAT
GCCGCATGGG CTGAAAAACT GACCGGCATT GATGCCGAAA CCATTCGTGG GCTGGCGCGG
CAGATGGCGG CGAACAGAAC GCAGATTATT GCTGGCTGGT GCGTACAGCG TATGCAGCAC
GGTGAACAGT GGGCGTGGAT GATTGTCGTT CTGGCGGCGA TGCTGGGGCA AATTGGCCTG
CCAGGTGGTG GCTTTGGTTT TGGCTGGCAC TATAACGGCG CAGGCACGCC GGGGCGTAAA
GGCGTTATTC TGAGTGGTTT CTCCGGCTCT ACGTCGATTT CGCCTGTTCA CGACAACAGT
GATTACAAAG GTTACAGCAG CACCATTCCG ATTGCCCGTT TTATCGATGC GATCCTCGAA
CCGGGGAAAG TGATCAACTG GAACGGTAAA TCGGTAAAAC TGCCGCCGCT GAAAATGTGT
ATTTTTGCCG GAACTAACCC GTTCCATCGC CATCAGCAGA TCAACCGCAT TATTGAAGGC
TGGCGCAAGC TGGAAACGGT TATCGCCATA GATAACCAGT GGACCTCAAC CTGCCGCTTT
GCCGATATCG TACTGCCTGC GACCACGCAG TTTGAGCGTA ACGATCTCGA CCAGTACGGT
AACCACTCTA ACCGTGGCAT TATCGCCATG AAACAGGTCG TGCCGCCGCA GTTTGAGGCG
CGCAACGACT TTGATATTTT CCGCGAGCTG TGCCGTCGCT TTAATCGCGA AGAAGCCTTT
ACCGAAGGGC TGGACGAAAT GGGCTGGCTG AAACGCATCT GGCAGGAAGG TGTACAACAG
GGCAAAGGAC GCGGCGTTCA TCTGCCAGCG TTTGATGACT TCTGGAATAA CAAAGAGTAT
GTCGAGTTTG ACCATCCGCA GATGTTTGTT CGCCACCAGG CATTCCGCGA AGATCCGGAT
CTCGAACCGC TGGGCACGCC GAGTGGCCTG ATTGAGATCT ACTCGAAAAC CATCGCCGAT
ATGAACTACG ACGATTGTCA GGGGCATCCG ATGTGGTTTG AGAAAATCGA ACGCTCCCAC
GGCGGGCCCG GCTCGCAGAA GTATCCGTTG CATCTGCAAT CTGTGCATCC GGATTTCCGA
CTTCACTCGC AGTTATGTGA GTCGGAAACT CTGCGTCAGC AATATACGGT AGCGGGTAAA
GAGCCAGTGT TCATTAACCC GCAGGATGCC AGCGCGCGCG GTATTCGTAA CGGTGATGTG
GTACGCGTCT TTAACGCTCG CGGTCAGGTG TTGGCTGGGG CAGTGGTTTC TGACCGCTAT
GCACCCGGCG TGGCACGAAT TCATGAAGGG GCATGGTACG ATCCAGATAA AGGCGGCGAG
CCTGGTGCGC TGTGCAAATA CGGTAATCCC AACGTGTTGA CCATTGACAT CGGTACTTCG
CAGTTGGCGC AGGCGACCAG TGCGCACACT ACGCTGGTGG AAATTGAGAA GTACAACGGA
ACAGTGGAGC AGGTAACGGC GTTTAACGGC CCCGTGGAGA TGGTGGCGCA GTGTGAATAT
GTTCCCGCGT CGCAGGTGAA ATCATGA
 
Protein sequence
MNNNDLFQAS RRRFLAQLGG LTVAGMLGPS LLTPRRATAA QAATEAVISK EGILTGSHWG 
AIRATVKDGR FVAAKPFELD KYPSKMIAGL PDHVHNAARI RYPMVRVDWL RKRHLSDTSQ
RGDNRFVRVS WDEALDMFYE ELERVQKTHG PSALLTASGW QSTGMFHNAS GMLAKAIALH
GNSVGTGGDY STGAAQVILP RVVGSMEVYE QQTSWPLVLQ NSKTIVLWGS DLLKNQQANW
WCPDHDVYEY YAQLKAKVAA GEIEVISIDP VVTSTHEYLG REHVKHIAVN PQTDVPLQLA
LAYTLYSENL YDKNFLANYC VGFEQFLPYL LGEKDGQPKD AAWAEKLTGI DAETIRGLAR
QMAANRTQII AGWCVQRMQH GEQWAWMIVV LAAMLGQIGL PGGGFGFGWH YNGAGTPGRK
GVILSGFSGS TSISPVHDNS DYKGYSSTIP IARFIDAILE PGKVINWNGK SVKLPPLKMC
IFAGTNPFHR HQQINRIIEG WRKLETVIAI DNQWTSTCRF ADIVLPATTQ FERNDLDQYG
NHSNRGIIAM KQVVPPQFEA RNDFDIFREL CRRFNREEAF TEGLDEMGWL KRIWQEGVQQ
GKGRGVHLPA FDDFWNNKEY VEFDHPQMFV RHQAFREDPD LEPLGTPSGL IEIYSKTIAD
MNYDDCQGHP MWFEKIERSH GGPGSQKYPL HLQSVHPDFR LHSQLCESET LRQQYTVAGK
EPVFINPQDA SARGIRNGDV VRVFNARGQV LAGAVVSDRY APGVARIHEG AWYDPDKGGE
PGALCKYGNP NVLTIDIGTS QLAQATSAHT TLVEIEKYNG TVEQVTAFNG PVEMVAQCEY
VPASQVKS
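
The sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-content calculation; `clean_sequence` and `gc_percent` are hypothetical helper names, and the sample string is only the first line of the gene sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence shown above (six blocks of ten bases).
first_line = ("ATGAACAATA ACGATCTCTT TCAGGCATCA "
              "CGTCGGCGTT TTCTGGCACA ACTCGGCGGC")
seq = clean_sequence(first_line)
print(len(seq))               # 60
print(seq.startswith("ATG"))  # True
print(gc_percent("ATGC"))     # 50.0
```

Applied to the full gene sequence, the cleaned length would be expected to match the Gene Length field (2547 bp), and the rounded GC percentage could be compared with the GC content field.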