Gene EcSMS35_2129 details

Gene Information

Locus tag: EcSMS35_2129
Symbol: torT
ID: 6146354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2139341
End bp: 2140369
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 54%
IMG OID: 641617006
Product: TMAO reductase system periplasmic protein TorT
Protein accession: YP_001744181
Protein GI: 170680014
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1879] ABC-type sugar transport system, periplasmic component
TIGRFAM ID: [TIGR02955] TMAO reductase system periplasmic protein TorT
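
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2139341
end_bp = 2140369

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1029 342
```

Under these assumptions the result matches the listed Gene Length (1029 bp) and Protein Length (342 aa).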


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGCAC TGCTATTGTT ACTTCTTTCC CTTTTCATGT TATCGGCATT TTCGGCTGAT 
AACCTGTTGC GCTGGCATGA TGCGCAGCAT TTCTCGGTGC AAGCCTCTAT GCCGCTTAAA
GCCAAACGCG CATGGAAACT GTGCGCGCTT TATCCCAGCC TGAAAGATTC ATACTGGTTA
TCGTTGAACT ATGGTATGCA GGAGGCTGCT CGCCGCTACG GTGTGGATTT AAAAGTGCTG
GAGGCAGGCG GCTACAGCCA GTTGGCTACC CAGCAAGCAC AAATCGACCA GTGTAAACAG
TGGGGCGCAG AGGCCATCTT GCTCGGAAGT AGCACGACGT CATTTCCCGA CCTGCAAAAG
CAGGTAGCAA ATCTGCCGGT GATCGAACTG GTGAATGCTA TTGATGCTCC CCACGTGAAA
AGCCGCGTTG GTGTGCCCTG GTTTCAGATG GGCTATCAAC CAGGACGATA TCTGGTGCAA
TGGAGCCACG GTAAACCACT GAACGTGCTG TTGATGCCCG GCCCCGATAA CGCCGGGGGC
AGTAAGGAGA TGGTAGAGGG TTTTCGCGCA GCCATTGCCG GAAGTCCGGT ACGTATTGTT
GATATTGCGC TCGGTGATAA CGATATTGAA ATCCAGCGTA ACCTGTTGCA GGAGATGCTG
GAGCGCCATC CAGAAATCGA CGTCGTTGCC GGAACGGCCA TTGCGGCAGA GGCGGCAATG
GGGGAAGGGC GTAACCTGAA AACACCGCTT ACCGTGGTGT CGTTTTATCT TTCACATCAG
GTGTATCGCG GGCTGAAGCG GGGAAGAGTG ATTATGGCTG CCAGCGATCA AATGGTCTGG
CAGGGGGAAC TGGCGGTTGA GCAGGCCATC AGGCAATTAC AGGGGCAATC GGTGTCTGAT
AATGTCAGCC CACCGATTTT AGTTCTGACG CCGAAAAATG CCGACCGCGA ACATATCCGC
CGCTCGCTGT CACCGGGGGG ATTTCGTCCG GTCTATTATT ATCAGCACAC ATCAGCGGCT
AAGAAATAA
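
The GC content field in the Gene Information section is the share of G and C bases in a nucleotide sequence. The sketch below shows one way such a figure could be computed from a sequence block formatted like the one above. The function name `gc_percent` is hypothetical, and the database may round or compute the value differently.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring spaces and newlines."""
    # Keep only nucleotide letters so the 10-base grouping does not matter.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)

# First ten bases of the gene sequence above: 7 of 10 are G or C.
print(gc_percent("ATGCGCGCAC"))  # 70.0
```

Passing the full gene sequence to the function gives a value that can be compared with the listed GC content.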
 
Protein sequence
MRALLLLLLS LFMLSAFSAD NLLRWHDAQH FSVQASMPLK AKRAWKLCAL YPSLKDSYWL 
SLNYGMQEAA RRYGVDLKVL EAGGYSQLAT QQAQIDQCKQ WGAEAILLGS STTSFPDLQK
QVANLPVIEL VNAIDAPHVK SRVGVPWFQM GYQPGRYLVQ WSHGKPLNVL LMPGPDNAGG
SKEMVEGFRA AIAGSPVRIV DIALGDNDIE IQRNLLQEML ERHPEIDVVA GTAIAAEAAM
GEGRNLKTPL TVVSFYLSHQ VYRGLKRGRV IMAASDQMVW QGELAVEQAI RQLQGQSVSD
NVSPPILVLT PKNADREHIR RSLSPGGFRP VYYYQHTSAA KK
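
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the database's own method. It relies on the fact that NCBI translation table 11 assigns the same amino acids as the standard code and differs only in the permitted start codons. The helper `translate` is hypothetical and does not handle alternative start codons, which is sufficient here because this gene begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 uses the same assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(seq: str) -> str:
    """Translate a DNA string, ignoring whitespace and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the gene sequence above.
print(translate("ATGCGCGCAC TGCTATTGTT ACTTCTTTCC"))  # MRALLLLLLS
# Last 9 bases: two lysine codons followed by the TAA stop codon.
print(translate("AAGAAATAA"))  # KK
```

Both outputs agree with the start (MRALLLLLLS) and end (KK) of the protein sequence listed above.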