Gene EcSMS35_1289 details


Gene Information

Locus tag: EcSMS35_1289
Symbol: otsA
ID: 6143857
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1277904
End bp: 1279328
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 49%
IMG OID: 641616167
Product: trehalose-6-phosphate synthase
Protein accession: YP_001743347
Protein GI: 170683748
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
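
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1277904
end_bp = 1279328
gene_length_bp = 1425
protein_length_aa = 474

# Assumption: coordinates are 1-based and inclusive,
# so the span covers both the start and the end base.
span_bp = end_bp - start_bp + 1

# Assumption: a non-spliced CDS is a whole number of codons,
# and the final codon is a stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1425 0 474
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length.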


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00398964
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.000000119842
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAACCGG ATTGCACCAC CAGACGAGCA CGCCGCCAGT 
GCCGGTGGCC TTGCCGTTGG CATACTGGGG GCACTGAAAG CCGCAGGCGG ACTGTGGTTT
GGCTGGAGTG GTGAAACAGG GAATGAGGAT CAGCCGCTAA AAAAGGTGAA AAAAGGTAAC
ATTACGTGGG CCTCTTTTAA CCTCAGCGAA CAGGACCTTG ACGAATACTA CAACCAATTC
TCCAATGCCG TTCTCTGGCC CGCTTTTCAT TATCGGCTCG ATCTGGTGCA ATTTCAGCGT
CCTGCCTGGG ACGGCTATCT ACGCGTAAAT GCGTTGCTGG CAGATAAATT ACTGCCGCTG
TTGCAAGACG ATGACATTAT CTGGATCCAC GATTATCACC TGTTGCCATT TGCGCATGAA
TTACGCAAAC GGGGAGTGAA TAATCGCATT GGTTTCTTTC TGCATATTCC TTTCCCGACA
CCGGAAATCT TCAACGCGCT GCCGACATAT GACACCTTGC TTGAACAGCT TTGTGATTAT
GATTTGCTGG GTTTCCAGAC AGAAAACGAT CGTCTGGCGT TCCTGGATTG TCTTTCTAAC
CTGACCCGCG TCACGACACG TAGCGCAAAA AGCCATACAG CCTGGGGCAA AGCATTTCGA
ACAGAAGTCT ACCCGATCGG CATTGAACCG AAAGAAATAG CCAAACAGGC TGCCGGGCCA
CTGCCGCCAA AACTGGCGCA ACTTAAAGCG GAACTGAAAA ACGTACAAAA TATCTTTTCT
GTCGAACGGC TGGATTATTC CAAAGGTTTG CCAGAGCGTT TTCTCGCCTA TGAAGCGTTG
CTGGAAAAAT ATCCGCAGCA TCATGGTAAA ATTCGTTATA CCCAGATTGC ACCAACGTCG
CGTGGTGATG TGCAAGCCTA TCAGGATATT CGTCATCAGC TCGAAAATGA AGCTGGACGA
ATTAATGGTA AATACGGGCA ATTAGGCTGG ACGCCGCTTT ATTATTTGAA TCAGCATTTT
GACCGTAAAT TACTGATGAA AATATTCCGC TACTCTGACG TGGGCTTAGT GACGCCACTG
CGTGACGGGA TGAACCTGGT AGCAAAAGAG TATGTTGCTG CTCAGGACCC AGCCAACCCG
GGCGTTCTTG TTCTTTCGCA ATTTGCGGGA GCGGCAAACG AGTTAACGTC GGCGTTAATT
GTTAACCCCT ACGATCGTGA CGAAGTTGCA GCTGCGCTGG ATCGTGCATT GACTATGTCG
CTGGCGGAAC GTATTTCCCG TCATGCAGAA ATGCTGGACG TTATCGTGAA AAACGATATT
AACCACTGGC AGGAGTGCTT CATTAGCGAC CTAAAGCAGA TAGTTCCGCG AAGCGCGGAA
AGCCAGCAGC GCGATAAAGT TGCTACCTTT CCAAAGCTTG CGTAG
 
Protein sequence
MSRLVVVSNR IAPPDEHAAS AGGLAVGILG ALKAAGGLWF GWSGETGNED QPLKKVKKGN 
ITWASFNLSE QDLDEYYNQF SNAVLWPAFH YRLDLVQFQR PAWDGYLRVN ALLADKLLPL
LQDDDIIWIH DYHLLPFAHE LRKRGVNNRI GFFLHIPFPT PEIFNALPTY DTLLEQLCDY
DLLGFQTEND RLAFLDCLSN LTRVTTRSAK SHTAWGKAFR TEVYPIGIEP KEIAKQAAGP
LPPKLAQLKA ELKNVQNIFS VERLDYSKGL PERFLAYEAL LEKYPQHHGK IRYTQIAPTS
RGDVQAYQDI RHQLENEAGR INGKYGQLGW TPLYYLNQHF DRKLLMKIFR YSDVGLVTPL
RDGMNLVAKE YVAAQDPANP GVLVLSQFAG AANELTSALI VNPYDRDEVA AALDRALTMS
LAERISRHAE MLDVIVKNDI NHWQECFISD LKQIVPRSAE SQQRDKVATF PKLA
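
The gene and protein sequences above are printed in blocks of 10 characters, so the spaces have to be removed before the two can be compared. The following is a minimal sketch of that check, not part of the original record. It translates the first and last lines of the gene sequence and compares the result with the matching ends of the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code. The helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 is
# assumed to use the same amino acid assignments for these codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# The last line starts at base 1381, so it is in frame.
gene_first_line = "ATGAGTCGTT TAGTCGTAGT ATCTAACCGG ATTGCACCAC CAGACGAGCA CGCCGCCAGT"
gene_last_line = "AGCCAGCAGC GCGATAAAGT TGCTACCTTT CCAAAGCTTG CGTAG"

# Matching ends of the protein sequence, copied from the record.
protein_start = clean("MSRLVVVSNR IAPPDEHAAS")
protein_end = clean("SQQRDKVATF PKLA")

print(translate(gene_first_line) == protein_start)
print(translate(gene_last_line) == protein_end)
```

The same `translate` function can be applied to the full gene sequence once all of its lines are joined into one string.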