Gene ECH74115_2633 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_2633 
SymbolotsA 
ID6971571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp2487634 
End bp2489058 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content49% 
IMG OID643386496 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_002270978 
Protein GI209398057 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0142463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000000000000299712 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAACCGG ATTGCACCAC CAGATGAGCA CGCCGCCAGT 
GCCGGTGGCC TTGCCGTTGG CATACTGGGG GCACTGAAAG CCGCAGGCGG ACTGTGGTTT
GGCTGGAGTG GTGAAACAGG GAATGAGGAT CAGCCGCTAA AAAAGGTGAA AAAGGGTAAC
ATTACGTGGG CCTCTTTTAA CCTCAGCGAA CAGGACCTTG ACGAATACTA CAACCAATTC
TCCAATGCCG TTCTCTGGCC TGCTTTTCAT TATCGCCTCG ATCTGGTGCA ATTTCAGCGT
CCTGCCTGGG ACGGCTATCT ACGCGTAAAT GCGTTGCTGG CAGATAAATT ACTGCCGCTG
TTGCAAGACG ATGACATTAT CTGGATCCAC GATTATCACC TGTTGCCATT TGCGCATGAA
TTACGCAAAC GGGGAGTGAA TAATCGCATT GGTTTCTTTC TGCATATTCC TTTCCCGACA
CCGGAAATCT TCAACGCGCT GCCGACATAT GACACCTTGC TTGAACAGCT TTGTGATTAT
GATTTGCTGG GTTTCCAGAC AGAAAACGAT CGTCTGGCGT TCCTGGATTG CCTTTCTAAC
CTGACCCGCG TCACGACACG TAGCGCAAAA AGCCATACAG CCTGGGGCAA AGCGTTTCGA
ACAGAAGTTT ACCCGATTGG CATTGAGCCA AAAGAAATAG CCAAACAGGC TGCCGGGCCA
CTGCCGCCAA AACTGGCGCA ACTTAAAGCG GAACTGAAAA ACGTACAAAA TATCTTTTCT
GTCGAACGGC TGGATTATTC CAAAGGTTTG CCAGAGCGTT TTCTCGCCTA TGAAGCGTTG
CTGGAAAAAT ATCCGCAGCA TCATGGTAAA ATTCGTTATA CCCAGATTGC ACCAACGTCG
CGTGGTGATG TGCAAGCCTA TCAGGATATT CGTCATCAGC TCGAAAATGA AGCCGGACGA
ATTAATGGTA AATACGGGCA ATTAGGCTGG ACGCCGCTTT ATTATTTGAA TCAGCATTTT
GACCGTAAAT TACTGATGAA AATATTCCGC TACTCTGACG TGGGCTTAGT GACGCCACTG
CGTGACGGGA TGAACCTGGT AGCAAAAGAG TATGTTGCCG CTCAGGACCC AGCCAACCCG
GGCGTTCTTG TTCTTTCGCA ATTTGCGGGA GCGGCAAACG AGTTAACGTC GGCGTTAATT
GTTAATCCCT ACGATCGTGA CGAAGTTGCA GCTGCGCTGG ATCGTGCATT GACTATGTCG
CTGGCGGAAC GTATTTCCCG TCATGCAGAA ATGCTGGACG TTATCGTGAA AAACGATATT
AACCACTGGC AGGAGTGCTT CATTAGCGAC CTAAAGCAGA TAGTTCCGCG AAGCGCGGAA
AGCCAGCAGC GCGATAAAGT TGCTACCTTT CCTAAGCTTG TGTAG
 
Protein sequence
MSRLVVVSNR IAPPDEHAAS AGGLAVGILG ALKAAGGLWF GWSGETGNED QPLKKVKKGN 
ITWASFNLSE QDLDEYYNQF SNAVLWPAFH YRLDLVQFQR PAWDGYLRVN ALLADKLLPL
LQDDDIIWIH DYHLLPFAHE LRKRGVNNRI GFFLHIPFPT PEIFNALPTY DTLLEQLCDY
DLLGFQTEND RLAFLDCLSN LTRVTTRSAK SHTAWGKAFR TEVYPIGIEP KEIAKQAAGP
LPPKLAQLKA ELKNVQNIFS VERLDYSKGL PERFLAYEAL LEKYPQHHGK IRYTQIAPTS
RGDVQAYQDI RHQLENEAGR INGKYGQLGW TPLYYLNQHF DRKLLMKIFR YSDVGLVTPL
RDGMNLVAKE YVAAQDPANP GVLVLSQFAG AANELTSALI VNPYDRDEVA AALDRALTMS
LAERISRHAE MLDVIVKNDI NHWQECFISD LKQIVPRSAE SQQRDKVATF PKLV