Gene EcolC_1738 details


Gene Information

Locus tag: EcolC_1738
Symbol:
ID: 6065352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1935078
End bp: 1936502
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 11
GC content: 49%
IMG OID: 641601153
Product: trehalose-6-phosphate synthase
Protein accession: YP_001724715
Protein GI: 170019761
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0380] Trehalose-6-phosphate synthase
TIGRFAM ID: [TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming]
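
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record above.
START_BP = 1935078
END_BP = 1936502


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one, assuming a single terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 1425
print(protein_length_aa(length))  # 474
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.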


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.483813
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAACCGG ATTGCACCAC CAGACGAGCA CGCCGCCAGT 
GCCGGTGGCC TTGCCGTTGG CATACTGGGG GCACTGAAAG CCGCAGGCGG ACTGTGGTTT
GGCTGGAGTG GTGAAACAGG GAATGAGGAT CAGCCGCTAA AAAAGGTGAA AAAAGGTAAC
ATTACGTGGG CCTCTTTTAA CCTCAGCGAA CAGGACCTTG ACGAATACTA CAACCAATTC
TCCAATGCCG TTCTCTGGCC CGCTTTTCAT TATCGGCTCG ATCTGGTGCA ATTTCAGCGT
CCTGCCTGGG ACGGCTATCT ACGCGTAAAT GCGTTGCTGG CAGATAAATT ACTGCCGCTG
TTGCAAGACG ATGACATTAT CTGGATCCAC GATTATCACC TGTTGCCATT TGCGCATGAA
TTACGCAAAC GGGGAGTGAA TAATCGCATT GGTTTCTTTC TGCATATTCC TTTCCCGACA
CCGGAAATCT TCAACGCGCT GCCGACATAT GACACCTTGC TTGAACAGCT TTGTGATTAT
GATTTGCTGG GTTTCCAGAC AGAAAACGAT CGTCTGGCGT TCCTGGATTG TCTTTCTAAC
CTGACCCGCG TCACGACACG TAGCGCAAAA AGCCATACAG CCTGGGGCAA AGCATTTCGA
ACAGAAGTCT ACCCGATCGG CATTGAACCG AAAGAAATAG CCAAACAGGC TGCCGGGCCA
CTGCCGCCAA AACTGGCGCA ACTTAAAGCG GAACTGAAAA ACGTACAAAA TATCTTTTCT
GTCGAACGGC TGGATTATTC CAAAGGTTTG CCAGAGCGTT TTCTCGCCTA TGAAGCGTTG
CTGGAAAAAT ATCCGCAGCA TCATGGTAAA ATTCGTTATA CCCAGATTGC ACCAACGTCG
CGTGGTGATG TGCAAGCCTA TCAGGATATT CGTCATCAGC TCGAAAATGA AGCTGGACGA
ATTAATGGTA AATACGGGCA ATTAGGCTGG ACGCCGCTTT ATTATTTGAA TCAGCATTTT
GACCGTAAAT TACTGATGAA AATATTCCGC TACTCTGACG TGGGCTTAGT GACGCCACTG
CGTGACGGGA TGAACCTGGT AGCAAAAGAG TATGTTGCTG CTCAGGACCC AGCCAATCCG
GGCGTTCTTG TTCTTTCGCA ATTTGCGGGA GCGGCAAACG AGTTAACGTC GGCGTTAATT
GTTAACCCCT ACGATCGTGA CGAAGTTGCA GCTGCGCTGG ATCGTGCATT GACTATGTCG
CTGGCGGAAC GTATTTCCCG TCATGCAGAA ATGCTGGACG TTATCGTGAA AAACGATATT
AACCACTGGC AGGAGTGCTT CATTAGCGAC CTAAAGCAGA TAGTTCCGCG AAGCGCGGAA
AGCCAGCAGC GCGATAAAGT TGCTACCTTT CCAAAGCTTG CGTAG
 
Protein sequence
MSRLVVVSNR IAPPDEHAAS AGGLAVGILG ALKAAGGLWF GWSGETGNED QPLKKVKKGN 
ITWASFNLSE QDLDEYYNQF SNAVLWPAFH YRLDLVQFQR PAWDGYLRVN ALLADKLLPL
LQDDDIIWIH DYHLLPFAHE LRKRGVNNRI GFFLHIPFPT PEIFNALPTY DTLLEQLCDY
DLLGFQTEND RLAFLDCLSN LTRVTTRSAK SHTAWGKAFR TEVYPIGIEP KEIAKQAAGP
LPPKLAQLKA ELKNVQNIFS VERLDYSKGL PERFLAYEAL LEKYPQHHGK IRYTQIAPTS
RGDVQAYQDI RHQLENEAGR INGKYGQLGW TPLYYLNQHF DRKLLMKIFR YSDVGLVTPL
RDGMNLVAKE YVAAQDPANP GVLVLSQFAG AANELTSALI VNPYDRDEVA AALDRALTMS
LAERISRHAE MLDVIVKNDI NHWQECFISD LKQIVPRSAE SQQRDKVATF PKLA
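
The gene and protein sequences above are related through translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, not the tool that produced this record. It builds the codon table from the standard NCBI amino-acid string, strips the 10-base grouping spaces, and translates up to the first stop codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for NCBI translation table 11, in codon order over BASES
# (TTT, TTC, TTA, TTG, TCT, ...); "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and newlines used to group bases in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence listed above.
first_groups = "ATGAGTCGTT TAGTCGTAGT ATCTAACCGG"
print(translate(first_groups))  # MSRLVVVSNR
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.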