Gene EcHS_A1993 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1993 
SymbolotsA 
ID5592242 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1999815 
End bp2001239 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content49% 
IMG OID640921139 
Producttrehalose-6-phosphate synthase 
Protein accessionYP_001458687 
Protein GI157161369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0380] Trehalose-6-phosphate synthase 
TIGRFAM ID[TIGR02400] alpha,alpha-trehalose-phosphate synthase [UDP-forming] 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.000154738 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCGTT TAGTCGTAGT ATCTAACCGG ATTGCACCAC CAGACGAGCA CGCCGCCAGT 
GCCGGTGGCC TTGCCGTTGG CATACTGGGG GCACTGAAAG CCGCAGGCGG ACTGTGGTTT
GGCTGGAGTG GTGAAACAGG GAATGAGGAT CAGCCGCTAA AAAAGGTGAA AAAAGGTAAC
ATTACGTGGG CCTCTTTTAA CCTCAGCGAA CAGGACCTTG ACGAATACTA CAACCAATTC
TCCAATGCCG TTCTCTGGCC CGCTTTTCAT TATCGGCTCG ATCTGGTGCA ATTTCAGCGT
CCTGCCTGGG ACGGCTATCT ACGCGTAAAT GCGTTGCTGG CAGATAAATT ACTGCCGCTG
TTGCAAGACG ATGACATTAT CTGGATCCAC GATTATCACC TGTTGCCATT TGCGCATGAA
TTACGCAAAC GGGGAGTGAA TAATCGCATT GGTTTCTTTC TGCATATTCC TTTCCCGACA
CCGGAAATCT TCAACGCGCT GCCGACATAT GACACCTTGC TTGAACAGCT TTGTGATTAT
GATTTGCTGG GTTTCCAGAC AGAAAACGAT CGTCTGGCGT TCCTGGATTG TCTTTCTAAC
CTGACCCGCG TCACGACACG TAGCGCAAAA AGCCATACAG CCTGGGGCAA AGCATTTCGA
ACAGAAGTCT ACCCGATCGG CATTGAACCG AAAGAAATAG CCAAACAGGC TGCCGGGCCA
CTGCCGCCAA AACTGGCGCA ACTTAAAGCG GAACTGAAAA ACGTACAAAA TATCTTTTCT
GTCGAACGGC TGGATTATTC CAAAGGTTTG CCAGAGCGTT TTCTCGCCTA TGAAGTGTTG
CTGGAAAAAT ATCCGCAGCA TCATGGTAAA ATTCGTTATA CCCAGATTGC ACCAACGTCG
CGTGGTGATG TGCAAGCCTA TCAGGATATT CGTCATCAGC TCGAAAATGA AGCTGGACGA
ATTAATGGTA AATACGGGCA ATTAGGCTGG ACGCCGCTTT ATTATTTGAA TCAGCATTTT
GACCGTAAAT TACTGATGAA AATATTCCGC TACTCTGACG TGGGCTTAGT GACGCCACTG
CGTGACGGGA TGAACCTGGT AGCAAAAGAG TATGTTGCTG CTCAGGACCC AGCCAATCCG
GGCGTTCTTG TTCTTTCGCA ATTTGCGGGA GCGGCAAACG AGTTAACGTC GGCGTTAATT
GTTAACCCCT ACGATCGTGA CGAAGTTGCA GCTGCGCTGG ATCGTGCATT GACTATGTCG
CTGGCGGAAC GTATTTCCCG TCATGCAGAA ATGCTGGACG TTATCGTGAA AAACGATATT
AACCACTGGC AGGAGTGCTT CATTAGCGAC CTAAAGCAGA TAGTTCCGCG AAGCGCGGAA
AGCCAGCAGC GCGATAAAGT TGCTACCTTT CCAAAGCTTG CGTAG
 
Protein sequence
MSRLVVVSNR IAPPDEHAAS AGGLAVGILG ALKAAGGLWF GWSGETGNED QPLKKVKKGN 
ITWASFNLSE QDLDEYYNQF SNAVLWPAFH YRLDLVQFQR PAWDGYLRVN ALLADKLLPL
LQDDDIIWIH DYHLLPFAHE LRKRGVNNRI GFFLHIPFPT PEIFNALPTY DTLLEQLCDY
DLLGFQTEND RLAFLDCLSN LTRVTTRSAK SHTAWGKAFR TEVYPIGIEP KEIAKQAAGP
LPPKLAQLKA ELKNVQNIFS VERLDYSKGL PERFLAYEVL LEKYPQHHGK IRYTQIAPTS
RGDVQAYQDI RHQLENEAGR INGKYGQLGW TPLYYLNQHF DRKLLMKIFR YSDVGLVTPL
RDGMNLVAKE YVAAQDPANP GVLVLSQFAG AANELTSALI VNPYDRDEVA AALDRALTMS
LAERISRHAE MLDVIVKNDI NHWQECFISD LKQIVPRSAE SQQRDKVATF PKLA