Gene EcHS_A3647 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A3647 
SymbolugpC 
ID5594087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp3634368 
End bp3635438 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content58% 
IMG OID640922764 
Productglycerol-3-phosphate transporter ATP-binding subunit 
Protein accessionYP_001460244 
Protein GI157162926 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3839] ABC-type sugar transport systems, ATPase components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones69 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGGAC TGAAATTACA GGCAGTAACC AAAAGCTGGG ATGGTAAAAC CCAGGTCATT 
AAACCGCTGA CCCTTGATGT GGCGGATGGC GAATTTATCG TGATGGTCGG GCCGTCTGGC
TGCGGGAAAT CGACGCTGCT GCGCATGGTT GCCGGGCTGG AGCGGGTGAC AGAAGGCGAT
ATCTGGATCA ACGACCAGCG CGTGACTGAA ATGGAGCCAA AAGATCGCGG GATTGCGATG
GTGTTCCAGA ACTACGCGCT TTATCCGCAT ATGAGTGTCG AAGAAAACAT GGCGTGGGGG
CTGAAAATTC GCGGCATGGG CAAGCAGCAA ATTGCCGAGC GCGTTAAAGA AGCGGCGCGC
ATTCTGGAGC TGGACGGTCT GCTCAAACGT CGCCCGCGCG AGCTTTCCGG CGGTCAGCGC
CAGCGTGTGG CGATGGGCCG CGCGATTGTG CGCGATCCGG CGGTGTTCCT GTTTGATGAG
CCGCTCTCTA ACCTCGATGC CAAGCTGCGC GTGCAGATGC GTCTTGAACT GCAACAGTTG
CACCGTCGCC TGAAAACGAC TTCACTCTAC GTTACTCACG ATCAGGTTGA AGCGATGACG
CTCGCCCAGC GAGTAATGGT GATGAACGGC GGCGTTGCCG AACAGATTGG CACACCAGTT
GAAGTCTACG AAAAGCCCGC CAGCCTGTTT GTAGCGAGTT TTATCGGCAG TCCGGCGATG
AATCTGCTGA CAGGCCGCGT GAATAACGAA GGCACGCATT TCGAACTGGA CGGCGGTATT
GAGCTGCCGC TAAACGGTGG CTACCGTCAG TATGCCGGGC GTAAAATGAC TCTCGGCATT
CGCCCGGAAC ATATTGCGCT AAGCTCGCAG GCAGAAGGCG GCGTACCGCT GGTGATGGAC
ACGCTGGAGA TCCTCGGCGC AGATAACCTG GCGCACGGAC GCTGGGGCGA GCAGAAGCTG
GTGGTGCGAC TGGCGCATCA GGAGCGCCCG ACGGCAGGCA GCACGCTGTG GCTGCATCTG
GCGGAAAATC AGCTGCATCT TTTTGATGGT GAAACAGGAC AACGAGTATG A
 
Protein sequence
MAGLKLQAVT KSWDGKTQVI KPLTLDVADG EFIVMVGPSG CGKSTLLRMV AGLERVTEGD 
IWINDQRVTE MEPKDRGIAM VFQNYALYPH MSVEENMAWG LKIRGMGKQQ IAERVKEAAR
ILELDGLLKR RPRELSGGQR QRVAMGRAIV RDPAVFLFDE PLSNLDAKLR VQMRLELQQL
HRRLKTTSLY VTHDQVEAMT LAQRVMVMNG GVAEQIGTPV EVYEKPASLF VASFIGSPAM
NLLTGRVNNE GTHFELDGGI ELPLNGGYRQ YAGRKMTLGI RPEHIALSSQ AEGGVPLVMD
TLEILGADNL AHGRWGEQKL VVRLAHQERP TAGSTLWLHL AENQLHLFDG ETGQRV