Gene ECH74115_4764 details


Gene Information

Locus tag: ECH74115_4764
Symbol: ugpC
ID: 6968129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 4408849
End bp: 4409919
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 58%
IMG OID: 643388460
Product: glycerol-3-phosphate transporter ATP-binding subunit
Protein accession: YP_002272888
Protein GI: 209399510
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
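
The start and end coordinates, gene length, and protein length listed above are mutually consistent. The following is a minimal Python sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of the source record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count: total codons minus the terminal stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 4408849, End bp 4409919
length = gene_length(4408849, 4409919)
print(length, protein_length(length))  # prints: 1071 356
```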


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 47
Fosmid unclonability p-value: 0.257443
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGGAC TGAAATTACA GGCAGTAACC AAAAGCTGGG ATGGCAAAAC CCAGGTGATT 
AAACCGCTGA CCCTTGATGT GGCGGATGGC GAATTTATCG TGATGGTCGG GCCGTCAGGC
TGCGGGAAAT CGACGCTGCT GCGCATGGTT GCCGGGCTGG AGCGGGTGAC GGAAGGCGAT
ATCTGTATCA ACGACCAGCG GGTGACCGAA ATGGAGCCGA AAGATCGCGG GATTGCGATG
GTGTTCCAGA ACTACGCGCT TTATCCGCAT ATGAGTGTTG AAGAAAACAT GGCGTGGGGG
CTGAAAATTC GCGGCATGGG CAAGCAGCAA ATTGCCGAGC GCGTTAAAGA GGCGGCGCGC
ATTCTGGAAC TGGACGGTCT GCTTAAGCGC CGCCCGCGCG AGCTTTCCGG CGGTCAGCGT
CAGCGTGTGG CGATGGGCCG GGCGATTGTG CGCGATCCGG CGGTGTTCCT GTTTGATGAG
CCACTCTCTA ACCTCGATGC CAAGCTGCGC GTACAGATGC GTCTTGAACT GCAACAGCTG
CACCGTCGCC TGAAAACGAC TTCACTCTAC GTTACTCACG ATCAGGTTGA GGCGATGACC
CTCGCCCAGC GAGTAATGGT GATGAACGGC GGCGTTGCCG AACAGATTGG CACACCAGTT
GAAGTCTACG AAAAGCCCGC CAGCCTGTTT GTGGCGAGTT TTATTGGCAG CCCGGCGATG
AATCTGCTGA CAGGCCGCGT GAATAACGAA GGCACGCACT TCGAGCTGGA CGGCGGCATT
GCGCTGCCGC TAAACGGTGG CTACCGTCAG TATGCAGGGC GTAAAATGAC TCTCGGCATT
CGCCCGGAAC ATATCGCGCT AAGCTCGCAG GCAGAAGGCG GCGTGCCGCT GGTGATGGAC
ACGCTGGAGA TCCTCGGCGC AGATAACCTG GCGCACGGAC GCTGGGGCGA ACAGAAGCTG
GTGGTACGGC TGGCGCATCA GGAGCGCCCG ACGGCAGGCA GCACGCTGTG GCTGCATCTG
CCGGAAAATC AGCTACATCT TTTTGATGGT GAAACAGGAC AACGAGTATG A
 
Protein sequence
MAGLKLQAVT KSWDGKTQVI KPLTLDVADG EFIVMVGPSG CGKSTLLRMV AGLERVTEGD 
ICINDQRVTE MEPKDRGIAM VFQNYALYPH MSVEENMAWG LKIRGMGKQQ IAERVKEAAR
ILELDGLLKR RPRELSGGQR QRVAMGRAIV RDPAVFLFDE PLSNLDAKLR VQMRLELQQL
HRRLKTTSLY VTHDQVEAMT LAQRVMVMNG GVAEQIGTPV EVYEKPASLF VASFIGSPAM
NLLTGRVNNE GTHFELDGGI ALPLNGGYRQ YAGRKMTLGI RPEHIALSSQ AEGGVPLVMD
TLEILGADNL AHGRWGEQKL VVRLAHQERP TAGSTLWLHL PENQLHLFDG ETGQRV
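
The protein sequence can be checked against the gene sequence by translating it, and the GC content can be recomputed from the same string. The sketch below is self-contained Python with illustrative helper names (`translate`, `gc_content`) that do not come from the source. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares for sense and stop codons (the tables differ in which start codons are permitted), and it strips the spaces and line breaks used in the listing above.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for 10-base blocks and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 1, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate("ATGGCAGGAC TGAAATTACA GGCAGTAACC"))  # prints: MAGLKLQAVT
```

Passing the full gene sequence to `translate` and comparing the result with the protein sequence (after removing spaces) is one way to confirm that the two listings agree.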