Gene ECH74115_3376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3376 
SymbolglpQ 
ID6970744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3120669 
End bp3121745 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content46% 
IMG OID643387185 
Productglycerophosphodiester phosphodiesterase 
Protein accessionYP_002271648 
Protein GI209396883 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones59 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA CGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGCACTAT AGTCATGGGA 
AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT
GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT
TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACCATC TGGTTGTTCT GCATGACCAT
TACCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT
CGTTACTACG CGATAGATTT CACGCTGGAT GAAATTAAGT CGTTGAAATT TACCGAAGGT
TTCGATATTG AAAACGGTAA AAAAGTGCAG ACTTATCCAG GGCGTTTCCC AATGGGTAAG
TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC
TCTACCGGGA AAAATATCGG TATCTATCCA GAAATCAAAG CGCCGTGGTT CCATCATCAG
GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT
AAAGACGATA AAGTTTATTT GCAATGTTTT GATGCTGATG AGCTGAAGCG TATTAAGAAT
GAGCTGGAAC CCAAAATGGG CATGGATCTC AATCTGGTAC AGCTGATTGC CTATACCGAC
TGGAATGAAA CGCAGCAGAA ACAGCCGGAC GGAAGCTGGG TTAATTACAA CTACGACTGG
ATGTTTAAGC CGGGTGCCAT GAAACAGGTG GCGGAATATG CAGATGGTAT TGGTCCGGAT
TACCATATGT TGATTGAGGA GACATCGCAG CCGGGTAATA TCAAACTCAC TGGCATGGTG
CAAGATGCTC AGCAGAACAA GCTGGTAGTG CATCCTTATA CCGTGCGGTC AGATAAACTG
CCTGAATACA CAACTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA
AATGGGTTGT TTACTGATTT CCCTGATAAA GCGGTTAAAT TCCTTAATAA AGAGTAA
 
Protein sequence
MKLTLKNLSM AIMMSTIVMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD 
YLEQDLVMTK DDHLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG
FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ
EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMDL NLVQLIAYTD
WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV
QDAQQNKLVV HPYTVRSDKL PEYTTDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE