Gene EcSMS35_2391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2391 
SymbolglpQ 
ID6144249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2439256 
End bp2440332 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content46% 
IMG OID641617264 
Productglycerophosphodiester phosphodiesterase 
Protein accessionYP_001744436 
Protein GI170679810 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA CGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGTGGAAT GATCATGGGA 
AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT
GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT
TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACCATC TGGTTGTTCT GCATGACCAT
TATCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT
CGTTACTACG CGATAGATTT CACGCTCGAT GAAATTAAGT CACTGAAATT TACCGAAGGT
TTCGATATTG AAAACGGTAA GAAAGTACAG ACTTATCCGG GGCGTTTCCC AATGGGTAAG
TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC
TCTACCGGGA AAAATATTGG TATCTACCCG GAGATTAAAG CGCCGTGGTT CCATCATCAG
GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT
AAGGACGACA AAGTTTATTT GCAATGTTTT GATGCCGATG AACTGAAGCG TATTAAGAAT
GAGCTGGAAC CTAAAATGGG CATGGACCTC AATCTGGTAC AGCTGATTGC CTATACCGAC
TGGAACGAAA CTCAGCAGAA ACAGCCGGAC GGAAGCTGGG TTAATTACAA CTACGACTGG
ATGTTTAAGC CGGGGGCCAT GAAACAGGTG GCGGAATATG CTGATGGTAT TGGTCCGGAT
TACCATATGT TGATTGAGGA GACATCGCAG CCAGGTAATA TCAAACTCAC TGGCATGGTG
CAAGATGCTC AGCAGAATAA ACTGGTAGTG CATCCTTACA CCGTGCGGTC AGATAAGCTG
CCTGAATACA CTACTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA
AATGGGCTGT TTACTGATTT CCCTGATAAG GCAGTAAAAT TCCTTAATAA AGAGTAA
 
Protein sequence
MKLTLKNLSM AIMMSGMIMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD 
YLEQDLVMTK DDHLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG
FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ
EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMDL NLVQLIAYTD
WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV
QDAQQNKLVV HPYTVRSDKL PEYTTDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE