Gene EcHS_A2379 details

Gene Information

Locus tag: EcHS_A2379
Symbol: glpQ
ID: 5595519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 2390917
End bp: 2391993
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 46%
IMG OID: 640921506
Product: glycerophosphodiester phosphodiesterase
Protein accession: YP_001459040
Protein GI: 157161722
COG category: [C] Energy production and conversion
COG ID: [COG0584] Glycerophosphoryl diester phosphodiesterase
TIGRFAM ID:
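
The gene and protein lengths listed above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (the function names are illustrative, not part of the source database):

```python
# Coordinates as listed in the record.
START_BP = 2390917
END_BP = 2391993


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1077
print(protein_length(length_bp))  # 358
```

Both values agree with the Gene Length and Protein Length fields of the record.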


Plasmid Coverage information

Num covering plasmid clones: 50
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTGA CGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGCACTAT AGTCATGGGA 
AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT
GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT
TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACAATC TGGTTGTTCT GCATGACCAT
TACCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT
CGTTACTACG CGATAGATTT CACGCTGGAT GAAATTAAGT CGTTGAAATT TACCGAAGGT
TTCGATATTG AAAACGGTAA AAAAGTGCAG ACTTATCCGG GGCGTTTCCC AATGGGTAAG
TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC
TCTACCGGGA AAAATATCGG TATTTATCCA GAAATCAAAG CGCCGTGGTT CCATCATCAG
GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT
AAAGACGATA AAGTTTATTT GCAATGTTTT GATGCTGATG AGCTGAAGCG TATTAAGAAT
GAGCTGGAAC CCAAAATGGG CATGGAGCTC AATCTGGTAC AGCTGATTGC CTATACCGAC
TGGAATGAAA CGCAGCAGAA ACAGCCGGAT GGAAGCTGGG TTAATTACAA CTACGACTGG
ATGTTTAAGC CGGGTGCCAT GAAACAGGTG GCGGAATATG CAGATGGTAT TGGTCCGGAT
TACCATATGT TGATTGAGGA GACATCGCAG CCGGGTAATA TCAAACTCAC TGGCATGGTG
CAAGATGCTC AGCAGAATAA ACTGGTAGTG CATCCTTATA CCGTGCGGTC AGATAAACTG
CCTGAATACA CTCCTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA
AATGGGCTGT TTACTGATTT CCCTGATAAG GCAGTAAAAT TTCTTAATAA AGAGTAA
 
Protein sequence
MKLTLKNLSM AIMMSTIVMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD 
YLEQDLVMTK DDNLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG
FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ
EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMEL NLVQLIAYTD
WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV
QDAQQNKLVV HPYTVRSDKL PEYTPDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE
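
The relationship between the two sequences above can be checked with a short script. This is a sketch only: it builds the standard codon assignments (translation table 11 uses the same amino acid assignments as the standard code and differs in its permitted start codons, which does not matter here because the gene begins with ATG), strips the whitespace used to group the bases in blocks of ten, and translates up to the first stop codon. The names `translate` and `gc_content` are illustrative helpers, not functions of any particular library.

```python
from itertools import product

# Standard codon assignments in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(dna)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
prefix = "ATGAAATTGA CGCTGAAAAA CCTTAGCATG"
print(translate(prefix))  # MKLTLKNLSM
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` on the same input can be compared against the GC content field of the record.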