Gene EcolC_1412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1412 
SymbolglpQ 
ID6067805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1545509 
End bp1546585 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content46% 
IMG OID641600831 
Productglycerophosphodiester phosphodiesterase 
Protein accessionYP_001724402 
Protein GI170019448 
COG category[C] Energy production and conversion 
COG ID[COG0584] Glycerophosphoryl diester phosphodiesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.308397 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTGA AGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGCACTAT AGTCATGGGA 
AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT
GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT
TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACCATC TGGTTGTTCT GCATGACCAT
TATCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT
CGTTACTACG CGATAGATTT CACGCTGGAT GAAATTAAGT CGCTGAAATT TACCGAAGGT
TTCGATATTG AAAACGGTAA AAAAGTACAG ACTTATCCGG GGCGTTTCCC AATGGGTAAG
TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC
TCTACCGGGA AAAATATCGG TATCTATCCA GAAATCAAAG CGCCGTGGTT CCATCATCAG
GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT
AAGGACGATA AAGTTTATTT GCAATGTTTT GATGCTGATG AGCTGAAGCG TATTAAGAAT
GAGCTGGAAC CCAAAATGGG CATGGAGCTC AATTTGGTAC AGCTGATTGC CTATACCGAC
TGGAATGAAA CGCAGCAGAA ACAGCCGGAC GGAAGCTGGG TTAATTACAA CTACGACTGG
ATGTTTAAGC CGGGTGCCAT GAAACAGGTG GCGGAATATG CAGATGGTAT TGGTCCGGAT
TACCATATGT TGATTGAGGA GACATCGCAG CCGGGTAATA TCAAACTCAC TGGCATGGTG
CAAGATGCTC AGCAGAACAA GCTGGTAGTG CATCCTTATA CCGTGCGGTC AGATAAACTG
CCTGAATACA CAACTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA
AATGGGTTGT TTACTGATTT CCCTGATAAG GCAGTAAAAT TTCTTAATAA AGAGTAA
 
Protein sequence
MKLKLKNLSM AIMMSTIVMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD 
YLEQDLVMTK DDHLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG
FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ
EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMEL NLVQLIAYTD
WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV
QDAQQNKLVV HPYTVRSDKL PEYTTDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE