Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3376 |
Symbol | glpQ |
ID | 6970744 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 3120669 |
End bp | 3121745 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643387185 |
Product | glycerophosphodiester phosphodiesterase |
Protein accession | YP_002271648 |
Protein GI | 209396883 |
COG category | [C] Energy production and conversion |
COG ID | [COG0584] Glycerophosphoryl diester phosphodiesterase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGA CGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGCACTAT AGTCATGGGA AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACCATC TGGTTGTTCT GCATGACCAT TACCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT CGTTACTACG CGATAGATTT CACGCTGGAT GAAATTAAGT CGTTGAAATT TACCGAAGGT TTCGATATTG AAAACGGTAA AAAAGTGCAG ACTTATCCAG GGCGTTTCCC AATGGGTAAG TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC TCTACCGGGA AAAATATCGG TATCTATCCA GAAATCAAAG CGCCGTGGTT CCATCATCAG GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT AAAGACGATA AAGTTTATTT GCAATGTTTT GATGCTGATG AGCTGAAGCG TATTAAGAAT GAGCTGGAAC CCAAAATGGG CATGGATCTC AATCTGGTAC AGCTGATTGC CTATACCGAC TGGAATGAAA CGCAGCAGAA ACAGCCGGAC GGAAGCTGGG TTAATTACAA CTACGACTGG ATGTTTAAGC CGGGTGCCAT GAAACAGGTG GCGGAATATG CAGATGGTAT TGGTCCGGAT TACCATATGT TGATTGAGGA GACATCGCAG CCGGGTAATA TCAAACTCAC TGGCATGGTG CAAGATGCTC AGCAGAACAA GCTGGTAGTG CATCCTTATA CCGTGCGGTC AGATAAACTG CCTGAATACA CAACTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA AATGGGTTGT TTACTGATTT CCCTGATAAA GCGGTTAAAT TCCTTAATAA AGAGTAA
|
Protein sequence | MKLTLKNLSM AIMMSTIVMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD YLEQDLVMTK DDHLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMDL NLVQLIAYTD WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV QDAQQNKLVV HPYTVRSDKL PEYTTDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE
|
| |