Gene Information |
Locus tag | EcolC_1412 |
Symbol | glpQ |
ID | 6067805 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 1545509 |
End bp | 1546585 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641600831 |
Product | glycerophosphodiester phosphodiesterase |
Protein accession | YP_001724402 |
Protein GI | 170019448 |
COG category | [C] Energy production and conversion |
COG ID | [COG0584] Glycerophosphoryl diester phosphodiesterase |
TIGRFAM ID | |
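The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; both are assumptions about how this record was computed, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(1545509, 1546585)
print(length, protein_length(length))  # prints: 1077 358
```

Under these assumptions the result agrees with the Gene Length (1077 bp) and Protein Length (358 aa) fields.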
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.308397 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTGA AGCTGAAAAA CCTTAGCATG GCGATCATGA TGAGCACTAT AGTCATGGGA AGCAGTGCAA TGGCGGCGGA CAGCAACGAA AAAATAGTCA TCGCCCATCG CGGTGCCAGT GGATATTTGC CGGAGCATAC GCTGCCAGCA AAAGCGATGG CGTATGCGCA GGGAGCGGAT TATCTGGAAC AGGATTTGGT GATGACCAAA GACGACCATC TGGTTGTTCT GCATGACCAT TATCTCGATC GTGTTACTGA TGTTGCCGAT CGTTTCCCGG ATCGGGCGCG CAAAGACGGT CGTTACTACG CGATAGATTT CACGCTGGAT GAAATTAAGT CGCTGAAATT TACCGAAGGT TTCGATATTG AAAACGGTAA AAAAGTACAG ACTTATCCGG GGCGTTTCCC AATGGGTAAG TCCGACTTCC GGGTGCACAC CTTTGAAGAA GAGATTGAAT TTGTTCAGGG GTTAAATCAC TCTACCGGGA AAAATATCGG TATCTATCCA GAAATCAAAG CGCCGTGGTT CCATCATCAG GAAGGGAAGG ATATTGCGGC AAAAACGCTG GAAGTGCTGA AGAAATATGG TTACACCGGT AAGGACGATA AAGTTTATTT GCAATGTTTT GATGCTGATG AGCTGAAGCG TATTAAGAAT GAGCTGGAAC CCAAAATGGG CATGGAGCTC AATTTGGTAC AGCTGATTGC CTATACCGAC TGGAATGAAA CGCAGCAGAA ACAGCCGGAC GGAAGCTGGG TTAATTACAA CTACGACTGG ATGTTTAAGC CGGGTGCCAT GAAACAGGTG GCGGAATATG CAGATGGTAT TGGTCCGGAT TACCATATGT TGATTGAGGA GACATCGCAG CCGGGTAATA TCAAACTCAC TGGCATGGTG CAAGATGCTC AGCAGAACAA GCTGGTAGTG CATCCTTATA CCGTGCGGTC AGATAAACTG CCTGAATACA CAACTGATGT GAATCAGTTA TATGATGCTC TGTATAACAA AGCGGGTGTA AATGGGTTGT TTACTGATTT CCCTGATAAG GCAGTAAAAT TTCTTAATAA AGAGTAA
Protein sequence | MKLKLKNLSM AIMMSTIVMG SSAMAADSNE KIVIAHRGAS GYLPEHTLPA KAMAYAQGAD YLEQDLVMTK DDHLVVLHDH YLDRVTDVAD RFPDRARKDG RYYAIDFTLD EIKSLKFTEG FDIENGKKVQ TYPGRFPMGK SDFRVHTFEE EIEFVQGLNH STGKNIGIYP EIKAPWFHHQ EGKDIAAKTL EVLKKYGYTG KDDKVYLQCF DADELKRIKN ELEPKMGMEL NLVQLIAYTD WNETQQKQPD GSWVNYNYDW MFKPGAMKQV AEYADGIGPD YHMLIEETSQ PGNIKLTGMV QDAQQNKLVV HPYTVRSDKL PEYTTDVNQL YDALYNKAGV NGLFTDFPDK AVKFLNKE
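Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be stripped before any analysis. The sketch below shows one way to do that in plain Python, then compute GC content and translate a coding sequence. It relies on the fact that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the two differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the example input is only the first 30 bases of the gene sequence above.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[pos:pos + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_30 = "ATGAAATTGA AGCTGAAAAA CCTTAGCATG"
print(translate(first_30))  # prints: MKLKLKNLSM
```

The ten residues printed match the first block of the protein sequence above. Passing the full gene sequence to `translate` is expected to reproduce the full protein sequence, and `gc_content` gives a value that can be compared with the GC content field.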