Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A2183 |
Symbol | gnd |
ID | 5594769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 2161862 |
End bp | 2163268 |
Gene Length | 1407 bp |
Protein Length | 468 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640921316 |
Product | 6-phosphogluconate dehydrogenase |
Protein accession | YP_001458855 |
Protein GI | 157161537 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0362] 6-phosphogluconate dehydrogenase |
TIGRFAM ID | [TIGR00873] 6-phosphogluconate dehydrogenase, decarboxylating |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.000119365 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCAAGC AACAGATCGG AGTTGTCGGT ATGGCTGTGA TGGGGCGTAA CCTAGCGCTG AACATCGAAA GCCGTGGTTA TACCGTCTCC GTTTTCAACC GCTCCCGTGA AAAGACCGAA GAAGTGATTG CCGAAAATCC AGGCAAGAAG CTGGTCCCTT ACTATACAGT ACAAGAGTTC GTTGAATCCC TTGAGACACC ACGCCGTATC CTGTTGATGG TGAAGGCCGG GGCTGGCACC GACAGCGCCA TCGATTCCCT GAAGCCTTAC CTCGACAAAG GCGACATCAT CATTGATGGC GGCAACACCT TCTTCCAGGA CACCATCCGT CGTAACCGTG AGCTGTCTGC CGAAGGTTTT AACTTTATCG GTACCGGTGT TTCCGGTGGT GAAGAGGGGG CCCTGAAAGG GCCTTCCATC ATGCCTGGTG GGCAGAAAGA AGCCTATGAG CTGGTTGCTC CGATTCTGGA GCAGATTGCT GCAGTTGCTG AAGATGGTGA GCCATGTGTG ACCTATATCG GTGCTGATGG CGCTGGCCAT TATGTGAAGA TGGTTCATAA CGGCATCGAG TATGGAGACA TGCAGCTGAT TGCTGAAGCT TATGCGCTGT TGAAAGGCGG TCTGGCACTT TCCAACGAAG AGCTGGCGCA AACCTTCACC GAATGGAACG AAGGCGAGCT AAGCAGCTAC CTGATCGATA TTACCAAAGA TATCTTCACC AAGAAGGATG AAGAGGGTAA ATACCTCGTT GATGTTATTC TTGATGAAGC AGCAAACAAA GGTACCGGCA AGTGGACCAG CCAGAGCTCA CTGGATCTTG GCGAACCTCT GTCTCTGATC ACCGAGTCCG TGTTTGCTCG CTATATTTCT TCGCTTAAAG ACCAGCGTGT AGCCGCGTCG AAAGTGCTGA GTGGTCCGCA GGCTCAACCG GCCGGTGATA AAGCGGAATT TATCGAAAAA GTGCGTCGCG CGTTGTACCT CGGTAAAATC GTTTCCTATG CTCAGGGCTT CTCCCAGCTG CGTGCAGCTT CTGATGAATA CAACTGGGAT CTTAACTACG GCGAGATCGC TAAGATTTTC CGCGCTGGCT GTATCATTCG TGCGCAGTTC CTGCAGAAGA TCACCGATGC TTATGCGCAA AACGCTGGCA TTGCTAACCT CTTGCTGGCG CCGTACTTCA AACAGATCGC TGATGACTAT CAGCAAGCGC TGCGTGATGT CGTGGCTTAT GCTGTGCAGA ATGGTATTCC GGTACCGACG TTCTCTGCTG CAATTGCCTA CTATGACAGC TACCGTTCTG CGGTTCTGCC GGCTAACCTG ATTCAGGCTC AGCGTGATTA CTTTGGTGCG CACACCTATA AGCGCACTGA TAAGGAAGGC GTTTTCCATA CTGAGTGGTT AGATTAA
|
Protein sequence | MSKQQIGVVG MAVMGRNLAL NIESRGYTVS VFNRSREKTE EVIAENPGKK LVPYYTVQEF VESLETPRRI LLMVKAGAGT DSAIDSLKPY LDKGDIIIDG GNTFFQDTIR RNRELSAEGF NFIGTGVSGG EEGALKGPSI MPGGQKEAYE LVAPILEQIA AVAEDGEPCV TYIGADGAGH YVKMVHNGIE YGDMQLIAEA YALLKGGLAL SNEELAQTFT EWNEGELSSY LIDITKDIFT KKDEEGKYLV DVILDEAANK GTGKWTSQSS LDLGEPLSLI TESVFARYIS SLKDQRVAAS KVLSGPQAQP AGDKAEFIEK VRRALYLGKI VSYAQGFSQL RAASDEYNWD LNYGEIAKIF RAGCIIRAQF LQKITDAYAQ NAGIANLLLA PYFKQIADDY QQALRDVVAY AVQNGIPVPT FSAAIAYYDS YRSAVLPANL IQAQRDYFGA HTYKRTDKEG VFHTEWLD
|
| |