Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PG0885 |
Symbol | |
ID | 2552917 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Porphyromonas gingivalis W83 |
Kingdom | Bacteria |
Replicon accession | NC_002950 |
Strand | + |
Start bp | 947885 |
End bp | 948985 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 637149613 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase/chorismate mutase |
Protein accession | NP_905132 |
Protein GI | 34540653 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1605] Chorismate mutase [COG2876] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR01361] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTACT GTGATTTTAC GCCGTTGCCT CTCCCTTCGG AGCCTAATAC GACAGTCATT GCCGGTCCTT GCAGTGCAGA AAGTGAGGAG CAGATAATGA CTACTGCTCG TGCCCTCAGG GATGAAGCAG GCATTCGTAT TTTTCGTGCC GGTCTGTGGA AACCTCGTAC CTTGCCGGGG TGCTTCGAAG GAGTAGGAGA AACAGGGCTA CCTTGGTTGG TGCGTGTACA GGATGAATTG GATATGCTTG CTACCACGGA AGTGGCTACT CGCGAACACG TAGAGCAAGC CATGCAAGCC GGTATCAGAA TACTCTGGTT AGGTGCACGA ACCACATCCA ATCCCTTTGC TGTACAAGAA ATTGCCGATA CGATAGGCAA GGACGAATCG GTGATTGTCC TCGTCAAGAA TCCGATCAGT CCCGATTTGG ATCTGTGGAC AGGAGCCCTA GAACGGCTTC GGCAGTCCGG AGTTCGACAG ATCGGAGCCA TCCATAGAGG ATTCAGTACC TATGCGACCA AGACGTTTCG CAATCCTCCA CATTGGCAGA TTCCCTTCGA TTTGAAAAGA CGTTTTCCTT CGTTAACCAT CCTTTGTGAT CCGAGTCATA TTACGGGACA GAGAGATCGG ATCGAATCCG TCAGCCAGCA AGCCATGGAA ATGAATTTCG ACGGGCTAAT TATCGAGTCG CATTGCTGCC CGGATAAGGC TCTAAGCGAT GCAAGCCAGC AGATAACACC TACTGTACTT GCCCAAATCC TCCGGCGACT GCGTATCCCA CGGCGCCAAT CCGAAAAGCA AGACGAAGAG CTGATCTCTT GGCGCATGCA GATTGATCAG ATAGATGAGA GTATAGTGGA ATTGCTAGCT CGGCGGATGC AAGTGGCATA CGAGATAGGT TTGTTCAAAA AAGAGCACAA TCTGGCTGTG GTTCAGAATC TCCGCTACGA ACAACTACAG CGCAACCGTG CCCGTACTGC AGCCCTCTTA GGTTTGGACG AAACATTTAT ATCGGAGCTG TTCAGCCGTA TTCATGAGGA ATCTGTCCGT CTGCAGACCC TTGCCCCCCA AAAGCCACAC ACCGACGACT GTATATCATG A
|
Protein sequence | MKYCDFTPLP LPSEPNTTVI AGPCSAESEE QIMTTARALR DEAGIRIFRA GLWKPRTLPG CFEGVGETGL PWLVRVQDEL DMLATTEVAT REHVEQAMQA GIRILWLGAR TTSNPFAVQE IADTIGKDES VIVLVKNPIS PDLDLWTGAL ERLRQSGVRQ IGAIHRGFST YATKTFRNPP HWQIPFDLKR RFPSLTILCD PSHITGQRDR IESVSQQAME MNFDGLIIES HCCPDKALSD ASQQITPTVL AQILRRLRIP RRQSEKQDEE LISWRMQIDQ IDESIVELLA RRMQVAYEIG LFKKEHNLAV VQNLRYEQLQ RNRARTAALL GLDETFISEL FSRIHEESVR LQTLAPQKPH TDDCIS
|
| |