Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_0679 |
Symbol | aroB |
ID | 4689702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 723057 |
End bp | 724175 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 639833673 |
Product | 3-dehydroquinate synthase |
Protein accession | YP_980919 |
Protein GI | 121603590 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0337] 3-dehydroquinate synthetase |
TIGRFAM ID | [TIGR01357] 3-dehydroquinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.241981 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGCAA ATCCTCCTTC TCTCGCAACG GCCCAGGTGC AAATCAACCT GGCCGAACGC AGCTATCCCA TCCTGATTGG CACTTCACTG CTGGCCAATG CACTGACCTA TCAGCATCTG CCCCAAGCCG CAACGGCACT CGTGGTGTCC AACACGACCG TGGCGCCCCT GTACGCGGCG CAATTGACTG AAGCGCTGCA GGCGCACTAC GGCAAGGTTC TGCTGGTGAC CCTGCCCGAT GGCGAAGTCC ACAAGGACTG GCCGACCCTG CAACTGATTT TTGATGCGCT GCTTGAAAAC GGCTGCGACC GCAAGACGGT GCTTTTCGCG CTCGGCGGCG GTGTGGTGGG CGACATGACC GGCTTTGCGG CGGCCAGCTA CATGCGCGGC GTGCCGTTTG TGCAGGTGCC GACAACCCTG CTGGCCCAGG TGGACTCGTC GGTCGGTGGC AAGACCGCGA TCAATCACCC GCTGGGCAAG AACATGATTG GCGCGTTCTA CCAGCCCCAG CAGGTAATCT GCGACCTGGA GGTGCTGAAG ACCTTGCCCG ACCGTGAACT GAGCGCCGGA CTGGCCGAAG TCATCAAGTA CGGGCCGATT GCCGACATGG CCTTCCTCGA CTGGATCGAA GCCAACCTGG ATGCGCTGCT GGCCAAGGAG CCTGCCGCGC TGGCGCACGC CATTCAGCGC AGCTGCGAGA TCAAGGCCTG GGTCGTCGGC CAGGATGAGC GCGAGTCGGG TCTGCGGGCG ATTCTGAATT TCGGCCACAC CTTTGGCCAT GCGATTGAAT CCGGGCTGGG CTATGGCGAA TGGCTGCACG GCGAAGGCGT GGGCTGCGGC ATGGTGATGG CCGCGCACCT GTCGCAGCGC CTGGGCCGGA TTGACATGGC GTTTGTGCAG CGCCTGACCA CGCTGATCCA GCGCGCCGGA CTGCCGGTCA AGGCGCCGCT GCTCTCAAGC ACGGACAATG CAGGCCGCTA CCTCGACCTG ATGCGGATTG ACAAGAAATC CGAAGCCGGC GAGATTCGCT TCGTGGTGAT TGATGGACCG GGCAAGGCCG CCGTGTGCGC CGCGCCCGAT GCCGTGGTGC GTGAAGTCAT CGACTTGTGC TGCGCCTGA
|
Protein sequence | MQANPPSLAT AQVQINLAER SYPILIGTSL LANALTYQHL PQAATALVVS NTTVAPLYAA QLTEALQAHY GKVLLVTLPD GEVHKDWPTL QLIFDALLEN GCDRKTVLFA LGGGVVGDMT GFAAASYMRG VPFVQVPTTL LAQVDSSVGG KTAINHPLGK NMIGAFYQPQ QVICDLEVLK TLPDRELSAG LAEVIKYGPI ADMAFLDWIE ANLDALLAKE PAALAHAIQR SCEIKAWVVG QDERESGLRA ILNFGHTFGH AIESGLGYGE WLHGEGVGCG MVMAAHLSQR LGRIDMAFVQ RLTTLIQRAG LPVKAPLLSS TDNAGRYLDL MRIDKKSEAG EIRFVVIDGP GKAAVCAAPD AVVREVIDLC CA
|
| |