Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnap_4121 |
Symbol | |
ID | 4686188 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polaromonas naphthalenivorans CJ2 |
Kingdom | Bacteria |
Replicon accession | NC_008781 |
Strand | + |
Start bp | 4409056 |
End bp | 4410153 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639837133 |
Product | phospho-2-dehydro-3-deoxyheptonate aldolase |
Protein accession | YP_984332 |
Protein GI | 121607003 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0722] 3-deoxy-D-arabino-heptulosonate 7-phosphate (DAHP) synthase |
TIGRFAM ID | [TIGR00034] phospho-2-dehydro-3-deoxyheptonate aldolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.752451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.115252 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACCC ACGTCCGCCC TGCCACGCCG CTTTCCACCC ACGACACCAC TCGCATCGAC GACCTGCGCA TCGGCGCGGT GCGTCCGCTG ATCACGCCCG CGCTGCTGCA GGAATGGCTG CCCACGCCGG TCAGCGTCCA GGCGCTGGTC GCGGCCAGCC GCGCCGCGAT CTCGCGTGTG CTGCACGGCG CCGACGACCG GCTGGTGGTC GTGGTCGGGC CGTGTTCCAT CCACGACCAT GCGCAGGCCA TGGACTACGC CCGCCAATTC AAGGCGCAGG CCGATGCGCT CAAGGACGAT TTGCTGGTCG TGATGCGCGT GTATTTTGAA AAGCCGCGCA CCACCGTTGG CTGGAAGGGC TACATCAACG ACCCGCACCT GGACGGCAGC TTTGCCATCA ACGAAGGGCT GGAAATGGCG CGCCAGCTGC TGCTCGACGT GCTGGCGCTC GGCCTGCCGG TGGGCACCGA ATTCCTTGAT CTGCTGAGCC CGCAGTTCAT CAGCGACCTG GTGAGCTGGG GCGCGATTGG CGCGCGCACC ACCGAAAGCC AGAGCCACCG CCAGCTCGCC AGCGGCCTGT CGTGCCCGGT CGGCTTCAAG AACGGCACCG ACGGCGGCGT GAAGGTGGCA GCCGACGCCA TCCAGGCGGC GCAGGCCACG CACGCCTTCA TGGGCATGAC CAAGATGGGC CAGGCGGCAA TTTTTGAAAC CCGGGGCAAT GACGACTGCC ATGTGATTCT GCGCGGCGGC AAGCAGACCA ATTATTCAAA GGCCGACGTG GACGCGACCT GCGCGCAACT CAGGGCCGCC GGCCTGCGCG AGCAGGTGAT GATCGACGTG TCGCACGCCA ACAGCAGCAA GCAGCACCAG CGGCAAATCG AAGTCGCCGC CGACGTGGCG AGCCAGGTGG CGGCGGGCGA CCACCGCATC ATGGGCCTGA TGATTGAAAG CCACCTCAAC GAAGGCCGGC AGGACATCGT CGCCGGCCAG CCCTTGAAGC ACGGCGTGTC GGTGACCGAT GCGTGCATCA GTTTTGCGCA GACCGTGCCC GTGCTGCAAG GGCTGGCGGC GGCGGTGCGG GCGCGGCGCC TGGCTTGA
|
Protein sequence | MNTHVRPATP LSTHDTTRID DLRIGAVRPL ITPALLQEWL PTPVSVQALV AASRAAISRV LHGADDRLVV VVGPCSIHDH AQAMDYARQF KAQADALKDD LLVVMRVYFE KPRTTVGWKG YINDPHLDGS FAINEGLEMA RQLLLDVLAL GLPVGTEFLD LLSPQFISDL VSWGAIGART TESQSHRQLA SGLSCPVGFK NGTDGGVKVA ADAIQAAQAT HAFMGMTKMG QAAIFETRGN DDCHVILRGG KQTNYSKADV DATCAQLRAA GLREQVMIDV SHANSSKQHQ RQIEVAADVA SQVAAGDHRI MGLMIESHLN EGRQDIVAGQ PLKHGVSVTD ACISFAQTVP VLQGLAAAVR ARRLA
|
| |