Gene Information |
Locus tag | RPD_2210 |
Symbol | |
ID | 4022695 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 2473580 |
End bp | 2474488 |
Gene Length | 909 bp |
Protein Length | 302 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637962405 |
Product | catechol 1,2-dioxygenase |
Protein accession | YP_569346 |
Protein GI | 91976687 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | |
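
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the 909 bp CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the record above.
START_BP = 2473580
END_BP = 2474488


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp):
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(gene_length(START_BP, END_BP))  # 909
print(protein_length(909))            # 302
```

Under these assumptions both results agree with the Gene Length and Protein Length rows of the record.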
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0438906 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.697276 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCAAT TCGACGAGAC CGGACTGACC GAAGCGGTCG TCGCAAGCTT CGACGAGACG CCTGATCCGC GGCTGAAGCA CCTGATGCGC GAGTTGGTCC GCTCGCTGCA CGACTATGTT CGCCGCACCG GGCTGACCTT CGATGAATGG CAAAAGGCGA TCGATTTCCT CACCCGCACC GGGCAGAAAT GCTCGCCGAG CCGGCAGGAA TTCATCCTGC TGTCGGACGT GCTCGGCGTC TCGATGCTGG TCGACGCGGT CAATCATCGC GAGCGCGATG GCGCGACCCA AACGACGGTG CTCGGACCGT TCTATGTCGG CGAGCACAGG CTGATGCCGC ACGGCGCCGA TATCTCCGAG GGCGTCGAGG GGGAGCGGAT GTACGTCGAA AGCCGCGTGA CCGATCTTGC CGGTGAACCG CTGGCCGGCG TGCCGATCGA TGTCTGGCAT GCCGATGACG ACGGTTTCTA CGATACGCAG AAGCCGTCCT ACGCGCAGGC CGGCCCGTCG CTGCGGGCGC GGTTCATCAC CGATTCCGAC GGGCGGTTCA CGTTCCGAAC CATCCTGCCG TGCAGCTATC CGATCCCGAT TGACGGGCCG GTCGGCGATC TGATCCTCGC CACGCAGCGG CATCCGATGC GCCCGGCGCA TGTGCATTTT CTGGTCAACG CCCCGGGATA CGAGCCGCTG ATCACGCATG TGTTCATCGA GGGCGACAAA TATCTCGAAA GCGATGTCGT GTTCGGCGTC AAGGACGACC TGATTTCGAC CATCGAAATG CGCAATGATG CGGTGATGCC CGATGGCCGC GAGGCGCCGG GGCCGTGGCA TCTGATGACC TATGAATTCC GGCTGAAGCC GGGCAGGGGC GCGGCGCCGA AGCCGATCTT GGCGGCTGCT GCGGAGTAA
|
Protein sequence | MPQFDETGLT EAVVASFDET PDPRLKHLMR ELVRSLHDYV RRTGLTFDEW QKAIDFLTRT GQKCSPSRQE FILLSDVLGV SMLVDAVNHR ERDGATQTTV LGPFYVGEHR LMPHGADISE GVEGERMYVE SRVTDLAGEP LAGVPIDVWH ADDDGFYDTQ KPSYAQAGPS LRARFITDSD GRFTFRTILP CSYPIPIDGP VGDLILATQR HPMRPAHVHF LVNAPGYEPL ITHVFIEGDK YLESDVVFGV KDDLISTIEM RNDAVMPDGR EAPGPWHLMT YEFRLKPGRG AAPKPILAAA AE
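
As a rough illustration of how the two sequence rows relate, the sketch below translates a space-separated nucleotide string and computes its GC percentage. It relies on the fact that translation table 11 assigns amino acids to codons exactly as the standard code does; the two differ only in permitted start codons, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only short fragments of the gene are used as input.

```python
# Standard codon table in NCBI order (bases T, C, A, G); table 11 uses
# the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates 10-letter blocks; uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 20 bases of the gene sequence above.
print(translate("ATGCCTCAAT TCGACGAGAC"))  # MPQFDE
# Last 18 bases of the gene sequence above.
print(translate("GCGGCTGCT GCGGAGTAA"))    # AAAAE
```

The two fragments translate to the first six and last five residues of the protein sequence row. Passing the full gene sequence to `gc_percent` should give a value comparable to the GC content row, which the record reports rounded to a whole percent.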
|
| |