Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_3196 |
Symbol | |
ID | 3971994 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | - |
Start bp | 3536836 |
End bp | 3537747 |
Gene Length | 912 bp |
Protein Length | 303 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 637926306 |
Product | catechol 1,2-dioxygenase |
Protein accession | YP_533057 |
Protein GI | 90424687 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3485] Protocatechuate 3,4-dioxygenase beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.42127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.302441 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCACT TCGACGACAG CGAACTCACC GCGGCGGTGG TCGGCAGCTT TGCGGAAGCC AACGACCCGC GGTTGAAATT CCTCATGGAA GAATTGGTGA CGTCGCTGCA CGGCTTCGTG CGGCGGACCA ATCTGAGCTT CGAGGAATGG GGCGCCGCGC TGGAGTTTCT GACCCGCACC GGACAGAAAT GCAGCGCCAC GCGGCAGGAA TTCATCCTGC TCTCGGACGT GCTCGGGGTG TCGATGCTGG TCGACGCGGT CAACCATCGC GATCGCGGCG GCGCCACCCA GACCACGGTG CTGGGGCCGT TCTATGTCGG CGAGCACAGG CCGTTGCCGC ATGGCGCGGA TATTTCGGAA AACGTCGAGG GCGACCGCAT GTTCGTGCAA TGCCGGGTCA CCGACCTTGC CGGCAAGCCG CTGGCCGGGG TGACGGTCGA TGTCTGGCAC GCCGACGACG AGGGCTTCTA CGATTCGCAG AAACCGTCCT ATGCCAGCGA CGGCCCGTCG CTGCGGGCGC GCTTCACCAC CGATGCGGGT GGTCGGCTGT TCTTTCGCAC CATCTTGCCG TGCAGCTATC CGATCCCGAT CGACGGCCCG GTGGGCGAAT TGATCCTGGC CACAAGGCGG CATCCGATGC GGCCGGCCCA TGTGCACTTC CTGCTCGACG CCAAGGGCTA TGAGCCGCTG GTCACCCACG TCTTCATCGA GGGCGACAAG TATCTGGAGT CCGACGTGGT GTTCGGAGTG AAGCAGGAAC TGATCTCGAC CATCGAGTTG CGCACCGACG CGACGATGCC GGACGGGCTG CCGGCGCCGG GGCCGTGGCA TCTGATGACC TATGATTTCC GGCTGAAGCC CGGCAAGGGC GTGGCGCCGA AGCCGATGAT CGCGGTCAGC GTCGATGCCT GA
|
Protein sequence | MPHFDDSELT AAVVGSFAEA NDPRLKFLME ELVTSLHGFV RRTNLSFEEW GAALEFLTRT GQKCSATRQE FILLSDVLGV SMLVDAVNHR DRGGATQTTV LGPFYVGEHR PLPHGADISE NVEGDRMFVQ CRVTDLAGKP LAGVTVDVWH ADDEGFYDSQ KPSYASDGPS LRARFTTDAG GRLFFRTILP CSYPIPIDGP VGELILATRR HPMRPAHVHF LLDAKGYEPL VTHVFIEGDK YLESDVVFGV KQELISTIEL RTDATMPDGL PAPGPWHLMT YDFRLKPGKG VAPKPMIAVS VDA
|
| |