Gene Information |
Locus tag | RPD_0880 |
Symbol | |
ID | 4021354 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 990962 |
End bp | 992650 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637961070 |
Product | benzoyl-CoA-dihydrodiol lyase |
Protein accession | YP_568019 |
Protein GI | 91975360 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1024] Enoyl-CoA hydratase/carnitine racemase |
TIGRFAM ID | [TIGR03222] benzoyl-CoA-dihydrodiol lyase |
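The coordinate and length fields above are related by simple arithmetic, and a short check can confirm they agree. The sketch below assumes 1-based inclusive coordinates and that the protein length excludes the stop codon; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above (Start bp, End bp).
gene_len, protein_len = cds_lengths(990962, 992650)
print(gene_len, "bp;", protein_len, "aa")
```

With the Start bp and End bp listed here, this reproduces the stated Gene Length (1689 bp) and Protein Length (562 aa).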
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.436676 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.55485 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGGG AGGATCGCGT TCTTGCCAAC GGAGCGAGCC GGATCGACTT TCAAACCGAT CCGTCGCGCT ATCGGCACTG GACCCTCGCG GTCGACGGCG ACGTCGCGAC GCTCACCATG GATGTCGACG AGAATGGCGG CCTGTTCGAG GGCTATCAGC TCAAGCTGAA TTCCTATGAC CTCGGTGTCG ACATCGAGCT TGCCGACGCG ATGCAGCGGC TGCGTTTCGA GCACCCGCAG GTGAAGGTCA TCCTGCTGCG CTCCGGCAAG AACCGGGTGT TCTGCGCCGG CGCCAACATC CGGATGCTGG CCGGCGCGAC CCATGCCCAC AAGGTCAATT TCTGCAAATT CACCAACGAG ACCCGCAACG GCTTCGAGGA TTCCTCCGAA CATTCCGGTC AGCGCACCAT CGCGGTGATC AACGGCACCG CGGCCGGCGG CGGCTACGAA TTGGCGCTCG CCGCCGATCA CATCATCATG GCCGACGACG GCGCCGCGGC CGTGGCACTG CCGGAAGTGC CGCTGCTGGC GGTGCTGCCC GGCACCGGCG GGCTGACGCG GGTGGTCGAC AAGCGCAAGG TGCGGCGCGA CCGCGCCGAC TTCTTCTGCA CCATCGAGGA AGGCATCAAG GGCAAGCGCG CAGTGCAGTG GCGCCTCGTC GACGAGATCG CCCCGAACAG CAAGCTCGAA GCCAGGATCG CCGAGCGCGC GCAGCAATTC GCCGCGTCGT CCTCCCGCAA TGGCGGCGGC AAGGGCATCG CGCTGACGCA GCTTCCGCGC AGCTTCGACG AGGCCGGCGT GCGCTATCAA TTCGTCAGCG TCGACATCGA CCGCGCGGCG CGGATCGCGA CGATCTCGAT CGCGGGCCCG CAATCCGGTC CTCCTGCCGA TATCGACGGT CTGATCGCAC AAGGCGCGTC GACCTGGTCA CTGCAGGTCG CGCGCGAACT CGACGACGCC ATCCTGCATC TGCGCATCAA CGAACTCGGC ATTGCGATGC TGGTGTTCAA GTCGCACGGC GACAGCGATC TGGTGCTGGC GCACGACGCC TTCCTGGAAG CCAACAAGGC GCATTGGCTG GTGAACGAGA TCCGGCACTA CTGGAAGCGC GTGTTGAAGC GGATCGACGT CACCTCGCGC ACGCTGGTGA CGCTGGTCGA GCCGGGCTCG TGCTTCGCCG GCACGCTGGC CGAACTCGTA TTCGCCGCCG ACCGCTCCTA CATGCTGATC GGCGCCCGCC AGGGCGACAA CCGCGCCGCG CCGCAGTTGA CGCTGAGCGC GATGAATTTC GGTCCCTATC CGATGAGTCA CGGCCTGACG CGGCTGCAGT CGCGCTTCCA GGCCGATGCC GATGAGCTTG TCCAGGCGCA GGCGAACATC GGCGAGCCGC TGGATGCCGA GGCCGCCGAC GAACTCGGCC TCGTCACCTT CGCGCTCGAC GACATCGACT GGGACGACGA GGTTCGCGTG TTTCTTGAGG AGCGCGCCAG CTTCTCGCCC GACAGCCTCA CCGGCATGGA AGCCAATCTG CGCTTCGCCG GCCCTGAGAC GATGGAATCC AAGATCTTCT CGCGCCTCAC CGCGTGGCAG AACTGGATCT TCCAGCGCCC CAACGCCGTC GGCGAAGACG GCGCGCTCCG CCGCTACGGC ACCGGCCAGA AGGCGCAATT CGACATGACG CGGGTGTGA
Protein sequence | MAGEDRVLAN GASRIDFQTD PSRYRHWTLA VDGDVATLTM DVDENGGLFE GYQLKLNSYD LGVDIELADA MQRLRFEHPQ VKVILLRSGK NRVFCAGANI RMLAGATHAH KVNFCKFTNE TRNGFEDSSE HSGQRTIAVI NGTAAGGGYE LALAADHIIM ADDGAAAVAL PEVPLLAVLP GTGGLTRVVD KRKVRRDRAD FFCTIEEGIK GKRAVQWRLV DEIAPNSKLE ARIAERAQQF AASSSRNGGG KGIALTQLPR SFDEAGVRYQ FVSVDIDRAA RIATISIAGP QSGPPADIDG LIAQGASTWS LQVARELDDA ILHLRINELG IAMLVFKSHG DSDLVLAHDA FLEANKAHWL VNEIRHYWKR VLKRIDVTSR TLVTLVEPGS CFAGTLAELV FAADRSYMLI GARQGDNRAA PQLTLSAMNF GPYPMSHGLT RLQSRFQADA DELVQAQANI GEPLDAEAAD ELGLVTFALD DIDWDDEVRV FLEERASFSP DSLTGMEANL RFAGPETMES KIFSRLTAWQ NWIFQRPNAV GEDGALRRYG TGQKAQFDMT RV
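The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch of how such a field might be processed, the helpers below join the blocks, compute the GC fraction, and check that a gene sequence begins with a start codon and ends with a stop codon. All function names are hypothetical, and the start codon set is an assumption covering only the most common bacterial starts, not the full list for translation table 11.

```python
def join_blocks(blocked):
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(seq,
                       starts=("ATG", "GTG", "TTG"),   # assumed common starts
                       stops=("TAA", "TAG", "TGA")):
    """True if seq is a whole number of codons with a start and a stop."""
    return (len(seq) % 3 == 0
            and seq[:3] in starts
            and seq[-3:] in stops)


# First two blocks of the gene sequence above, used only as a small demo.
demo = join_blocks("ATGGCCGGGG AGGATCGCGT")
print(len(demo), gc_fraction(demo))
```

Applied to the full Gene sequence field, `gc_fraction` can be compared against the GC content listed in the Gene Information table, and `has_start_and_stop` against the ATG start and TGA stop visible at the two ends of the sequence.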