Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4654 |
Symbol | |
ID | 3912472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 5264075 |
End bp | 5265763 |
Gene Length | 1689 bp |
Protein Length | 562 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637886559 |
Product | benzoyl-CoA-dihydrodiol lyase |
Protein accession | YP_488248 |
Protein GI | 86751752 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1024] Enoyl-CoA hydratase/carnithine racemase |
TIGRFAM ID | [TIGR03222] benzoyl-CoA-dihydrodiol lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGGGG AGGATCGCGT TCTTGCGAAC GGAGCGAGCC GGATCGACTT TCAGACCGAT CCGTCGCGCT ATCGGCACTG GAAGCTCGCG GTCGACGGCG AGGTTGCGAC CCTCACTATG GACGTCGACG AAAACGGCGG CCTGTTCGAG GGCTATCAGC TCAAGCTGAA TTCCTACGAC CTCGGCGTCG ACATCGAGCT TGCCGACGCG ATGCAGCGGC TGCGCTTCGA GTATCCGCAG GTGAAGGTGA TCCTGCTGCG CTCCGGCAAG AACCGGGTGT TCTGCGCCGG CGCCAACATC CGGATGCTGG CCGGCGCTAC CCACGCGCAT AAAGTCAATT TCTGCAAATT CACCAACGAG ACCCGCAACG GCTTCGAGGA TTCCTCCGAG CATTCCGGCC AGCGTAGCAT CGCGGTGATC AACGGCACTG CGGCCGGCGG CGGTTATGAA CTGGCGCTCG CCGCCGATCA CATCATTCTG GCCGACGACG GCTCGGCCTC CGTGGCGCTG CCGGAAGTCC CGCTGCTCGC GGTATTGCCC GGCACCGGCG GGCTGACGCG GGTGGTCGAC AAGCGCAAGG TGCGCCGCGA TCGCGCCGAC TTCTTCTGCA CCATCGAGGA AGGCATCAAG GGCAAGCGTG CCGTGCAGTG GCGGCTGGTC GACGAGATTG CGCCGAACAG CAAACTGGAC GGCATCGCCG CGGACCGCGC CAAGCAATTT GCCGCCTCGT CATCGCGCAA CGCCACTGGC CCAGGCATTG CGCTGACGCC GCTGTCGCGC AGCTTCCACG ATGCCGGCCT GCGCTACAAA TTCGTCGGCG TCGATCTCGA CCGCGACGCG CGCATCGCCA CGCTCTCGAT CGCGGGACCC GACGCTGCTC CGCCCGCCGA CATCGACGGC CTGATCGAAC AAGGCGCGTC GACCTGGTCG CTGCAGGTCG CGCGCGAACT CGACGACGCG ATTTTGCATC TGCGCATCAA CGAACTCGGC ATCGCGATGC TGGTGTTCAA GTCGCACGGC GACAGCGAAT TGGTGCTGGC GCACGACGCC TTCCTCGAGG CCAACAAGGC GCACTGGCTG GTCAACGAGA TCCGGCATTA CTGGAAGCGC GTGCTGAAGC GCGTCGACGT CACCTCGCGC ACGCTGGTGA CGCTGGTCGA GCCGGGCTCG TGCTTCGCCG GCACGCTGGC CGAACTCGTG TTCGCCGCCG ACCGCTCCTA CATGCTGATC GGAACGCGGC AGGGCGATAA CCGCGCCGCG CCGCAGCTCA CATTGAGCGC GATGAATTTC GGTCCCTATC CGATGAGCCA CGGCTTGACC CGGCTGCAGT CACGCTTCCA GTCCGACGGC GATGAGTTGG CTCAGGCGCA AGCGAAGATC GGCGAGCCGC TCGACGCCGA AGCCGCCGAC GAACTCGGTC TCGTCACCTT CGCGCTCGAC GACATCGACT GGGACGACGA GGTCCGCGTC TTTCTCGAGG AGCGCGCCTC GTTCTCGCCC GACAGCCTCA CCGGCATGGA AGCCAACCTG CGCTTCGTCG GCCCGGAGAC GATGGAGTCG AAGATCTTCT CGCGCCTCAC CGCCTGGCAG AACTGGATCT TCCAACGCCC CAACGCCGTC GGCGAGGACG GCGCACTGCG CCGCTACGGC ACCGGCCAGA AGGCGCAATT CGACATGACG CGGGTGTAG
|
Protein sequence | MAGEDRVLAN GASRIDFQTD PSRYRHWKLA VDGEVATLTM DVDENGGLFE GYQLKLNSYD LGVDIELADA MQRLRFEYPQ VKVILLRSGK NRVFCAGANI RMLAGATHAH KVNFCKFTNE TRNGFEDSSE HSGQRSIAVI NGTAAGGGYE LALAADHIIL ADDGSASVAL PEVPLLAVLP GTGGLTRVVD KRKVRRDRAD FFCTIEEGIK GKRAVQWRLV DEIAPNSKLD GIAADRAKQF AASSSRNATG PGIALTPLSR SFHDAGLRYK FVGVDLDRDA RIATLSIAGP DAAPPADIDG LIEQGASTWS LQVARELDDA ILHLRINELG IAMLVFKSHG DSELVLAHDA FLEANKAHWL VNEIRHYWKR VLKRVDVTSR TLVTLVEPGS CFAGTLAELV FAADRSYMLI GTRQGDNRAA PQLTLSAMNF GPYPMSHGLT RLQSRFQSDG DELAQAQAKI GEPLDAEAAD ELGLVTFALD DIDWDDEVRV FLEERASFSP DSLTGMEANL RFVGPETMES KIFSRLTAWQ NWIFQRPNAV GEDGALRRYG TGQKAQFDMT RV
|
| |