Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0843 |
Symbol | |
ID | 3969840 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 929336 |
End bp | 930754 |
Gene Length | 1419 bp |
Protein Length | 472 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637923959 |
Product | carboxylyase-like protein |
Protein accession | YP_530732 |
Protein GI | 90422362 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.22869 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCAAGA CAAGCAGCCT CCACGCCCTC GCCGGGCACA ACGCCACGCC GGATCTGCGC AGCTGGCTGC GGCAGCTCGA AGCCTGCGGC CGGTTGGCGC TGGCGCGCAG CGGCGTTGCG CTGATCGACG AACTGGCGGC GGTGGCCAAG ACGCTCGAGC GCGACACCGC GGTGCTGTTT CCGCAGCCCG GCCAGCACGC GATCCCGGTG GTCGCCAATA TCTTCGCCGA CCGCAGCTGG GTCGCCGACT CGCTCGGCGT GCCGACCGAT CAATTGCTGA CGCGGTTTCA GGACGCGGTG CGGCACCCGT TGCCGTGGGT CGAAGTCGAC GAGGCCCCGG CGCAGCAGGT GATCCACCGC GAGGTCGATC TACTGACCCA ATTGCCGATC CCCAAGCACA ATGAGCACGA CAGCGGGCCC TATATCACTG CCGCATTGTT GATCGCGCGC AATCCGGTGA CCGGAATCCA GAACGTCTCG ATCCATCGCT GCCAGGTCAG CGGCCCGGAC CGGATCGGCG TGCTGCTGCT GCCGCGTCAC ACGCTGCATT ACTTCAAGAT GGCGGAGCAG GCCGGGCAGG CGCTGGAGAT CGCGCTGGTG ATCGGGGTAC ATCCGGCCTG CATCCTGGCG TCGCAGGCGA TCGCCGCGGT CGACGAAGAC GAGATGGAAA TCGCCGGCGC GCTGCTCGGC CATCCGATCG AGATGGTGAA GTGCCGCACC AATCAGGTGC GGGTGCCGGC GCACGCCGAG ATCGTGATCG AGGGCCGCAT CCTGCCGAAA CTGCGCGAGC CGGAAGGCCC GTTCGGCGAA TTCCCGCAAT ATTACGGCCC GCGCGCCGAT CGCGAGGTGA TCCAGGTCGA CGCCATCACC CATCGCGCCA ACCCGATTTT CCACACCATC GTCGGCGGCG GCATGGAGCA TCTGGTGCTC GGCGAAATCC CCCGCGAGGC GACGCTGCTG GAGCATCTGC AACGCAGCTT TTCCAGCGTG CGCGACGTCC GGCTGACGCG CGGCGGGGTG TGCCGCTATC ACCTGGTGGT CAAGATCGAC AAGACCAGCA ATGGCGAGCC GAAGAACATC ATCATGGGGG CGTTCGGCGG ACATTACGAC TTGAAGCAGG TGGTGATCGT CGACATGGAC GTCGATATCG ACGATCCGCA CGAGATCGAA TGGGCGATCG CGACCCGCTT CCAGGCCGAT CGCGATCTCT TGGTGGTGTC CGGCGCGCAG GGCTCCAAGC TCGATCCGAC CAGCGACGGC GGCATCAGCG CCAAGATGGG GCTCGACGCC ACCAAGCCGA TCGAAGCCGA GCCGATGGTG TTCAAGCGGA TTCACGTCCA TGGTCTGGAA GACGTCGATC TGTCGCGCGT GCTGCAGAGC GACTCGAAGG CGGCGCTGGC GCGGATCATC TGCGGCTGA
|
Protein sequence | MTKTSSLHAL AGHNATPDLR SWLRQLEACG RLALARSGVA LIDELAAVAK TLERDTAVLF PQPGQHAIPV VANIFADRSW VADSLGVPTD QLLTRFQDAV RHPLPWVEVD EAPAQQVIHR EVDLLTQLPI PKHNEHDSGP YITAALLIAR NPVTGIQNVS IHRCQVSGPD RIGVLLLPRH TLHYFKMAEQ AGQALEIALV IGVHPACILA SQAIAAVDED EMEIAGALLG HPIEMVKCRT NQVRVPAHAE IVIEGRILPK LREPEGPFGE FPQYYGPRAD REVIQVDAIT HRANPIFHTI VGGGMEHLVL GEIPREATLL EHLQRSFSSV RDVRLTRGGV CRYHLVVKID KTSNGEPKNI IMGAFGGHYD LKQVVIVDMD VDIDDPHEIE WAIATRFQAD RDLLVVSGAQ GSKLDPTSDG GISAKMGLDA TKPIEAEPMV FKRIHVHGLE DVDLSRVLQS DSKAALARII CG
|
| |