Gene Information |
Locus tag | RPD_4328 |
Symbol | |
ID | 4024852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | + |
Start bp | 4794723 |
End bp | 4796261 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637964537 |
Product | carboxylyase-like protein |
Protein accession | YP_571446 |
Protein GI | 91978787 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
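The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon
    that is not counted as a residue."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record for locus tag RPD_4328.
length = gene_length(4794723, 4796261)
print(length, protein_length(length))
```

Under these assumptions the result is 1539 bp and 512 aa, matching the Gene Length and Protein Length fields.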
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.568607 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAACC GCGTTAAACC GCCTTTTCCG GACCTTCGCG CGTTCGCCGC CTATCTGGAA TCCCGCGGAC AGTTGCACCG CATCCGCAAG CCGGTCTCGG TGGTGCACGA CCTCACCGAG ATTCATCGCC GCGTGCTGCA CGCCGGCGGC CCGGCGCTGA TGATCGAACA GCCGATCAAG GCCGATGGCA CGCCGTCCGA GATGCCGATG CTGGTCAATC TGTTCGGCAC CGTCGAGCGG GTGGCGTGGG GCCTCGGCAT CCTGCCCGAG AATCTGCCGC AGCTCGGCGA GGCACTCGCC GAAATGCGCG AGCCGGCGCC GCCGCAGAGC CTGACCGACG CGCTGAGCAA GCTGCCGATG GCGAGGGCCG CCTTGTCGAT GCGCCCGAAG ATGGCCCGGA CGGCGCCCGC GCAGGACGTG GTGCTGACCG GCGACGCGGT CGATCTCGGC CGGCTGCCGG TGCAGATCCC GTGGCCGGGC GAGCCGGCGC CGCTGATCAC CTGGCCGCTG GTGTTCACCA AGCCGCCGCC CGGCGTGGCC GGCACCGACA ATGTCGGCGT TTACCGGATG CAGGTGCTGG GCAAGGATCG CGTCATCATG CGCTGGCTGG CGCATCGCGG CGGCGCCAAG CATCACCATC AATGGGCGGC GCAGAAGCGC GAGATGCCGG TCGCGGTGGT GATCGGCGCC GATCCGTCGA TGATCCTGTC GGCGGTGCTG CCGCTGCCGG AGACGGTGTC CGAAATCAAG TTCTCGGGCC TGCTGCGCGG CGACCGGCCG AGCCTGACGC CCTGCGTCAG CATTCCGCTC AACGTGCCGG CCGATGCCGA GATCGTGCTG GAGGGCCTGG TGTCGCCGAC CGAGACCGCG CCGGAGGGGC CGTATGGCGA CCACACCGGC TATTACAACG CGGTCGAGGA ATTCCCGGTG ATGCGGATCA CCGCGATCAC GATGCGGCGC AACCCGATCT ATCTCTCGAC CTACACCGGA CGGCCGCCGG ACGAGCCGTC GCGGCTCGGC GAAGCGCTCA ACGACGTGTT TCTGCCGGTG GCGCGGCGGC AGTTTCCCGA GATCGTCGAT CTGTGGCTGC CGCCGGAAGC GTGCTCGTAC CGAATCGCGG TCGCCTCGAT CAAGAAACGT TATCCCGGCC AGGCGCGGCG GCTGATGATG GGGCTGTGGT CGATGCTGCC GCAGTTCAGC TACACCAAGC TCCTGATCAT CGTCGACGAC GACGTCGACG TCCGCGAATG GGCCGATGTG ATGTGGGCGG TGTCGACCCG CTGCGACACC TCGCGCGACA TGGTGGCGAT CAGCGACACC CCGATCGACT ATCTCGACTT CGCCTCGCCG AAATCCGGAC TCGGGGGCAA ACTCGGCATC GACGCCACCA ACAAGATCGG CACCGAGACC GAGCGCGAAT GGGGCAAGGT GCTGGAGATG GACGACGATG TGATCGCGCG CGTCGACGCG ATGTGGTCCG GCCTCGGGCT CGCGCCCGCG TTGACGCCGG CCGCGGCGCA ACGCAAGCTG CTGCGATGA
|
Protein sequence | MLNRVKPPFP DLRAFAAYLE SRGQLHRIRK PVSVVHDLTE IHRRVLHAGG PALMIEQPIK ADGTPSEMPM LVNLFGTVER VAWGLGILPE NLPQLGEALA EMREPAPPQS LTDALSKLPM ARAALSMRPK MARTAPAQDV VLTGDAVDLG RLPVQIPWPG EPAPLITWPL VFTKPPPGVA GTDNVGVYRM QVLGKDRVIM RWLAHRGGAK HHHQWAAQKR EMPVAVVIGA DPSMILSAVL PLPETVSEIK FSGLLRGDRP SLTPCVSIPL NVPADAEIVL EGLVSPTETA PEGPYGDHTG YYNAVEEFPV MRITAITMRR NPIYLSTYTG RPPDEPSRLG EALNDVFLPV ARRQFPEIVD LWLPPEACSY RIAVASIKKR YPGQARRLMM GLWSMLPQFS YTKLLIIVDD DVDVREWADV MWAVSTRCDT SRDMVAISDT PIDYLDFASP KSGLGGKLGI DATNKIGTET EREWGKVLEM DDDVIARVDA MWSGLGLAPA LTPAAAQRKL LR
|
| |
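The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, translate the DNA, and compute a GC percentage. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed; alternative start codons are not handled here (this gene starts with ATG). All names are illustrative, not taken from any library.

```python
# Standard codon-to-amino-acid assignments in TCAG order
# (assumed to be shared by translation table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace grouping and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.
    Raises KeyError on characters other than A, C, G, T."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence from the record.
head = "ATGCTGAACC GCGTTAAACC GCCTTTTCCG"
print(translate(head))
```

Applied to the first 30 bases of the gene sequence, `translate` yields `MLNRVKPPFP`, the first block of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.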