Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_4478 |
Symbol | |
ID | 3912294 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 5067751 |
End bp | 5069289 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637886381 |
Product | carboxylyase-like protein |
Protein accession | YP_488072 |
Protein GI | 86751576 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases |
TIGRFAM ID | [TIGR00148] UbiD family decarboxylases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0662178 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGAGCC GCGTCAAACC GCCCTTTCCG GACCTCCGCG CTTTCGCCGC CTATCTGGAA TCCCGCGGGC AGTTGCACCG GATCAAGAAG CCGGTCTCGG TGGTCCACGA ACTGACCGAA ATCCATCGCC GCGTGCTGCA CGCCGGCGGC CCGGCGCTGC TGATCGAACA TCCGGTCAAG GCCGACGGCA CGCCGTCGGA GATGCCGATT CTGGTCAATT TGTTCGGCAC CGTCGAGCGG GTGGCGTGGG GGCTCGGCAT CGAACCGCAA AATCTGTCGG CGCTGGGCGA AGCGCTCGCC GAAATGCGCG AGCCGGCGCC GCCGCAAAGC CTCACCGACG CATTGAGCAA ATTGCCGATG GCCCGCGCAG CGCTGTCGAT GCGGCCGAAG ACCGCCAAGA CCGCACCGGC GCAGGACGTG GTGCTGACCG GGGACGCGGT CGATCTCGGC CGGCTGCCGA TCCAGATTCC GTGGCCGGGC GAGCCGGCGC CGCTGATCAC CTGGCCGCTG GTGTTCACCA GGCCGCCACC CGGCGCGCCC GGCACCGACA ATGTCGGCGT CTACCGCATC CAGGTGCTGG GCAAGGATCG CATCATCATG CGCTGGCTGG CGCATCGCGG CGGCGCCAAG CATCACCACC AATGGAAGGC CGAGGGGCGC GAGATGCCGG TCGCGATCGT GATCGGCGCC GACCCGGCGA TGATCCTGTC GGCGGTGCTG CCGCTGCCGG AGAATATCTC CGAGATCAAA TTCTCCGGGC TGCTGCGCGG CGACCGCCCG AGCATGACGC CCTGCGTCGG GATTCCGCTG AACGTGCCGG CCGACGCCGA GATCGTGCTG GAAGGCTTCG TGTCGCCGAC CGAGACCGCG CCGGAGGGCC CCTATGGCGA CCACACCGGC TATTACAACG CGGTCGAGGA ATTCCCGGTG ATGCGGATCA CCGCGATCAC GATGCGGCGC AGTCCGATCT ATCTGTCGAC CTATACGGGG CGTCCGCCGG ACGAGCCGTC GCGGCTCGGC GAGGCGTTCA ACGACGTGTT CCTCCCCGTC GCGCGGCGGC AGTTTCCGGA GATCGTCGAT CTGTGGCTGC CGCCGGAGGC GTGCTCCTAT CGAATTGCGG TCGCCTCGAT CAAGAAACGC TATCCCGGCC AGGCGCGGCG GCTGATGATG GGGCTGTGGT CGATGCTGCC GCAGTTCAGC TACACCAAGC TTTTGATCAT CGTCGACGAC GACGTCGACG TGCGCGACTG GGCCGACGTG ATGTGGGCGG TGTCGACCCG CGCCGACACC TCGCGCGACA TGCTGTCGAT CAGCGATACG CCGATCGACT ATCTCGATTT CGCCTCGCCG AAATCCGGGC TCGGCGGCAA GCTGGGCATC GACGCCACCA ACAAGATCGG CACCGAGACC GAGCGCGAAT GGGGCAAGGT GCTGGAGATG GACAAGGACG TGATCGCGCG GGTCGACGCA ATGTGGTCGA GCCTCGGGCT GCCCCCGGCG CCGACGCCGG CCCTGGCGCA ACGCAGGCTG CTGCGATGA
|
Protein sequence | MLSRVKPPFP DLRAFAAYLE SRGQLHRIKK PVSVVHELTE IHRRVLHAGG PALLIEHPVK ADGTPSEMPI LVNLFGTVER VAWGLGIEPQ NLSALGEALA EMREPAPPQS LTDALSKLPM ARAALSMRPK TAKTAPAQDV VLTGDAVDLG RLPIQIPWPG EPAPLITWPL VFTRPPPGAP GTDNVGVYRI QVLGKDRIIM RWLAHRGGAK HHHQWKAEGR EMPVAIVIGA DPAMILSAVL PLPENISEIK FSGLLRGDRP SMTPCVGIPL NVPADAEIVL EGFVSPTETA PEGPYGDHTG YYNAVEEFPV MRITAITMRR SPIYLSTYTG RPPDEPSRLG EAFNDVFLPV ARRQFPEIVD LWLPPEACSY RIAVASIKKR YPGQARRLMM GLWSMLPQFS YTKLLIIVDD DVDVRDWADV MWAVSTRADT SRDMLSISDT PIDYLDFASP KSGLGGKLGI DATNKIGTET EREWGKVLEM DKDVIARVDA MWSSLGLPPA PTPALAQRRL LR
|
| |