Gene Information |
Locus tag | RPB_1216 |
Symbol | |
ID | 3910151 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | - |
Start bp | 1390031 |
End bp | 1391434 |
Gene Length | 1404 bp |
Protein Length | 467 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637883110 |
Product | carotenoid oxygenase |
Protein accession | YP_484837 |
Protein GI | 86748341 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes |
TIGRFAM ID | |
|
|
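The coordinates, gene length, and protein length listed above are mutually consistent, which a little arithmetic confirms. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
gene_length = span_length(1390031, 1391434)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1404 467
```

Both results match the Gene Length (1404 bp) and Protein Length (467 aa) rows.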
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.548821 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGCAGG TCACCGGAGT TCCGGATGCG TGCGATAATC TGGCGCCGAT CCCGATGGAG TGCGACGCCG CGTTTCTGTC GATCAAGGGC GAGCTGCCGC GCGAATTGAA CGGCACGCTG TATCGCAACG GCGCCAATCC GCAATTCGCA TCAAAGAACG CGCATTGGTT CTTCGGCGAC GGCATGCTGC ATGCCTTCCG GCTGGAGAAC GGCCGCGCCA GCTATCGCAA CCGCTGGGTC CGCACCCCGA AATGGCTGGC CGAGCACGAA GCCGGCCGGC CGCTTTACGG CGAGTTCAAT CTCAAGCGGC CCGATGCGCC GCGTTCGGCG CCCGACGACG GCAACGTCGC CAACACCAAC ATCGTGTTCC ACGCCGGCCG GCTGCTGGCG CTGGAAGAGG CGCATCTGCC GATCGAAATC GAGCGCGACA CGCTGGCGAC GCGCGGCTAT TGCGACTATG GCGGGGCGCT GAAAGGGCCG TTCACCGCGC ATCCGAAGAT CGACCCGGTC ACCGGCGAGA TGCTGTTCTT CGGCTACAAT GCCGACGGCC CGTTGAAACG GACGATGTCG TTCGGCGCGA TCGACGCGTC CGGCCACGTC ACCCGGTTCG AGCGCTTCAA GGCGCCCTAT GCAGCGATGG TGCACGATTT CATCGTCACC GAGAACTATG TGCTGTTTCC GATCCTGCCG CTCACCGGCA GCATCTGGCG GGCGATGCGC GGCCGCCCGC CTTATGCCTG GGACCCCGCT AAGGGCTCTT ACGTCGGCGT GATGAAGCGC ACCGGCTCGA CCCGCGACAT CCGCTGGTTC CGCGGTGACG CCTGCTTCGT GTTCCACGTC ATGAACGCGT GGGAGGACGG CACCAAGATC GTCGCCGACG TGATGCAATC CGAGGAAGCG CCGCTGTTCA CCCATCCGGA CGGCCGCCGC ACCGATCCGG AGAAGGGCCG CGCCCGGCTG TGCCGCTGGA GCTTCGACCT CGCCGGCAAC ACCAACGCCT TCACGCGCAG CTATCTCGAC GACATCAGCG GCGAATTCCC GCGGATCGAC GAGCGCCGCG CCGGCCTGCG CAGCGGCCAT GGCTGGTACG CCTGCGCCAG CCCGGAGACG CCGACGCTCG GGATGCTGAC GGGGCTGGTG CATGTCGACG GCAACGGCGA TCGCCGCGCC CGCTATCTGC TGCCGACCGG CGACAGCATC GGCGAGCCGG TGTTCGTGCC GCGCACGCCG GATGCGGCGG AAGCCGATGG CTGGATCCTC ACGGTGATCT GGCGCGGCTG CGAGAACCGC AGCGACCTCG CGGTGTTCGA CGCCGCCGAC ATCGCGGGCG GCCCGATCGC GCTGGTCCAA CTCGGCCACC GCGTCCCGGA CGGCTTCCAC GGCAATTGGG TGGCGGCGGG GTGA
|
Protein sequence | MLQVTGVPDA CDNLAPIPME CDAAFLSIKG ELPRELNGTL YRNGANPQFA SKNAHWFFGD GMLHAFRLEN GRASYRNRWV RTPKWLAEHE AGRPLYGEFN LKRPDAPRSA PDDGNVANTN IVFHAGRLLA LEEAHLPIEI ERDTLATRGY CDYGGALKGP FTAHPKIDPV TGEMLFFGYN ADGPLKRTMS FGAIDASGHV TRFERFKAPY AAMVHDFIVT ENYVLFPILP LTGSIWRAMR GRPPYAWDPA KGSYVGVMKR TGSTRDIRWF RGDACFVFHV MNAWEDGTKI VADVMQSEEA PLFTHPDGRR TDPEKGRARL CRWSFDLAGN TNAFTRSYLD DISGEFPRID ERRAGLRSGH GWYACASPET PTLGMLTGLV HVDGNGDRRA RYLLPTGDSI GEPVFVPRTP DAAEADGWIL TVIWRGCENR SDLAVFDAAD IAGGPIALVQ LGHRVPDGFH GNWVAAG
|
| |
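The gene and protein sequences above are related through translation table 11, whose codon-to-amino-acid assignments are the same as the standard genetic code (the tables differ in which codons may act as start codons). The sketch below is one possible way to check that relationship and to compute GC content in plain Python. It assumes the gene sequence is given in coding orientation, as it appears to be here (it begins with ATG and ends with TGA even though the gene lies on the minus strand). The names `clean`, `translate`, and `gc_percent` are hypothetical helpers, not part of any library.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains anything other than A, C, G, or T.
    """
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the Gene sequence field above.
first_blocks = "ATGTTGCAGG TCACCGGAGT TCCGGATGCG"
print(translate(first_blocks))  # MLQVTGVPDA
```

The output matches the first block of the Protein sequence field. Passing the complete Gene sequence field to `translate` should reproduce the full protein, and `gc_percent` on the same input can be compared against the GC content row.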