Gene RPB_1216 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1216 
Symbol 
ID3910151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1390031 
End bp1391434 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content68% 
IMG OID637883110 
Productcarotenoid oxygenase 
Protein accessionYP_484837 
Protein GI86748341 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.548821 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCAGG TCACCGGAGT TCCGGATGCG TGCGATAATC TGGCGCCGAT CCCGATGGAG 
TGCGACGCCG CGTTTCTGTC GATCAAGGGC GAGCTGCCGC GCGAATTGAA CGGCACGCTG
TATCGCAACG GCGCCAATCC GCAATTCGCA TCAAAGAACG CGCATTGGTT CTTCGGCGAC
GGCATGCTGC ATGCCTTCCG GCTGGAGAAC GGCCGCGCCA GCTATCGCAA CCGCTGGGTC
CGCACCCCGA AATGGCTGGC CGAGCACGAA GCCGGCCGGC CGCTTTACGG CGAGTTCAAT
CTCAAGCGGC CCGATGCGCC GCGTTCGGCG CCCGACGACG GCAACGTCGC CAACACCAAC
ATCGTGTTCC ACGCCGGCCG GCTGCTGGCG CTGGAAGAGG CGCATCTGCC GATCGAAATC
GAGCGCGACA CGCTGGCGAC GCGCGGCTAT TGCGACTATG GCGGGGCGCT GAAAGGGCCG
TTCACCGCGC ATCCGAAGAT CGACCCGGTC ACCGGCGAGA TGCTGTTCTT CGGCTACAAT
GCCGACGGCC CGTTGAAACG GACGATGTCG TTCGGCGCGA TCGACGCGTC CGGCCACGTC
ACCCGGTTCG AGCGCTTCAA GGCGCCCTAT GCAGCGATGG TGCACGATTT CATCGTCACC
GAGAACTATG TGCTGTTTCC GATCCTGCCG CTCACCGGCA GCATCTGGCG GGCGATGCGC
GGCCGCCCGC CTTATGCCTG GGACCCCGCT AAGGGCTCTT ACGTCGGCGT GATGAAGCGC
ACCGGCTCGA CCCGCGACAT CCGCTGGTTC CGCGGTGACG CCTGCTTCGT GTTCCACGTC
ATGAACGCGT GGGAGGACGG CACCAAGATC GTCGCCGACG TGATGCAATC CGAGGAAGCG
CCGCTGTTCA CCCATCCGGA CGGCCGCCGC ACCGATCCGG AGAAGGGCCG CGCCCGGCTG
TGCCGCTGGA GCTTCGACCT CGCCGGCAAC ACCAACGCCT TCACGCGCAG CTATCTCGAC
GACATCAGCG GCGAATTCCC GCGGATCGAC GAGCGCCGCG CCGGCCTGCG CAGCGGCCAT
GGCTGGTACG CCTGCGCCAG CCCGGAGACG CCGACGCTCG GGATGCTGAC GGGGCTGGTG
CATGTCGACG GCAACGGCGA TCGCCGCGCC CGCTATCTGC TGCCGACCGG CGACAGCATC
GGCGAGCCGG TGTTCGTGCC GCGCACGCCG GATGCGGCGG AAGCCGATGG CTGGATCCTC
ACGGTGATCT GGCGCGGCTG CGAGAACCGC AGCGACCTCG CGGTGTTCGA CGCCGCCGAC
ATCGCGGGCG GCCCGATCGC GCTGGTCCAA CTCGGCCACC GCGTCCCGGA CGGCTTCCAC
GGCAATTGGG TGGCGGCGGG GTGA
 
Protein sequence
MLQVTGVPDA CDNLAPIPME CDAAFLSIKG ELPRELNGTL YRNGANPQFA SKNAHWFFGD 
GMLHAFRLEN GRASYRNRWV RTPKWLAEHE AGRPLYGEFN LKRPDAPRSA PDDGNVANTN
IVFHAGRLLA LEEAHLPIEI ERDTLATRGY CDYGGALKGP FTAHPKIDPV TGEMLFFGYN
ADGPLKRTMS FGAIDASGHV TRFERFKAPY AAMVHDFIVT ENYVLFPILP LTGSIWRAMR
GRPPYAWDPA KGSYVGVMKR TGSTRDIRWF RGDACFVFHV MNAWEDGTKI VADVMQSEEA
PLFTHPDGRR TDPEKGRARL CRWSFDLAGN TNAFTRSYLD DISGEFPRID ERRAGLRSGH
GWYACASPET PTLGMLTGLV HVDGNGDRRA RYLLPTGDSI GEPVFVPRTP DAAEADGWIL
TVIWRGCENR SDLAVFDAAD IAGGPIALVQ LGHRVPDGFH GNWVAAG