Gene RPD_1317 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1317 
Symbol 
ID4021794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1479825 
End bp1481228 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content66% 
IMG OID637961510 
Productcarotenoid oxygenase 
Protein accessionYP_568456 
Protein GI91975797 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3670] Lignostilbene-alpha,beta-dioxygenase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGCAGG TAACGGGGAT TCCGGATGCG TGCGATAATC TTGCACCGAT CCCGATGGAA 
TGCGACGCGC CGTCTCTGCC GATCAAAGGC GAGCTGCCGC GCGAGCTGAA CGGCACGCTG
TATCGCAACG GCGCCAACCC GCAATTCGCC TCGCCGAACG CGCACTGGTT CTTCGGCGAC
GGGATGCTCC ATGCTTTCAG GCTGGAGAAC GGCCGCGCCA GCTATCGCAA CCGCTGGGTT
CGCACGCCGA AATGGCTGGC CGAACATGCC GCCGGTCGGC CGCTCTACGG CGAGTTCAAC
CTCAAGCTGC CAGACGCGCC GCGCTCGGCG CCCGATGACG GCAACGTCGC CAACACCAAC
ATCGTGTTTC ACGCCGGCCG GCTGCTCGCG CTGGAAGAGG CGCATCTGCC GATCGAAATC
GAGCGCGACA CGCTGGCGAC CCGCGGCTAT TGCGACTATG GCGGCGCGCT GAAAGGGCCG
TTCACCGCGC ATCCGAAGAT CGACCCGGTG ACCGGCGAGA TGCTGTTCTT CGGCTACAAC
GCCGCCGGGC CGTTGAAACG GACGATGTCC TTCGGCGCGA TCGATGCGTC GGGTCATGTG
ACGCGATTCG AGTACTTCAA GGCGCCTTAC GCGGCGATGG TGCACGACTT CATCGTCACC
GAGAACTACG TGCTGTTTCC GATCCTGCCG CTGACCGGCA GCATCTGGCG GGCGATGCGC
GGTCGGCCGC CTTATGCCTG GGACCCCGGT AAGGGCTCCT ATGTCGGCGT GATGAAGCGC
ACCGGCACGA CGCGCGACAT CCGCTGGTTT CGCGGCGACG CATGCTTCGT GTTCCACGTC
ATGAATGCGT GGGAGGACGG GACAAAGATC GTCGCCGACG TGATGCAATC CGAGGAAGCG
CCGCTGTTCA CCCATCCCGA CGGCCGCCGC ACCGATCCCG AGAAGGGCCG CGCGCGGTTG
TGCCGCTGGA GCTTCGACCT CGCCGGCAAC ACCAATGCCT TCAAGCGCAG CTATCTCGAC
GACATCAGCG GCGAATTCCC GCGGATCGAC GAGCGCCGCG CCGGCCTGCG CAGCGGCCAC
GGCTGGTACG CCTGCGCCAG CCCGGAGACG CCGATGCTCG GGATGCTCAC CGGACTCGTG
CATGTCGACG GCAACGGCCA TCGTCGCGCG CGCTATCTGC TGCCAACCGG CGACACCATC
GGCGAGCCGG TGTTCGTGCC GCGCAAGCCG GATTCAGCCG AAGCCGATGG CTGGCTGCTG
ACCGTGATCT GGCGCAGCTG CGAAAACCGC AGCGACCTCG CGGTGTTCAA CGCCGCCGAC
ATCGCCGGCG GCCCGATCGC CTTGGTGCAA CTCGGCCACC GCGTCCCGGA CGGCTTTCAC
GGCAATTGGG TGGCGGCGGG GTGA
 
Protein sequence
MLQVTGIPDA CDNLAPIPME CDAPSLPIKG ELPRELNGTL YRNGANPQFA SPNAHWFFGD 
GMLHAFRLEN GRASYRNRWV RTPKWLAEHA AGRPLYGEFN LKLPDAPRSA PDDGNVANTN
IVFHAGRLLA LEEAHLPIEI ERDTLATRGY CDYGGALKGP FTAHPKIDPV TGEMLFFGYN
AAGPLKRTMS FGAIDASGHV TRFEYFKAPY AAMVHDFIVT ENYVLFPILP LTGSIWRAMR
GRPPYAWDPG KGSYVGVMKR TGTTRDIRWF RGDACFVFHV MNAWEDGTKI VADVMQSEEA
PLFTHPDGRR TDPEKGRARL CRWSFDLAGN TNAFKRSYLD DISGEFPRID ERRAGLRSGH
GWYACASPET PMLGMLTGLV HVDGNGHRRA RYLLPTGDTI GEPVFVPRKP DSAEADGWLL
TVIWRSCENR SDLAVFNAAD IAGGPIALVQ LGHRVPDGFH GNWVAAG