Gene RPD_3766 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_3766 
Symbol 
ID4024282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4202544 
End bp4204079 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content63% 
IMG OID637963970 
Productamine oxidase 
Protein accessionYP_570888 
Protein GI91978229 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.918224 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0064797 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCGATC CAGGTCCCAA AACCGCTCCT CGTCTCCCTC GTGATCACGC ACCGCACGCG 
GTGGTGATCG GCTCCGGCTT TGGCGGTCTC GCCGCCGCGG TCCGGCTCGG TGCCAAAGGG
TACCGTGTCA CCGTCTTGGA GAAGCTCGAT AAGCCGGGCG GCCGCGCTTA CGTCCACAAG
CAGGACGGCT TCACCTTCGA CGCAGGTCCG ACCATCGTCA CCGCGCCTTA TCTGTTCGAA
GAGCTGTGGA AGCTCTGCGG TAAGCGAATG TCCGACGACG TCACGCTGAA GCCGATGGCG
CCGTTCTATC GCATCCGCTT CGACGACGGC ACGCATTTCG ATTATTCCGA CGACCGCGAC
GCGGTGCTGG ATCAGATCGC GAAATTCTGC CCGGACGACG TGCCGGCCTA CGACCGCTTC
ATGGCGGCGT CGCAGGCGAT CTTCAAGGTC GGCTTCGAGC AACTCGGCGA TCAGGCGTTC
AGCCGCTTCA CCGACATGCT GAAGATCGCG CCGGACATGA TCAAGCTGGA GAGCTATCGC
AGCGTCTACG GGCTGGTCGC GAAGCACTTC AAGGATCCGA AGCTGCGTCA GGTGTTCAGC
TTCCATCCGC TGCTGATCGG CGGCAACCCG TTCATGTCGA GTTCGGTGTA CTGCCTGATC
ACCTATCTGG AGAAGCAGTG GGGCGTGCAC TCGGCGATGG GCGGCACCGG CAGCCTGGTG
ACCGGGCTGG TGAAGCTGAT CGAAGGGCAG GGCAACGAGA TCCGCTACAA TCAGGACGTT
CGCCAGATCG TCGTCGAGAA CGGCGCCGCC TGCGGCGTCA AGCTCGCCGA CGGCGAAGTG
ATCAAGGCCG ACATCGTGGT GTCGAATGCG GATTCGGCCT CGACCTATCG TTATCTACTG
GCGCCGGAGA CGCGCAGCCG TTGGACCGAC GCCAAGATCG AGAAGTCCCG TTACTCGATG
AGCCTGTTCG TCTGGTACTT CGGCACGAAG CGGCGGTACG AGGACGTCAA ACACCACACC
ATCCTGCTCG GCCCGCGCTA CCGCGAGCTG ATCACCGACA TCTTCTCGCG CAAGATCGTC
GCCGATGATT TCAGCCTGTA TCTGCACCGC CCGACCGCGA CCGATCCGTC GCTGGCGCCG
CCCGGCTGCG ATACGTTCTA CGTACTGTCG CCGGTGCCGA ACCTGCTCGG CGACATCGAC
TGGACCACCA AGGCCGAGAG CTATCGCGCA TCGATCGCCA AGATGCTCGG CGCCACCGTG
CTGCCCGATC TCGAAAATCA GGTGGTGACC TCGAAGCTCA CCACGCCGCT GGATTTCCGC
GACCGGCTGT CGTCGTTCCG CGGTGCTGCG TTCGGGCTCG AGCCGGTGTT GTGGCAGAGC
GCCTGGTTCC GGCCGCACAA CCAGAGCGAA GACGTCAAGA ATCTGTATCT CGTCGGCGCG
GGAACGCATC CCGGCGCTGG TCTCCCCGGG GTGTTGTCCT CGGCACGCGT GCTCGATTCC
CTGGTCCCCG AGGCTGACAG TCTGGTGAAT TCATGA
 
Protein sequence
MLDPGPKTAP RLPRDHAPHA VVIGSGFGGL AAAVRLGAKG YRVTVLEKLD KPGGRAYVHK 
QDGFTFDAGP TIVTAPYLFE ELWKLCGKRM SDDVTLKPMA PFYRIRFDDG THFDYSDDRD
AVLDQIAKFC PDDVPAYDRF MAASQAIFKV GFEQLGDQAF SRFTDMLKIA PDMIKLESYR
SVYGLVAKHF KDPKLRQVFS FHPLLIGGNP FMSSSVYCLI TYLEKQWGVH SAMGGTGSLV
TGLVKLIEGQ GNEIRYNQDV RQIVVENGAA CGVKLADGEV IKADIVVSNA DSASTYRYLL
APETRSRWTD AKIEKSRYSM SLFVWYFGTK RRYEDVKHHT ILLGPRYREL ITDIFSRKIV
ADDFSLYLHR PTATDPSLAP PGCDTFYVLS PVPNLLGDID WTTKAESYRA SIAKMLGATV
LPDLENQVVT SKLTTPLDFR DRLSSFRGAA FGLEPVLWQS AWFRPHNQSE DVKNLYLVGA
GTHPGAGLPG VLSSARVLDS LVPEADSLVN S