Gene RPB_4011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4011 
Symbol 
ID3911818 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4577188 
End bp4578723 
Gene Length1536 bp 
Protein Length511 aa 
Translation table11 
GC content64% 
IMG OID637885915 
Productamine oxidase 
Protein accessionYP_487615 
Protein GI86751119 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.984835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0140967 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCGATC CCGGTCTCAA ACCCGCTCCC CGTCCTTCGA ATGATCGCGC GCCGCATGCG 
GTGGTGATCG GCTCCGGCTT TGGCGGCTTG GCCGCTGCGG TCCGGCTCGG TGCCAAAGGG
TACCGGGTCA CCGTTCTTGA AAAGCTCGAT GCGCCCGGCG GCCGCGCTTA CGTCCACAAG
CAGGACGGCT TCACCTTCGA CGCCGGCCCG ACCATCGTCA CCGCGCCGTA TCTGTTCGAG
GAGCTGTGGA CGCTGTGCGG CAAGCGGATG TCCGACGACG TCGTGCTGAA GCCGATGTCG
CCGTTCTACC GCATCCGCTT CGACGACGGC ACCTTCTTCG ACTATTCGGA CGACCGTAAC
GCGGTGCTGG ACCAGATCGC GAAATTCTGC CCGGACGACG TCCCGGCCTA TGACCGCTTC
ATGGCGGCGT CGCAGGCGAT CTTCAAGGTC GGCTTCGAGC AGCTCGGCGA CCAGGCGTTC
AGCCGCTTCA CCGACATGCT GAAGATCGCG CCGGACATGA TCAAGCTGGA GAGCTATCGC
AGCGTCTTCG GCATGGTCGC CAAGCACTTC AAGGACCCGA AGCTGCGCCA GGTGTTCAGC
TTCCATCCGC TGCTGATCGG CGGCAATCCG TTCATGTCGA GTTCGGTGTA CTGCCTGATC
ACCTATCTCG AAAAGCAGTG GGGCGTGCAT TCGGCGATGG GCGGCACCGG CGCGCTGGTC
TCCGGCCTCG TCAAGCTGAT CGAAGGCCAG GGCAACACGA TCCGCTATCA GCAGGACGTC
CGCCGCATCG TGGTCGAGAA CGGCGCCGCC CGCGGCGTCG AGCTCGCCGA CGGCGAGGTG
ATCAAGGCCG ATATCGTGGT GTCGAACGCC GATTCGGCCT CGACCTATCG CTACCTGCTG
GCGCCGGAGA CGCGCAAGCG CTGGACCGAC GCCAAGATCG AGCGCTCGCG CTATTCGATG
AGCCTGTTCG TCTGGTACTT CGGCACCAAG CGGCGCTACG ACGACGTCAA GCACCACACC
ATCCTGCTCG GGCCGCGCTA CAAAGAGCTG ATCACCGACA TTTTCTCCCG CAAGATCGTC
GCCGACGATT TCAGCCTGTA TCTGCACCGC CCGACCGCGA CCGATCCGTC GCTGGCGCCG
CCCGGCTGCG ATACGTTCTA CGTGCTGTCG CCGGTGCCGA ATCTGCTCGG CGACATCGAC
TGGACCACCA AGGCCGAGAA CTATCGCGCA TCGATTGCGA AGATGCTCGG CGCCACCGTG
CTGCCCGATC TCGAGAATCA GGTGGTGAGT TCGAAGCTCA CCACGCCGCT GGATTTCCGC
GACCGGCTGT CCTCGTTCCG CGGCGCCGCA TTCGGCCTCG AGCCGGTGCT GTGGCAGAGC
GCCTGGTTCA GGCCGCACAA CCAGAGCGAA GACGTCAAGA ATTTGTATCT CGTCGGTGCC
GGAACCCATC CCGGCGCCGG TCTGCCCGGG GTGCTGTCCT CGGCGCGTGT GCTCGATTCC
CTGGTCCCCG AGGCCGACAG TCTGGTGACG TCATGA
 
Protein sequence
MLDPGLKPAP RPSNDRAPHA VVIGSGFGGL AAAVRLGAKG YRVTVLEKLD APGGRAYVHK 
QDGFTFDAGP TIVTAPYLFE ELWTLCGKRM SDDVVLKPMS PFYRIRFDDG TFFDYSDDRN
AVLDQIAKFC PDDVPAYDRF MAASQAIFKV GFEQLGDQAF SRFTDMLKIA PDMIKLESYR
SVFGMVAKHF KDPKLRQVFS FHPLLIGGNP FMSSSVYCLI TYLEKQWGVH SAMGGTGALV
SGLVKLIEGQ GNTIRYQQDV RRIVVENGAA RGVELADGEV IKADIVVSNA DSASTYRYLL
APETRKRWTD AKIERSRYSM SLFVWYFGTK RRYDDVKHHT ILLGPRYKEL ITDIFSRKIV
ADDFSLYLHR PTATDPSLAP PGCDTFYVLS PVPNLLGDID WTTKAENYRA SIAKMLGATV
LPDLENQVVS SKLTTPLDFR DRLSSFRGAA FGLEPVLWQS AWFRPHNQSE DVKNLYLVGA
GTHPGAGLPG VLSSARVLDS LVPEADSLVT S