Gene RPB_2014 details


Gene Information

Locus tag: RPB_2014
Symbol:
ID: 3909520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 2291381
End bp: 2293039
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 70%
IMG OID: 637883908
Product: FAD dependent oxidoreductase
Protein accession: YP_485633
Protein GI: 86749137
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID:
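
The coordinates and lengths in this record are mutually consistent, which can be checked with a minimal Python sketch. It assumes inclusive, 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function and constant names are illustrative and not part of the record.

```python
# Values copied from the Gene Information record.
START_BP = 2291381
END_BP = 2293039
GENE_LENGTH_BP = 1659
PROTEIN_LENGTH_AA = 552


def span_length(start: int, end: int) -> int:
    """Length of a feature given inclusive, 1-based coordinates (assumed)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is self-consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1659 bp corresponds to 553 codons, one of which is the stop codon, leaving the 552 residues listed.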


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.166375
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.451875
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCGCCG CGGCGCAGGA GCCGCCGCCC GCCCCTCCTG GAACAGCCAT GTCCGATTCC 
GACGTGCTGA TCATCGGCGC CGGTCACAAC GGCCTCACCT GCGCCGCCTA TCTGGCGCAG
GCCGGGCTGA CGGTGAAGGT CGTCGAGCGC CGCAGCGTGG TCGGCGGCGC CGCGGTGACT
CAGGAATTTC ATCCCGGCTT CCGCAATTCG GTCGCGGCCT ACACCGTCAG CCTGCTCAAC
CCCAAGGTGA TCGCCGACCT GAAGCTCGCC GAGCACGGGC TGCGCATCGT CGAGCGCCGG
GCGCAGAACT TCCTGCCCGA CCCGAACGGC AAATATCTGC TGACCGGGGC CAAATACACT
GCCAGATCGG TGGCGCATCT CAGCAAGCCC GACGCCGGCA AGATCGACGG TTTCACCGCC
GAGTTGGAGA CCATCGCCGA CGTGCTGCGG CATTTCGTGC TGCGGGCGCC GCCCAATCTG
GTCGAGCGCT TCGGCGTCGG CGCCGTTCGC GAGGGCCTCG CCGCGCTCGG CGCCGCCAAC
CGGCTGCGCG CGCTGACGCT GGAGCAGCAG CGGCTGCTGC TCGATCTGTT CACCTGCTCG
GCCGGTGAGA TGCTCGACGC GCGGTTCGAG CACGATCTGG TCAAGGCGCT GTTCGGCTTC
GACGCCATCG TCGGCAATTA CGCCAGCCCC TACGCCGCCG GCTCCGCCTA CGTCATGCTG
CACCACGCCT TCGGCGAGGT GAACGGCAGG AAGGGCGTCT GGGGCCATGC GGTCGGCGGG
ATGGGGGCGA TTTCGCAAGC CATGGCCGCC GCAGCCCGCG CCGCCGGTGC CGAGATCGAA
ACCTCGGCCG GCGTCCGCGA GGTGCTGGTC GAGAAGGACC GGGTGGTCGG CGTCACGCTC
GACGACGGCC GCGGCCTGCG GGCGAAATAC GTCGCCGCCA ACGTCAATCC GAAGCTGCTC
TACACCAGGC TGCTACCGAA GGATGCGCTG CCGGGGGATG TCCACCGCCG CATGACCGCG
TGGAAGAACG GCTCCGGCAC CTTCCGCATG AATGTCGCGT TGGCCGGCCT GCCCTCGTTC
ACGGCGCTGC CCGGCACCGG CGACCATCTC ACCGCCGGCA TCATCATCGC CCCCGGCCTC
GATTACATGG ACCGCGCCTG GAGCGACGCA CGCGCGCACG GCTGGAGCCG CGAGCCGGTG
GTCGAGATGC TGATCCCATC GACGCTGGAC GAATCCCTGG CCCCGCCCGG CCGGCACGTC
GCCAGCCTGT TCTGCCAGCA CGTCGCGCCG CAACTGCCCG GTGGCGTGTC CTGGGACGAC
CGCCGCGACG AGGTCGCCGA TCTAATGATC GCCACCGTCG ACCGCTACGC CCCCGGCTTC
GCCGCCAGCG TGCTCGGCCG CCAGATCCTG TCGCCGCTCG ATCTCGAGCG CGATTTCGGC
CTGCTCGGCG GCGACATTTT TCACGGCGCG CTGAGCCTGA ACCAGCTGTT CTCGGCCCGG
CCGATGCTCG GCCATGCCGA CTACCGCGGG CCGCTGAAGG GCCTCTACCA CTGCGGTAGC
GGCGCCCATC CCGGCGGCGG GGTCACCGGC GCCCCCGGCC ACAACGCCGC AGCCGCGATC
CTCAACGATC ACCGCAGCCT GTTCACAAAG CGTGGATAA
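
The GC content reported for this gene can be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as displayed (ten-base blocks separated by spaces and line breaks); `gc_content_fraction` is a hypothetical helper name, not something provided by the source database.

```python
def gc_content_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, as displayed.
    print(gc_content_fraction("GTGGCCGCCG CGGCGCAGGA"))
```

Passing the full gene sequence instead of the two-block excerpt gives the value to compare against the GC content field.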
 
Protein sequence
MAAAAQEPPP APPGTAMSDS DVLIIGAGHN GLTCAAYLAQ AGLTVKVVER RSVVGGAAVT 
QEFHPGFRNS VAAYTVSLLN PKVIADLKLA EHGLRIVERR AQNFLPDPNG KYLLTGAKYT
ARSVAHLSKP DAGKIDGFTA ELETIADVLR HFVLRAPPNL VERFGVGAVR EGLAALGAAN
RLRALTLEQQ RLLLDLFTCS AGEMLDARFE HDLVKALFGF DAIVGNYASP YAAGSAYVML
HHAFGEVNGR KGVWGHAVGG MGAISQAMAA AARAAGAEIE TSAGVREVLV EKDRVVGVTL
DDGRGLRAKY VAANVNPKLL YTRLLPKDAL PGDVHRRMTA WKNGSGTFRM NVALAGLPSF
TALPGTGDHL TAGIIIAPGL DYMDRAWSDA RAHGWSREPV VEMLIPSTLD ESLAPPGRHV
ASLFCQHVAP QLPGGVSWDD RRDEVADLMI ATVDRYAPGF AASVLGRQIL SPLDLERDFG
LLGGDIFHGA LSLNQLFSAR PMLGHADYRG PLKGLYHCGS GAHPGGGVTG APGHNAAAAI
LNDHRSLFTK RG