Gene RPB_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_4005 
Symbol 
ID3911812 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp4571884 
End bp4573434 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content70% 
IMG OID637885909 
ProductFAD dependent oxidoreductase 
Protein accessionYP_487609 
Protein GI86751113 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02734] phytoene desaturase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.665607 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00234507 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGAGGGATC AGACCGCAAA GAATCGCGTG GTGGTGATCG GTGCCGGCGT GGCCGGACTG 
ACCTCCGCTC TCGCCCTGTC GGCGCGCGGC CTCGACGTGA CCGTGGTCGA GCGCGCCGCG
ACGCCGGGCG GCAAGATCCG CGAGGTCGGC ATCGGCGCGG CGCGGATCGA CAGCGGCCCC
ACCGTCTTCA CCATGCGCTG GGTGTTCGAG GAATTGTTCG CCGCGGCCAA GCTGAACTTC
GCCGATCACG TCCGGCTGCG GCCGCTGTCG GTGCTGGCGC GCCATGCCTG GAACGAGGCC
ACGCGACTCG ATCTGTTCGC CGACGAGGCG CGCTCGGCGG ATGCGATCGG CGTGTTCGCC
GGCCCGGCCG AGGCGAAGCG CTACCGACAA TTCTGCGCCG ACAGCCGGCG GATCTATCAG
ATCCTAGAAC AACCGTTCCT GCGCGCCACC GCGCCGAGCC TGCCCAGCCT CGCCACCGCC
AACGGCCTCG GCGGGCTGCT GGAGTTGCGC AAGATCCGCC CGTTCACCAC GATGTGGAAC
GCGCTCGGCG ACTACTTCCG CGATCCACGC CTGCGGCAAT TGTTCGGGCG CTATTCGACT
TATTGCGGAT CCTCGCCGTT CCAGGCGCCG GCGACGCTGA TGCTGGTCGC GCATGTCGAG
CAGCAGGGCG TCTGGATCAT CGAGGGCGGC ATGCATGCGC TGGCGCGCGC GCTGGCCGAT
TGCGCGGCGT CGCAAGGCGC CAGCATTCGC TACGATCAGG AGGTGCGCGA GATCATGGTC
TCGGGCGGCC GCGCCGCCGG GGTGATTCTC GCCAGCGGCG AACGCATCGA AGCGCAATCG
GTGATCGTCA ACGCCGATGT CGGCGCCCTG CCCGGCGGCC TGTTCGGCGA CCAGGTGCAA
CGCGCCGTGC CGCCGCTGGC GCCGAAGCTG CGGTCGCTGT CGGCGATGAC CTGGAGTATG
GTGGCGAAGA CCGAGGGTTT TCCGCTCAGC CGGCATTCGG TGTTCTTCTC CCGCGACTAC
GCGCGCGAAT TCAACGATAT CTTCCAGCGC GGCGAAGTGC CGAGCGAGCC GACCGTCTAT
GTCTGCGGTC AGGACCGAAT CGACGACGAC GCGCGGCTCG GCGATGGCGG CGAGGAAGAA
CTGCTGGTGC TGATCAACGC GCCGCCTGTC GGCGATCGCA AACCGTTCGA TGAGGCCGAG
ATCGCGCGTT GCGCGGAGCG CACGTTCGAC GTGCTGGAAC GCTGCGGGTT GCGCATCCGG
CTTCAGCCTG ACAAAACCAC GGTGACCACG CCCGCCGATT TCAACCGGCT GTTTCCGGCG
ACCGGCGGGG CGCTGTACGG CCGCGCCTCG CACGGCTGGA CCGCCTCGTT CGAGCGCCCC
GGCGCACGGA CGAAAATCCC CGGCCTGTAT CTCGCCGGCG GCAGCACCCA TCCCGGCCCC
GGCGTGCCGA TGGCCGGCCT GTCCGGGCGC GCCGCCGCGG CGAGCCTCGT GGCCGACCTC
GAGGCCGAGC TGCAGGGCTT GAAGCCGATG CTGCGGCCGG CCGCCACCTG A
 
Protein sequence
MRDQTAKNRV VVIGAGVAGL TSALALSARG LDVTVVERAA TPGGKIREVG IGAARIDSGP 
TVFTMRWVFE ELFAAAKLNF ADHVRLRPLS VLARHAWNEA TRLDLFADEA RSADAIGVFA
GPAEAKRYRQ FCADSRRIYQ ILEQPFLRAT APSLPSLATA NGLGGLLELR KIRPFTTMWN
ALGDYFRDPR LRQLFGRYST YCGSSPFQAP ATLMLVAHVE QQGVWIIEGG MHALARALAD
CAASQGASIR YDQEVREIMV SGGRAAGVIL ASGERIEAQS VIVNADVGAL PGGLFGDQVQ
RAVPPLAPKL RSLSAMTWSM VAKTEGFPLS RHSVFFSRDY AREFNDIFQR GEVPSEPTVY
VCGQDRIDDD ARLGDGGEEE LLVLINAPPV GDRKPFDEAE IARCAERTFD VLERCGLRIR
LQPDKTTVTT PADFNRLFPA TGGALYGRAS HGWTASFERP GARTKIPGLY LAGGSTHPGP
GVPMAGLSGR AAAASLVADL EAELQGLKPM LRPAAT