Gene RPD_1089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_1089 
Symbol 
ID4021565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp1239009 
End bp1240115 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content67% 
IMG OID637961281 
Productelectron transfer flavoprotein beta-subunit 
Protein accessionYP_568228 
Protein GI91975569 
COG category[C] Energy production and conversion 
COG ID[COG2025] Electron transfer flavoprotein, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.419334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC CAGCCAAGCC TGCCCCGCAG CCCGCCGGAC GCGCCAACGC CAAGAAAGAG 
CTGTCCGAAC ACTTCAAGCA GTACAAACAC GTCTGGGTGT TCGTCGAACA GGAGCGAGGC
CACGTCCATC CGGTTTCCTG GGAACTGATG GGTTCCGGCC GCCGACTCGC CGACAAGCTC
GGCGTCGAAC TCGCGGCGGT GGTGATCGGG CCCGCCGGCG ACGCCACACG CGTCGCGGCG
GCGGAGTCGT TCTGCTACGG CGCCGATCTC GCTTACATCG TCGCCGATGA CGTGCTCGCC
GACTATCGCA ACGAGTCCTA CACCAAGGCG CTGACCGATC TGGTCAACAC CTACAAGCCG
GAAATCCTGC TGCTCGGTGC CACCACGCTC GGCCGGGACC TCGCCGGCGC CGTCGCCACC
ACGCTGCTGA CGGGACTCAC CGCGGACTGC ACCGAACTCG AGGTTGACGC CGACAATTCG
CTCGCCGCGA CCCGGCCGAC CTTCGGCGGC TCGCTGCTCT GCACGATCTA CACGCTGAAT
TTCCGGCCGC AGATGGCGAC GGTGCGGCCG CGGGTGATGG AGATGCCGGA CCGCGTCGAG
AAGCCGGTCG GCCGCATCAT CGAATTTCCG CTCGGCATGG TCGAAGCCGA CATCGTCACC
AAGGTGCTGG CGTTCGTGCC GGACCGTGAC AAGGCGACTT CGAACCTGGC TTACGCCGAC
ATCGTCGTCG CAGGCGGCAT TGGGCTCGGT TCGCCGGAGA ACTTCCAGCT CGTTCGGCAG
CTCGCCGGGG TGCTCGGCGC CGAATATGGC TGCTCGCGGC CGCTGGTCCA GAAGGGCTGG
GTCTCGGCCG ACCGGCAGAT CGGCCAGACC GGCAAGACCA TCCGCCCGAA GCTCTACATC
GCCGCCGGCA TCTCCGGGGC GATCCAGCAT CGCGTCGGCG TGGACGGCGC CGATCTGATC
GTCGCCATCA ACACCGACAA GAATGCGCCG ATCTTCGACT TCGCGCATCT GGCGATCGTC
ACCGACGCGA TCCGGCTGTT GCCGGCGCTG ACCGAAGCAT TCCGCAAGCG GCTGTCGCCG
CACACCCGAG ACCGGATCGC AAGCTGA
 
Protein sequence
MSQPAKPAPQ PAGRANAKKE LSEHFKQYKH VWVFVEQERG HVHPVSWELM GSGRRLADKL 
GVELAAVVIG PAGDATRVAA AESFCYGADL AYIVADDVLA DYRNESYTKA LTDLVNTYKP
EILLLGATTL GRDLAGAVAT TLLTGLTADC TELEVDADNS LAATRPTFGG SLLCTIYTLN
FRPQMATVRP RVMEMPDRVE KPVGRIIEFP LGMVEADIVT KVLAFVPDRD KATSNLAYAD
IVVAGGIGLG SPENFQLVRQ LAGVLGAEYG CSRPLVQKGW VSADRQIGQT GKTIRPKLYI
AAGISGAIQH RVGVDGADLI VAINTDKNAP IFDFAHLAIV TDAIRLLPAL TEAFRKRLSP
HTRDRIAS