Gene RPB_4671 details

Gene Information

Locus tag: RPB_4671
Symbol:
ID: 3912489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 5284188
End bp: 5285192
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 67%
IMG OID: 637886576
Product: cytochrome d ubiquinol oxidase, subunit II
Protein accession: YP_488265
Protein GI: 86751769
COG category: [C] Energy production and conversion
COG ID: [COG1294] Cytochrome bd-type quinol oxidase, subunit 2
TIGRFAM ID: [TIGR00203] cytochrome d oxidase, subunit II (cydB)
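
The listed lengths can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the final codon is a stop codon that contributes no amino acid; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5284188
end_bp = 5285192

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1005 334
```

Both results agree with the Gene Length and Protein Length fields.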


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.297118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACG ATCTCGCGAC CATCTGGGCC CTCATCATCG CGTTCGCGGT GTTCGTCTAT 
GTGGTGATGG ACGGCTTCGA CCTCGGCCTC GGCATTCTGT TTCCGCTGTT TCGCACCAAG
CGCGACCGCG ATGTCGTGAT GAACAGCGTC GCGCCGGTGT GGGACGGCAA CGAGACCTGG
CTGGTGCTCG GCGGCGGCGG CCTGTTCGCG GCGTTTCCGC TGGCCTATGC GGTGCTGATG
CCGGCGCTGT ATACGCCGAT CATCGCGATG CTGCTCGGCC TGGTGTTTCG CGGCGTCGCG
TTCGAATTTC GCTGGCGCAG CCTGCGCGAG CGCAACCGCT GGGACCTCGC CTTCTTCCTC
GGCTCGCTGA TCGCGACGCT GGCGCAGGGC ATCGCGCTCG GCGCGATCCT GCAAGGCGTC
GCTGTCGAGG GTCGCGCCTA TGCGGGCGGA TGGTGGGACT GGCTGACGCC GTTCAGCGTG
CTGACCGGGC TGGCGCTGGT GACCGGCTAC GCGCTGCTCG GCGCCACCTG GCTGGTGATG
AAGACCACCG GCGAACTGCG CGACCAGGCC TATCGGCTGA GCCGCTGGCT GCTGCTGGCG
ATGCTGATCG CGATCGTCGC CGTCAGCGCC GCGACGCCGT TCCTGAGCTA CGACTATTCG
GAACGCTGGT TCGCCTGGCC GAACGTGCTC GCCACCGCGC AAGTGCCGCT CGCCGTGGCG
ATCGTCACCG CGCTGCTGCT GCGGGCGCTG ACGCAGCGCC GCGACTACCA GCCGTTCCTG
CTGACGCTGT GCCTGTTCGC GCTGTCCTAT GCCGGGCTCG GCATCAGCAT CTGGCCCTAT
GTGGTGCCGC GGAGCATCAC CGTCTGGCAG GCGGCGGCGC CCGAGAGCAG CCAGCTCTTC
ATGCTGGTCG GCGTCGCGAT CCTGGTGCCG ATCATCCTCG TCTACACAGC CTGGGCCTAT
TGGGTGTTTC GCGGCAAGGT CGACCCCGAC AGCGGCTATC ATTGA
 
Protein sequence
MDYDLATIWA LIIAFAVFVY VVMDGFDLGL GILFPLFRTK RDRDVVMNSV APVWDGNETW 
LVLGGGGLFA AFPLAYAVLM PALYTPIIAM LLGLVFRGVA FEFRWRSLRE RNRWDLAFFL
GSLIATLAQG IALGAILQGV AVEGRAYAGG WWDWLTPFSV LTGLALVTGY ALLGATWLVM
KTTGELRDQA YRLSRWLLLA MLIAIVAVSA ATPFLSYDYS ERWFAWPNVL ATAQVPLAVA
IVTALLLRAL TQRRDYQPFL LTLCLFALSY AGLGISIWPY VVPRSITVWQ AAAPESSQLF
MLVGVAILVP IILVYTAWAY WVFRGKVDPD SGYH
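
The gene and protein sequences can be checked against each other, and against the GC content field, with a short script. This is a minimal sketch rather than the method used to build the record: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard table (the two differ in which codons may act as start codons).

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Translation table 11 uses the same codon-to-amino-acid assignments as table 1.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError if a codon contains a letter other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "ATGGATTACG ATCTCGCGAC CATCTGGGCC"
print(translate(first_bases))  # MDYDLATIWA
```

Passing the full gene sequence to `translate` should reproduce the protein sequence, and `gc_fraction` on the same input can be compared with the listed GC content of 67%.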