Gene RPB_1201 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPB_1201 
Symbol 
ID3910136 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris HaA2 
KingdomBacteria 
Replicon accessionNC_007778 
Strand
Start bp1374860 
End bp1376929 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content63% 
IMG OID637883095 
Productcytochrome c1 
Protein accessionYP_484822 
Protein GI86748326 
COG category[C] Energy production and conversion 
COG ID[COG1290] Cytochrome b subunit of the bc complex
[COG2857] Cytochrome c1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAC CATCGACCTA TCAGCCGCAG AGCCCCTTCA TGAAGTGGCT CGAGCAACGC 
CTGCCGATCG CCGGGCTGGT TCATTCGTCG TTCATTGCCT ACCCCACGCC GCGCAACCTG
AACTACTGGT GGACGTTCGG CGCCATTCTC TCGATGATGC TGGCGGTGCA GATCATCACC
GGCATCGTGC TGGCGATGCA CTACACGCCG CACGTCGACT TCGCCTTCGA CTCGGTCGAG
CGGATCGTTC GCGACGTCAA CTACGGCTGG CTGCTGCGCA ACACCCACGC GGCCGGCGCG
TCGATGTTCT TCATCGCGGT CTACATCCAC ATGTTCCGCG GCCTGTATTA CGGGTCGTAC
AAGGCGCCGC GTGAAGTGCT CTGGATCCTC GGCGTGATCA TCTACCTGCT TATGATGGCG
ACCGGCTTCA TGGGCTATGT GCTTCCCTGG GGCCAGATGA GCTTCTGGGG CGCCACCGTG
ATCACCAACC TGTTCTCGGC GGTCCCGTTC GTCGGCGACA GCATCGTGAC CTTGCTGTGG
GGCGGCTATT CGGTCGGCAA CCCGACCCTG AACCGGTTCT TCTCGCTGCA CTATCTGCTG
CCCTTCGTGA TTGCCGGCGT GGTCGTGCTG CACGTCTGGG CGTTGCACGT CACCGGTCAG
AACAACCCGA CCGGCGTCGA GCCGAAGACC GAGAAGGACA CGGTCGCGTT CACGCCCTAC
GCGACGATGA AGGACGTGTT CGGCATGTCC TGCTTCCTGC TGTTCTTTTC CTGGTTCATT
TTCTACATGC CGAACTATCT CGGTGAGGCC GACAACTACA TTCCGGCGAA TCCGGGCGTG
ACGCCGCCGC ATATCGTTCC GGAATGGTAC TACCTGCCGT TCTACGCGAT CCTGCGGTCG
ATCCCGAACA AGCTGATGGG CGTCGTGGCG ATGTTCGGCG CCATCATCGT GCTGCTGTTC
CTGCCCTGGC TCGACAGCGC CAAGGTGCGC TCGTCGCGCT ACCGGCCGCT GGCGAAGCGG
TTCTTCTGGG GCTTCGTGGT GGTCTGCATC ATGCTCGGAT GGCTCGGCTC GAAGCCGGCG
GAGGGCATCT ACACGGTCCT CGCCCGCGTC TTCACCTTCG CCTATTTCGC CTACTTCCTG
ATCGTGTTGC CGCTACTGTC CAGGGTCGAG AAGACGCTGC CGCTGCCGAA CTCGATCTCG
GAGGACGTGC TGAGCAAGGG CAAGACGGCG GGAGCAACCG CGGCGAGCCT GCTCGCCCTG
GTGATGGCCG GGACGCTGAT GTTCGGCGGG GTGCAGAGCG CCAAGGCGGC GGAAGGCGGC
GAGAGTCCGC CGTCGCTGGA GTGGAGCTTT GCCGGCCCGT TCGGCACCTA CGATCGCCCC
CAATTGCAGC GCGGCTTCAA GATCTACAAG GAGGTGTGCT CCGCCTGTCA CTCGCTGAAG
TTGCTGCAGT ATCGCAACCT CGCCGAGCCG GGTGGACCGG GCTTCACGAT CGAACAGGCC
AAGGCGATCG CCGCCGAAGC CTCGATCAAG GACGGCCCGA ACGACGCCGG CGAAATGTTC
GAACGCCCCG GCCGGCTCGC CGACACCTTC CATTCGCCGT TCCCGAACGA GCAGGCGGCG
CGCTCGGCCA ATGGCGGTGC GGTTCCGCCG GACATGTCGC TGCTCGCCAA GGCGCGTTCC
TATCCGCGTG GCTTCCCGCA GTTCGTGTTC GACTTCTTCA CCCAGTTCCA GGAGCAGGGC
CCGAACTACA TCGATGCGCT GCTTCAGGGT TATCAGGACA CGCCGCCGGA GGGCTTCACG
CTGCCGGACG GGGCTTACTA CAACAAGTGG TATCCGGGCC ATTCGATCAA AATGCCGCCG
CCGATTTCGG ACGGTCAGGT GAGCTTCGAC GACGGCAGCC CGGAGACCGT GCCGCAATAT
GCCAAGGACG TCACGGCTTT CCTGATGTGG GCTGCCGAGC CGCATCTCGA AGCCCGCAAG
CGCCTCGGTC TGCAGGTCAT GATCTTCCTG ATCATCCTCA GCGGCCTGCT GTACTTCACC
AAGCGCAAGA TCTGGTCGAA CGTGCACTGA
 
Protein sequence
MSGPSTYQPQ SPFMKWLEQR LPIAGLVHSS FIAYPTPRNL NYWWTFGAIL SMMLAVQIIT 
GIVLAMHYTP HVDFAFDSVE RIVRDVNYGW LLRNTHAAGA SMFFIAVYIH MFRGLYYGSY
KAPREVLWIL GVIIYLLMMA TGFMGYVLPW GQMSFWGATV ITNLFSAVPF VGDSIVTLLW
GGYSVGNPTL NRFFSLHYLL PFVIAGVVVL HVWALHVTGQ NNPTGVEPKT EKDTVAFTPY
ATMKDVFGMS CFLLFFSWFI FYMPNYLGEA DNYIPANPGV TPPHIVPEWY YLPFYAILRS
IPNKLMGVVA MFGAIIVLLF LPWLDSAKVR SSRYRPLAKR FFWGFVVVCI MLGWLGSKPA
EGIYTVLARV FTFAYFAYFL IVLPLLSRVE KTLPLPNSIS EDVLSKGKTA GATAASLLAL
VMAGTLMFGG VQSAKAAEGG ESPPSLEWSF AGPFGTYDRP QLQRGFKIYK EVCSACHSLK
LLQYRNLAEP GGPGFTIEQA KAIAAEASIK DGPNDAGEMF ERPGRLADTF HSPFPNEQAA
RSANGGAVPP DMSLLAKARS YPRGFPQFVF DFFTQFQEQG PNYIDALLQG YQDTPPEGFT
LPDGAYYNKW YPGHSIKMPP PISDGQVSFD DGSPETVPQY AKDVTAFLMW AAEPHLEARK
RLGLQVMIFL IILSGLLYFT KRKIWSNVH