Gene RPC_4788 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4788 
Symbol 
ID3972991 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5349195 
End bp5350817 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content62% 
IMG OID637927900 
Productcytochrome-c oxidase 
Protein accessionYP_534629 
Protein GI90426259 
COG category[C] Energy production and conversion 
COG ID[COG0843] Heme/copper-type cytochrome/quinol oxidases, subunit 1 
TIGRFAM ID[TIGR02891] cytochrome c oxidase, subunit I 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTAATGG AAACTGCACG GGTTGCTCAC TCCGACCACG CCGACGAACA CGCCCACGCT 
CATCCGACCG GCTGGCGGCG CTACGTCTAT TCCACCAACC ACAAAGACAT CGGCACGATG
TATCTGATCT TCGCGATCGT CGCCGGGCTG ATCGGCGGCG CGATGTCGAT CGCGATCCGC
ATCGAGCTGA TGTATCCCGG CGTGCAGATC TTCCACGAAA GCCACACCTA CAACGTGTTC
GTCACCTCGC ACGGCCTGAT CATGATCTTC TTCATGGTGA TGCCGGCGAT GATCGGCGGC
TTCGGCAACT GGTTCGTGCC GCTGATGATC GGCGCGCCCG ACATGGCGTT TCCGCGGATG
AACAACGTCT CGTTCTGGCT GTTGCCGGCC TCCTTCGCGC TGCTGCTGAC CTCGACCTTC
GTCGAGGGCG AGCCGTCGTC GAACGGCGTC GGCGCCGGCT GGACGATGTA TGCGCCGCTG
TCGACCTCCG GCCATCCCGG GCCGGCGGTG GACTTCGCGA TCCTGGCGCT GCATCTCGCC
GGCGCGTCCT CGATCCTCGG CGCGATCAAC TTCATCACCA CGATCTTCAA CATGCGCGCG
CCCGGCATGA CGCTGCACAA GATGCCGCTG TTCGTGTGGT CGATCCTGGT CACGGTGTTC
CTGCTGCTGC TGGCGCTGCC GGTGTTGGCC GGTGCCATCA CCATGCTGCT CACCGACCGC
AATTTCGGCA CCACGTTCTT CTCCGCGGAA GGCGGCGGCG ATCCGGTGCT GTTCCAGCAT
CTGTTCTGGT TCTTCGGCCA CCCCGAAGTC TACATCCTGA TTTTGCCGGG CTTCGGCATG
ATCAGCCAGA TCGTCTCGAC CTTCTCGAAA AAGCCGGTGT TCGGCTATCT GGGCATGGCC
TACGCCATGG TGGCGATCGG CGTGATCGGC TTCGTGGTCT GGGCGCATCA CATGTACACG
GTGGGCATGT CGAGCGCGAC GCAGGCCTAC TTCGTCGCCG CCACCATGGT GATCGCGGTG
CCGACCGGGG TGAAGATCTT CTCCTGGATC GCCACGATGT GGGGCGGCTC GATCGAATTC
CGCACGCCGA TGCTGTGGGC GATCGGCTTC ATCTTCCTGT TCACGGTCGG CGGCGTCACC
GGCGTGGTGC TGGCCAATGC CGGCGTCGAT CGCGTGCTGC AAGACACCTA TTACGTGGTG
GCGCATTTCC ACTACGTGCT GTCGCTGGGC GCGGTGTTCG CGATCTTCGC CGGCTGGTAC
TACTGGTTCC CGAAGATGAC AGGCTACATG TATAACGAGA CCATCGGCAA GCTGCACTTC
TGGGTCACCT TCATCGGCGT CAATCTGGTG TTCTTCCCGC AGCATTTCCT CGGCCTGTCC
GGCATGCCGC GGCGCTACGT CGACTATCCA GACGCCTTCG CCGGCTGGAA TCTGGTGTCC
TCGATCGGCT CCTACATCTC CGGCTTCGCG GTGCTGATCT TCCTCTACGG CATGGTGCTG
GCGTTCATCA AAAAGGAAAA GGCCGCCGAC AATCCGTGGG GGCCGGGCGC CACCACGCTG
GAATGGACCT TGTCATCGCC GCCGCCGTTC CATCAGTTCG AAATTCTGCC GCGGGTTCGC
TGA
 
Protein sequence
MVMETARVAH SDHADEHAHA HPTGWRRYVY STNHKDIGTM YLIFAIVAGL IGGAMSIAIR 
IELMYPGVQI FHESHTYNVF VTSHGLIMIF FMVMPAMIGG FGNWFVPLMI GAPDMAFPRM
NNVSFWLLPA SFALLLTSTF VEGEPSSNGV GAGWTMYAPL STSGHPGPAV DFAILALHLA
GASSILGAIN FITTIFNMRA PGMTLHKMPL FVWSILVTVF LLLLALPVLA GAITMLLTDR
NFGTTFFSAE GGGDPVLFQH LFWFFGHPEV YILILPGFGM ISQIVSTFSK KPVFGYLGMA
YAMVAIGVIG FVVWAHHMYT VGMSSATQAY FVAATMVIAV PTGVKIFSWI ATMWGGSIEF
RTPMLWAIGF IFLFTVGGVT GVVLANAGVD RVLQDTYYVV AHFHYVLSLG AVFAIFAGWY
YWFPKMTGYM YNETIGKLHF WVTFIGVNLV FFPQHFLGLS GMPRRYVDYP DAFAGWNLVS
SIGSYISGFA VLIFLYGMVL AFIKKEKAAD NPWGPGATTL EWTLSSPPPF HQFEILPRVR