Gene RPC_4498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_4498 
Symbol 
ID3972413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp5009272 
End bp5011263 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content66% 
IMG OID637927609 
Productcarbon-monoxide dehydrogenase, catalytic subunit 
Protein accessionYP_534340 
Protein GI90425970 
COG category[C] Energy production and conversion 
COG ID[COG1151] 6Fe-6S prismane cluster-containing protein 
TIGRFAM ID[TIGR01702] carbon-monoxide dehydrogenase, catalytic subunit 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.325797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCCTGCC TCGATTGTAA CACCTGCTCG TCCGATCCGG CCGCCGTCCA GATGCTGGCG 
GTGGCCGAAG CCAACAACAT CGGCACGGCA TGGAGCCGTC ATGCTCAGCA GCAGCCGCAA
TGCGGCTTCG GCACCACCGG GCTGTGTTGC CGGATCTGTC TGAAGGGACC GTGCCGGATC
GATCCGTTCG GCCAGGGCCC GCAATTCGGC ATCTGCGGCG CCGACCGCGA CACCATCGTC
GCGCGCCACC TGGTCCGCAT GATGGCCGCC GGCGCCGCGG CGCATTCCGA GCACGGCCGC
CACATCGCGC TGGCGATGCT GCATCTCAGC CAGGGCCACC TGCACGACTA TTCGATCCGC
GACGAAGAGA AATTGTTGGT GGTCGCCAAG CGGCTTGGCG TCGCGACCGA GGGCCGCGAT
CTGATGACCA TCGTGCGCGA ACTCGCCGAT CTCACGCTGC ACGATTTCCA GAATCAGAAC
TACGACGAGC CGTGCCATTG GCTGGCGGCG TCGCTGCCGG CGCCGCGGCT AAAGAAGCTC
GGCGATCTCG GACTGTTGCC GCACAATATC GATGCGGCGG TGGCGCAGTC GATGTCGCGC
ACCCATCTCG GCTGCGACGC CGATCCGACC AATTTGATCC TCGGCGGCTT GCGCGTGGCT
TTGGCCGACC TCGACGGCGA GATGCTGGCG ACCGAACTGT CGGACGCCTT GTTCGGCACG
CCGAAGCCGA TCGTCACCAC GTCGAATCTC GGTGTCATTA AGCGCGACGC CGTCAACGTC
GCGGTCAATG GCCACAACCC GCTGTTGTCC GACATCATCT GCGACATCGC CGCCGATCTG
AACGACGAAG CGATCGCCGC CGGCGCCAAG GACGGCATCA ACATCATCGG CATTTGCTGT
ACCGGCAACG AGGTGATGGT GCGCCACGGC ATTCCGCTCG CCACCAACTA TCTGTCGCAG
GAACTGCCGA TCCTGACCGG CGCGCTCGAC GCCATGGTGC TGGACGTGCA GTGCATCATG
CCGTCGCTTC CGCGGGTTGC GGAATGCTTC CACACCAAGA TCATCACCAC TGACAAGCAG
AACAAGATCG CCGGCGCAAC GCATATGGAT TTCCAGGAAG CCAAGGCCTC GGAAAACGCC
AAGTCGATCG TGCGGATGGC GATCGAGGCG TTCAAGCACC GCGATCCGCG GCGGATCCAG
ATCCCCAACA TCACCCAGCA GGCGATCGTC GGATTCAGCA CCGAGGCGAT CGTGGCGGCG
CTCGCCACCA TCAACGCCGC CGATCCGCTG CAGCCTTTGG TCGACAACAT CGTCAACGGC
AATATCCAGG GCGTGGTGCT GTTCGCCGGC TGCAACAACA CCAAGACCCA GCAGGACAGC
GCCTATATCG CGATCGCCCG CTCGCTGGCC AAACGTAACG TACTGGTGCT GGCGACCGGC
TGCGCCGCCG GCGCCTATGC CAAGGCCGGC ATGATGACGC AGGATGCCAC CCGGCAATAT
GCCGGCGAGG GCCTGAAGAG CGTGCTGACC GCGATCGGCG AATCCGCCGG CCTCGGCGGG
CCGCTGCCGC TGGTGCTGCA TATGGGCTCC TGCGTCGACA ACAGCCGCGC CGTGGCGCTG
GCCACCGCGC TCGCCAACAA GCTCGGCGTT GATATTTCCG ATCTGCCGCT GGTGGCCTCG
GCGCCGGAAG CGATGACCGA AAAGGCCGTG GTGATCGGCA GCTGGGCGGT CGCGCTCGGC
ATCCCGACCC ATCTCGGCAC GGTGCCGCCG ATCGTCGGCT CCGACGTCGT CAGCCAGCTC
GTCACCACCA CGGCGCGCGA CCTGCTTGGC GGCTACTTCA TCGTCGAGAC CGATCCGGAA
CTCGCCGCGG ACAAGATGTT CGCCGCGATC CAGGAGCGTC GCGACGGCCT CGGCATCCAG
ACCCTGGCGG TGGCGCCGAT CGCCGCGCTG GCCGCAGCGA AGCGCCCGGT GCTGCCTGCG
AGGTTGACAT GA
 
Protein sequence
MACLDCNTCS SDPAAVQMLA VAEANNIGTA WSRHAQQQPQ CGFGTTGLCC RICLKGPCRI 
DPFGQGPQFG ICGADRDTIV ARHLVRMMAA GAAAHSEHGR HIALAMLHLS QGHLHDYSIR
DEEKLLVVAK RLGVATEGRD LMTIVRELAD LTLHDFQNQN YDEPCHWLAA SLPAPRLKKL
GDLGLLPHNI DAAVAQSMSR THLGCDADPT NLILGGLRVA LADLDGEMLA TELSDALFGT
PKPIVTTSNL GVIKRDAVNV AVNGHNPLLS DIICDIAADL NDEAIAAGAK DGINIIGICC
TGNEVMVRHG IPLATNYLSQ ELPILTGALD AMVLDVQCIM PSLPRVAECF HTKIITTDKQ
NKIAGATHMD FQEAKASENA KSIVRMAIEA FKHRDPRRIQ IPNITQQAIV GFSTEAIVAA
LATINAADPL QPLVDNIVNG NIQGVVLFAG CNNTKTQQDS AYIAIARSLA KRNVLVLATG
CAAGAYAKAG MMTQDATRQY AGEGLKSVLT AIGESAGLGG PLPLVLHMGS CVDNSRAVAL
ATALANKLGV DISDLPLVAS APEAMTEKAV VIGSWAVALG IPTHLGTVPP IVGSDVVSQL
VTTTARDLLG GYFIVETDPE LAADKMFAAI QERRDGLGIQ TLAVAPIAAL AAAKRPVLPA
RLT