Gene RPC_0848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPC_0848 
Symbol 
ID3969845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB18 
KingdomBacteria 
Replicon accessionNC_007925 
Strand
Start bp934436 
End bp937429 
Gene Length2994 bp 
Protein Length997 aa 
Translation table11 
GC content69% 
IMG OID637923964 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_530737 
Protein GI90422367 
COG category[C] Energy production and conversion
[S] Function unknown 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG3427] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.682595 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCAAC CCGCAGCAGC GGCAACGGAG ATCGATGTCG ACCGTCCATG GGTCGGCCGT 
TCGATCGAGC GCGTCGAAGA CGGCGCCCTG CTCACCGGGC GCGGCCGGTT CATCGATGAT
CTCGGAACCC GGCCCGGCAC GCTCTACGCC GCGATCCTGC GCTCGCCGCA CGCCCACGCC
GACATCGTCG CGATCCGCAC CGAGGCCGCA AAGCAGGCCG CGGGCGTCGT CGCGGTGCTC
ACCGGCGAAG ATATCACCGC GCTGACCTCG AGCCTCGTGG TCGGCGTCAA GGCGCCGGTG
CAATGCTGGC CGATCGCGGT CGGTCGCGTG CGCTACGTCG GCGAGGCGGT GGCGATCGTG
GTTGCGACCG ATCGCTACGT CGCCGAGGAC GGCGTCGATC TGATCGAGGT CGACTACCAG
GTGCGTGCGG CGGTGATCGA TCCGCTTGCC GCGCTGTCGG CCGACGCGCC GGTGCTGCAC
GACGGCTTTG CCGGCAACGT CGCCAGCGAT CGCAGCTTCC GCTACGGCGA TCCGGAACGC
GCCTTCGCCG AGGCGCCGCA TCGCATTTCC ATCGCCATCA AGTACCCGCG CAATTCCTGC
ACCCCGATCG AAACCTACGG CGTGGTCGCC GACTACGATG CCGCGGAAGA CGCCTACGAC
GTGCTGGCCA ATTTCCAGGG GCCGTTCAGC ATCCACGCGG TGATCTCGCG CGCCCTGAAG
GTGCCGGGCA ATCGGCTGCG GCTGCGCACC CCGCCGGACT CCGGCGGCAG TTTTGGCATC
AAGCAGGGAA TTTTCCCGTA CATCGTGCTG ATCGCTGCGG CCTCCCGTGT GGTCGGCCGC
CCGGTGAAGT GGATCGAGGA CCGGCTCGAA CATCTCACCG CCTCGGTGTC GGCGACCAAC
CGCGCCACCT GCATCGCCGC CGCCGTCGCC GCCGACGGCA AGATCATGGC GCTGGACTGG
GATCAGGTCG AGGATTGCGG CGCGCATTTG CGCGCGCCGG AGCCGGCGAC GTTGTACCGG
ATGCACGGCA ATCTCACCGG CGCCTATGCG ATCGACAACG TCGCGGTGCG CAACCGCGTC
GTCGTCACCA ACAAGACCCC GACCGGGCTC AACCGCGGCT TCGGCGGCCC GCAGATGTAT
TTCGCGCTAG AGAGGCTGCT GCAGCGCATC GCGGTCGAAC TCGAGCTCGA TCCGCTCGAC
GTGATCAAAC GCAATCTGGT TCCGGCCGGA TCGTTTCCCT ATCGCACCGC GACCGGCGCG
TTGCTGGACT CCGGCGACTA CCAAGAGGCG ATCGCGCGCG GCGTCGACGG CGGCGGGCTC
GCCGCGTTGA AGGCGCGGCG CGATGCGGCG CGCGCTGAGG GCCGGCTCTA CGGCATCGGC
TACACCGCGG TGGTCGAGCC CAGCGTCTCC AACATGGGCT ACATCACCAC GGTGTTGACC
GCGGCGGAGC GCCGCAAGGC CGGGCCGAAG AACGGCGCGC AGGCGACCGC CACCGTGGCG
CTCGATCCGG TCGGCGGCAT CACCGTGCAC GTCGCTTCGG TGCCGCAAGG CCAGGGCCAT
CGCACCGTGC TGTCGCAGGT GGTCGCCGAC GTGTTCGGCG TTGCGCCCAC CGATGTCCGC
GTCAACACCG AGATCGACAC CGCGAAGGAC GCCTGGTCGA TCGCATCCGG AAACTACGCG
AGCCGCTTCG CCGCCGCGGT GGCCGGCACC GCCAAGCTCG CCGCGGGTCG GCTGGCGGGA
CGGCTGGCGC GCGTTGCGGC GAGTCAATTG AACATCGACG TCGCCGACGT GGTGTTCCGC
GGCGGTCGGG TCGGCTCCAA GTCCAACCCC GACAACAGCA TTGCGTTCAC GCGGCTCGCC
GCGCTGAGCC ATTGGTCGCC GGGCTCGTTG CCGGACGATA TCGGCAACAC GCTGCGCGAA
ACAGTGTTCT GGACGCCGCC GGAGCTGGCG GCGCCGGACG ACGCCGACCG GGTGAACTCC
TCGCTGTGCC ACGGCTTCAT CTTCGATTTC TGCGGCGTCG AGATCGATCC GGTCACGCTG
GAAGCTAAGA TCGATCGCTA CGTCACCATG CACGATTGCG GCACCATCCT GCATCCCGGC
ATGGTCGACG GCCAGATCCG CGGCGGCTTC GCGCAGGCGA TCGGCGCCGC GCTGTACGAG
GAATACGCCT ACGCGCCGGA CGGCAGCTTC CTCACCGGCA CGCTCGCCGA TTACTTGCTG
CCGACCACCA TGGAAGTGCC GGAGCCTAAG ATCCTGCACA TGGAGACGCC GTCGCCGTTC
ACCCCGCTCG GCGCCAAAGG CGTCGGCGAA GGCAATTGCA TGTCGACGCC GGTGTGCGTC
GCCAACGCAG TCGCCGACGC GCTGGGCATC AAGGACATCA CCCTGCCGCT GGTGCCGGCG
CGGTTGGCGC AGTTTTTACG CGGAGATGAG CGCGCGGCGC CGGCCGGCGG CCGTGCGCCG
GCACCACCAC GCGCCGGCGG CACCGATCGT AAGCTGCGCG GCGAGGGGAG CGCGTCGGTC
GGCGCGCCGC CGCAACAGGT CTGGACGATG CTGCTCGATC CGGAGACGCT GAAGACGGTG
ATCCCCGGTT GCGAGCGGGT CGAGAAAATC TCCGATACGC ATTTCCGCGC CGAGGTGACG
CTCGGCATTG GCCCGGTGAC CGGGCGCTAT CGGGCCGACG TCAAACTCTC CGATCTCGAT
CCGCCGCGCG CGGTGACGCT CGGCGGCACC GCCGAGGGCG CGCTCGGCTT CGGCGGCGGC
GAGGGCCGCA TCACGCTTGC GCCTGATAGC AACGGCGGCA CCACGATGAC TTACGTCTAT
GAGGCGGCGA TCGGCGGCAA GGTCGCCAGC ATCGGCGGAC GCCTGCTCGA CGGCGCGACG
CGCGTCATCA TCGGCCGGTT CTTCACCGCT CTCGCCGCCA CCGCCGGCGG CAAGCCGGTG
CCGAGCGACT CCTGGCTGAC GCGGCTGCTG CGACTCGTGG GGTGGTCGCG ATGA
 
Protein sequence
MAQPAAAATE IDVDRPWVGR SIERVEDGAL LTGRGRFIDD LGTRPGTLYA AILRSPHAHA 
DIVAIRTEAA KQAAGVVAVL TGEDITALTS SLVVGVKAPV QCWPIAVGRV RYVGEAVAIV
VATDRYVAED GVDLIEVDYQ VRAAVIDPLA ALSADAPVLH DGFAGNVASD RSFRYGDPER
AFAEAPHRIS IAIKYPRNSC TPIETYGVVA DYDAAEDAYD VLANFQGPFS IHAVISRALK
VPGNRLRLRT PPDSGGSFGI KQGIFPYIVL IAAASRVVGR PVKWIEDRLE HLTASVSATN
RATCIAAAVA ADGKIMALDW DQVEDCGAHL RAPEPATLYR MHGNLTGAYA IDNVAVRNRV
VVTNKTPTGL NRGFGGPQMY FALERLLQRI AVELELDPLD VIKRNLVPAG SFPYRTATGA
LLDSGDYQEA IARGVDGGGL AALKARRDAA RAEGRLYGIG YTAVVEPSVS NMGYITTVLT
AAERRKAGPK NGAQATATVA LDPVGGITVH VASVPQGQGH RTVLSQVVAD VFGVAPTDVR
VNTEIDTAKD AWSIASGNYA SRFAAAVAGT AKLAAGRLAG RLARVAASQL NIDVADVVFR
GGRVGSKSNP DNSIAFTRLA ALSHWSPGSL PDDIGNTLRE TVFWTPPELA APDDADRVNS
SLCHGFIFDF CGVEIDPVTL EAKIDRYVTM HDCGTILHPG MVDGQIRGGF AQAIGAALYE
EYAYAPDGSF LTGTLADYLL PTTMEVPEPK ILHMETPSPF TPLGAKGVGE GNCMSTPVCV
ANAVADALGI KDITLPLVPA RLAQFLRGDE RAAPAGGRAP APPRAGGTDR KLRGEGSASV
GAPPQQVWTM LLDPETLKTV IPGCERVEKI SDTHFRAEVT LGIGPVTGRY RADVKLSDLD
PPRAVTLGGT AEGALGFGGG EGRITLAPDS NGGTTMTYVY EAAIGGKVAS IGGRLLDGAT
RVIIGRFFTA LAATAGGKPV PSDSWLTRLL RLVGWSR