Gene RPB_3198 details

Gene Information

Locus tag: RPB_3198
Symbol:
ID: 3910999
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhodopseudomonas palustris HaA2
Kingdom: Bacteria
Replicon accession: NC_007778
Strand:
Start bp: 3656894
End bp: 3659278
Gene Length: 2385 bp
Protein Length: 794 aa
Translation table: 11
GC content: 69%
IMG OID: 637885100
Product: carbon-monoxide dehydrogenase
Protein accession: YP_486805
Protein GI: 86750309
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
TIGRFAM ID: [TIGR02416] carbon-monoxide dehydrogenase, large subunit
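
The start, end, gene-length and protein-length values in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. Both assumptions agree with the numbers listed here, but the record itself does not state them.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(3656894, 3659278)  # Start bp / End bp from the record
    print(length, "bp")
    print(protein_length(length), "aa")
```

With the values from this record, the computed lengths match the listed Gene Length (2385 bp) and Protein Length (794 aa).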


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.294023
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCTTCA TGCAGGACCG TCCTTCGAAC CTGTCGTCCG ACACCGCCAT CGCGCTGCAA 
AAATTCGGAA TTGGCCAGCC GGTGCGACGC AAGGAGGACG ACACGCTGCT GCGCGGCAAG
GGCCGCTATA CCGACGACTG CAACCTGCCC GGCCAGCTCA CCGCGGTGAT GGTGCGCAGC
CCGCACGCTC ACGGCATCAT CCGCGGCATC GACGCCGAAG CGGCGCGGGC GATGCCCGGC
GTCGTCGGCG TCTACACCGG CGCCGATCTC GCCGCGGCCG GCTATGCCCC GTTCAGCTGC
GGGCTGCCGA TGAAGAGCCG CGACGGCACG CCGCTGCTGC AGACCAACCG CCCGGCGCTG
GCGACCGACA AGGTGCGCTT CGTCGGCGAT CCGGTGGCGT TCGTGGTCGC CGAGACGGCG
ATTCAGGCGC GCGACGCCGC CGAATCGGTC GCGCTCGACA TCGCGCCGCT GCCGGCGGTG
ACCGACGCCG ACGACGCGAT CAAGCCCGGC GCGCCGCAGC TCTACGATCA CATCCCGAAC
AACATCGCGC TCGACTATCA CTTCGGCGAC GCCGCGGCCG TCGAGGCCGC CTTCGCCTCC
GCCGCGCATG TCACCACGCT CGACATCGAG AACACCCGGG TCGCCGCGGT GCCGATGGAG
CCGCGCACCG GGCTCGCCAG TTACGACCGG CAGAACGGCC GCTATACCAT CCAGCTCCCG
ACCCAGGGCG TCGCCGGCAA CCGCAACACG CTGGCGAAGC TGCTCGGCGT GCCGACCGAC
AAGGTGCGGG TGCTGACCGG CCAGGTCGGC GGCTCGTTCG GGATGAAGAA CATCTCCTAT
CCCGAATACA TCTGCATCCT GCACGCGGCG AAAGCGCTCG GCCGGCCGGT GAAGTGGACC
GACGAACGCT CGTCGGCGTT CCTGTCCGAC AGCCACGGCC GCGGCCAGCA GATCCGCGCC
GAGCTGGCGC TCGATGCCGC CGGCAAGTTT CTGGCGATCC GCCTCAGTGG CACCGGCAAT
CTCGGCGCCT ACATCACCGG CGTGGCGCCG CTGCCGCTGT CGCTCAACAC CGGCAAGAAC
ATCGGCAGCG TGTATCGCAC GCCGCTGCTC GGCGTCGACA TCAAATGCGT CGTCACCAAC
GTCACGCTGA TGGGCGCCTA TCGCGGCGCC GGCCGGCCCG AGGCGAACTA CTTCCTGGAG
CGGCTGATCG ATCGCGCCGC CGACGAGATC GGCATCGACC GCCTCGCCTT GCGCAAGCGC
AACTTCATCA AGCCGCAGCA ATTGCCGTTC ACCGCCTGCT CGGGCGTCAC CTATGACAGC
GGCGATTTCG GCGGCGTGTT CGCGCAGGCG CTGGAGCTGT CGGACCATGC CGGCTTCGCC
CAACGCAAGA AGGAGAGCCG CAAGCGCGGC AAACTGCGCG GCATCGCGGT CGGCTCCTAT
CTCGAAGTCA CCGCCCCGCC GAGCGCCGAA CTCGGCAAGA TCGTGTTCGA GGAAGACGGC
ACAATTCGGC TGATCACCGG CACGCTCGAC TACGGCCAGG GCCACGCCAC GCCGTTCGCG
CAGGTGCTGA GTACGTATCT CGGCGTGCCG TTCGACCGCA TCCGGCTCGA ACAGGGCGAC
AGCGACGTCG TCCACACCGG CAACGGCACC GGCGGCTCGC GCTCGATCAC CGCCAGCGGC
ATGGCGATCG TCGAGGCGTC GCAGCAAGTG ATCGCCAAGG GCAAGGCCGC GGCGTCGCAT
CTCTTGGAGA CCGCGGAGGC CGACATCGAA TTCGCCGATG GTCGCTTCAC CGTGGCGGGC
ACCGATCGCA GCATCGGCAT CATGGAGCTG GCGCAGCGGC TGCGCGAGGC GAAACTCCCC
GACGGCGTGC CGGCGTCGCT CGACGTCGAT CACACCGTCA AGGCGGTCCC CTCCGCCTTC
CCCAATGGCT GCCACGTCGC CGAGGTCGAG ATCGATCCCG ACACCGGCGT CACCCGCGTG
GTGCGCTACA CCGCGGTCAA TGATTTCGGC GTCGTGGTCA ATCCGATGAT CGTCGCAGGC
CAGTTGCACG GCGGCGTCGC GCAAGGCATC GGCCAGGCGC TGATGGAGAA GATGTCCTAT
GACGGCGACG GCCAGCCGAT CACCGGCTCG CTGCAGGACT ACGCGCTGCC GCGCGCCGAG
GACATTCCGC CGATGGCGGT CGGCGATCAC CCCGTGCCTG CGCCCGGCAA TCCGCTCGGC
ACCAAGGGCT GCGGCGAAGC CGGCTGCGCC GGCTCGCTGG CGAGCGTCGT CAATGCCGTG
CTCGACGCGC TGAAAGACCA CGGCGTCAAA TCCCTCGACA TGCCGCTGAC CTCGGAGAAG
GTCTGGCGCG CGATCCGGGA GGCGAAGGAG ACGGCGGCGG CGTGA
 
Protein sequence
MSFMQDRPSN LSSDTAIALQ KFGIGQPVRR KEDDTLLRGK GRYTDDCNLP GQLTAVMVRS 
PHAHGIIRGI DAEAARAMPG VVGVYTGADL AAAGYAPFSC GLPMKSRDGT PLLQTNRPAL
ATDKVRFVGD PVAFVVAETA IQARDAAESV ALDIAPLPAV TDADDAIKPG APQLYDHIPN
NIALDYHFGD AAAVEAAFAS AAHVTTLDIE NTRVAAVPME PRTGLASYDR QNGRYTIQLP
TQGVAGNRNT LAKLLGVPTD KVRVLTGQVG GSFGMKNISY PEYICILHAA KALGRPVKWT
DERSSAFLSD SHGRGQQIRA ELALDAAGKF LAIRLSGTGN LGAYITGVAP LPLSLNTGKN
IGSVYRTPLL GVDIKCVVTN VTLMGAYRGA GRPEANYFLE RLIDRAADEI GIDRLALRKR
NFIKPQQLPF TACSGVTYDS GDFGGVFAQA LELSDHAGFA QRKKESRKRG KLRGIAVGSY
LEVTAPPSAE LGKIVFEEDG TIRLITGTLD YGQGHATPFA QVLSTYLGVP FDRIRLEQGD
SDVVHTGNGT GGSRSITASG MAIVEASQQV IAKGKAAASH LLETAEADIE FADGRFTVAG
TDRSIGIMEL AQRLREAKLP DGVPASLDVD HTVKAVPSAF PNGCHVAEVE IDPDTGVTRV
VRYTAVNDFG VVVNPMIVAG QLHGGVAQGI GQALMEKMSY DGDGQPITGS LQDYALPRAE
DIPPMAVGDH PVPAPGNPLG TKGCGEAGCA GSLASVVNAV LDALKDHGVK SLDMPLTSEK
VWRAIREAKE TAAA
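
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch, not part of the original record, of how the blocked gene sequence could be cleaned, its GC fraction computed, and its reading frame translated. All function names are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares; table 11 differs in the set of permitted start codons, which does not matter here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignment, also used by table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked listing and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Codons containing characters other than A, C, G, T become 'X'.
    """
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First line of the gene sequence as printed in this record.
    first_line = "ATGAGCTTCA TGCAGGACCG TCCTTCGAAC CTGTCGTCCG ACACCGCCAT CGCGCTGCAA"
    dna = clean_sequence(first_line)
    print(translate(dna))
    print(round(gc_fraction(dna), 2))
```

Run on the first line of the gene sequence, the translation reproduces the first twenty residues of the protein sequence (MSFMQDRPSN LSSDTAIALQ). Applying the same functions to the full listing would allow the reported GC content and protein length to be checked in the same way.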