Gene RPD_2257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_2257 
Symbol 
ID4022742 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp2519972 
End bp2522347 
Gene Length2376 bp 
Protein Length791 aa 
Translation table11 
GC content67% 
IMG OID637962452 
Productcarbon-monoxide dehydrogenase 
Protein accessionYP_569393 
Protein GI91976734 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID[TIGR02416] carbon-monoxide dehydrogenase, large subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.336955 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGACC ATACCTCGAA CGCGTCGCTC GACACCGCTA TCGCACTGCA AAAATTCGGC 
GTCGGCCAGC CGGTCCGGCG CAAGGAAGAC GACACGCTGG TTCGCGGCAA GGGACACTAC
ACCGACGATT TCAACCTGCC CGGCCAGCTG TTTGCGGTGA TGGTCCGCAG CCCGCACGCC
CATGGGGTGC TTCGCGGCAT CAACGCCGAG GCGGCGCGCG GCATGGCGGG CGTCCGCGGC
GTGTTCACCG GCGCGGACCT CGCCAGCGCC AATTACGCCC CGTTCACCTG CGGCCTGCCG
CTCAAAAATC GTGACGGCAC GCCGCTGCAT CAGACCAACC GCCACGCGCT GGCCACCGAC
AAGGTCCGCT TCGTCGGCGA TCCGGTGGCG TGCGTCGTTG CCGACACTTT GGCGCAGGCG
CGTGACGCCG CAGAGGCCGT CGAGCTCGAC ATCGAGCCGC TGCCCGCCGT CACCGAGGCC
GACGAAGCGA CCAAGCCCGG CGCGCCGCAG CTCTACGATC ACATCGAAAA CAACGTCGCG
CTCGATTATC ACTTCGGCGA CGCCGAGGCC GTGAACGCCG CCTTCGCTTC GGCCGCGCAT
GTCACCAGGC TCGACATCGA GAACAGCCGC GTCGCGGTGG TGTCGATGGA GCCGCGCGCA
GGCCTCGCCT GCTACGACAG GAACGACGGC CGCTACACCA TTCAGGTGCC GACCCAGGGC
GTCGCCGGTA ACCGCAACGG CCTCGCCAAA TTGCTCGGCG TGCCCAACGA CAAGGTCCGC
CTGCTCACAG GCCATGTCGG CGGCTCATTC GGGATGAAGA ACATCAACTA TCCCGAATAT
ATCTGCATCC TGCACGCGGC GAAGGCGCTC GGACGTCCGG TGAAGTGGAC CGACGAGCGC
TCGACCGCGT TCCTGTCCGA CAGCCACGGC CGCGGCCAGC AGATCCACGC CGAGCTTGCG
CTCGACGCCG CCGGCAAGTT CCTGGCGATC CGCATCTCCG GCACCGGCAA TCTCGGCGCC
TACATCACCG GCGTTGCGCC GCTGCCGCTG TCGCTCAACA TCGGCAAGAA TATCGGCAGC
GTCTATCGCA CGCCGCTGCT ATCGGTCGAC ATCAAATGCG TGCTCACCAA CGTCACGCTG
ATGGGCGCCT ATCGCGGCGC CGGGCGTCCC GAGGCCAATT ACTATCTCGA ACGACTGATC
GACCGCGCCG CCGATGAAAT GAGCATCGAC CGGCTGACAC TGCGCAAACG CAACTTCATC
AAGCCCTCGC AGATGCCGTT CAAGGCCTGT TCGGGCGTCA CCTATGATTG CGGCGACTTC
GCCGGCGTGT TCGCTCACGC GCTGGAGCTG GCGGACTACG CCGGCTTTTC GAAGCGCAAG
AAGGACAGCC GCAAGCGCGG CAGGCTGCGC GGCATCGCGG TCGGCTCCTA TCTCGAAGTC
ACCGCGCCGC CGAACAGCGA ACTCGGCAAG ATCGTGTTCG AAGCCGATGG CCGCGTGAGG
CTGATCACCG GCACGCTCGA CTACGGCCAG GGCCACGCCA CGCCGTTCGC GCAGGTGATG
AGCGCCCAAC TGGGCGTGCC GTTCGAGAGC GTCACGCTCG AACAGGGCGA CAGCGATCTC
GTCCACACCG GCAACGGCAC CGGCGGCTCG CGCTCGATCA CCGCGAGCGG CATGGCGATC
GTCGAGGCCG CCGCGCAGGT CATCGCCAAG GGCAAGGCCG CGGCGTCACA TCTGCTGGAA
ACCTCGGAAG CCGATATCGA ATTTTCGGGC GGGCGATTCA CCGTCGCCGG CACCGACCGC
AGCATCGGCA TCATCGAGCT GGCGCAGCGG CTGCGCGACA GCAAGATGCC CGACGGCGCG
CCGTCGTCGC TCGACGTCGA CCACACCGTC AAGGAAGTGC CGTCGACCTT CCCGAACGGC
TGCCACGTCG CCGAGGTCGA GATCGATCCC GACACCGGCG TCACCCAGGT GGTGCGCTAC
GTCGCGGTCA ATGATTTCGG CGTCGTGGTC AATCCGATGA TCGTCGCGGG CCAGTTGCAC
GGCGGCGTGG CGCAGGGCAT CGGCCAGGCG CTGATGGAGA AAGTGAGCTA CGACGCCGAC
GGCCAGCCGA TCACCGGCTC GCTGCAGGAC TACGCCCTGC CGCGCGCCGA GGATATTCCG
GCGATGACGA TCGGCGACCA TCCGGTGCCG GCCACCAACA ATCCGCTCGG CACCAAGGGC
TGCGGCGAAG CCGGCTGCGC GGGTAGCATG TCCACCGTGA TCAATGCCGC GCTCGACGCC
CTGCGAGATT TCGGCGTGAC CCATCTCGAC ATGCCGCTGA CCTCCGAGAA GGTCTGGCGG
GCGATCAAGG ACGCCAAGGC GACGCAGGCG GCTTGA
 
Protein sequence
MQDHTSNASL DTAIALQKFG VGQPVRRKED DTLVRGKGHY TDDFNLPGQL FAVMVRSPHA 
HGVLRGINAE AARGMAGVRG VFTGADLASA NYAPFTCGLP LKNRDGTPLH QTNRHALATD
KVRFVGDPVA CVVADTLAQA RDAAEAVELD IEPLPAVTEA DEATKPGAPQ LYDHIENNVA
LDYHFGDAEA VNAAFASAAH VTRLDIENSR VAVVSMEPRA GLACYDRNDG RYTIQVPTQG
VAGNRNGLAK LLGVPNDKVR LLTGHVGGSF GMKNINYPEY ICILHAAKAL GRPVKWTDER
STAFLSDSHG RGQQIHAELA LDAAGKFLAI RISGTGNLGA YITGVAPLPL SLNIGKNIGS
VYRTPLLSVD IKCVLTNVTL MGAYRGAGRP EANYYLERLI DRAADEMSID RLTLRKRNFI
KPSQMPFKAC SGVTYDCGDF AGVFAHALEL ADYAGFSKRK KDSRKRGRLR GIAVGSYLEV
TAPPNSELGK IVFEADGRVR LITGTLDYGQ GHATPFAQVM SAQLGVPFES VTLEQGDSDL
VHTGNGTGGS RSITASGMAI VEAAAQVIAK GKAAASHLLE TSEADIEFSG GRFTVAGTDR
SIGIIELAQR LRDSKMPDGA PSSLDVDHTV KEVPSTFPNG CHVAEVEIDP DTGVTQVVRY
VAVNDFGVVV NPMIVAGQLH GGVAQGIGQA LMEKVSYDAD GQPITGSLQD YALPRAEDIP
AMTIGDHPVP ATNNPLGTKG CGEAGCAGSM STVINAALDA LRDFGVTHLD MPLTSEKVWR
AIKDAKATQA A