Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPD_2257 |
Symbol | |
ID | 4022742 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB5 |
Kingdom | Bacteria |
Replicon accession | NC_007958 |
Strand | - |
Start bp | 2519972 |
End bp | 2522347 |
Gene Length | 2376 bp |
Protein Length | 791 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637962452 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_569393 |
Protein GI | 91976734 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.336955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGGACC ATACCTCGAA CGCGTCGCTC GACACCGCTA TCGCACTGCA AAAATTCGGC GTCGGCCAGC CGGTCCGGCG CAAGGAAGAC GACACGCTGG TTCGCGGCAA GGGACACTAC ACCGACGATT TCAACCTGCC CGGCCAGCTG TTTGCGGTGA TGGTCCGCAG CCCGCACGCC CATGGGGTGC TTCGCGGCAT CAACGCCGAG GCGGCGCGCG GCATGGCGGG CGTCCGCGGC GTGTTCACCG GCGCGGACCT CGCCAGCGCC AATTACGCCC CGTTCACCTG CGGCCTGCCG CTCAAAAATC GTGACGGCAC GCCGCTGCAT CAGACCAACC GCCACGCGCT GGCCACCGAC AAGGTCCGCT TCGTCGGCGA TCCGGTGGCG TGCGTCGTTG CCGACACTTT GGCGCAGGCG CGTGACGCCG CAGAGGCCGT CGAGCTCGAC ATCGAGCCGC TGCCCGCCGT CACCGAGGCC GACGAAGCGA CCAAGCCCGG CGCGCCGCAG CTCTACGATC ACATCGAAAA CAACGTCGCG CTCGATTATC ACTTCGGCGA CGCCGAGGCC GTGAACGCCG CCTTCGCTTC GGCCGCGCAT GTCACCAGGC TCGACATCGA GAACAGCCGC GTCGCGGTGG TGTCGATGGA GCCGCGCGCA GGCCTCGCCT GCTACGACAG GAACGACGGC CGCTACACCA TTCAGGTGCC GACCCAGGGC GTCGCCGGTA ACCGCAACGG CCTCGCCAAA TTGCTCGGCG TGCCCAACGA CAAGGTCCGC CTGCTCACAG GCCATGTCGG CGGCTCATTC GGGATGAAGA ACATCAACTA TCCCGAATAT ATCTGCATCC TGCACGCGGC GAAGGCGCTC GGACGTCCGG TGAAGTGGAC CGACGAGCGC TCGACCGCGT TCCTGTCCGA CAGCCACGGC CGCGGCCAGC AGATCCACGC CGAGCTTGCG CTCGACGCCG CCGGCAAGTT CCTGGCGATC CGCATCTCCG GCACCGGCAA TCTCGGCGCC TACATCACCG GCGTTGCGCC GCTGCCGCTG TCGCTCAACA TCGGCAAGAA TATCGGCAGC GTCTATCGCA CGCCGCTGCT ATCGGTCGAC ATCAAATGCG TGCTCACCAA CGTCACGCTG ATGGGCGCCT ATCGCGGCGC CGGGCGTCCC GAGGCCAATT ACTATCTCGA ACGACTGATC GACCGCGCCG CCGATGAAAT GAGCATCGAC CGGCTGACAC TGCGCAAACG CAACTTCATC AAGCCCTCGC AGATGCCGTT CAAGGCCTGT TCGGGCGTCA CCTATGATTG CGGCGACTTC GCCGGCGTGT TCGCTCACGC GCTGGAGCTG GCGGACTACG CCGGCTTTTC GAAGCGCAAG AAGGACAGCC GCAAGCGCGG CAGGCTGCGC GGCATCGCGG TCGGCTCCTA TCTCGAAGTC ACCGCGCCGC CGAACAGCGA ACTCGGCAAG ATCGTGTTCG AAGCCGATGG CCGCGTGAGG CTGATCACCG GCACGCTCGA CTACGGCCAG GGCCACGCCA CGCCGTTCGC GCAGGTGATG AGCGCCCAAC TGGGCGTGCC GTTCGAGAGC GTCACGCTCG AACAGGGCGA CAGCGATCTC GTCCACACCG GCAACGGCAC CGGCGGCTCG CGCTCGATCA CCGCGAGCGG CATGGCGATC GTCGAGGCCG CCGCGCAGGT CATCGCCAAG GGCAAGGCCG CGGCGTCACA TCTGCTGGAA ACCTCGGAAG CCGATATCGA ATTTTCGGGC GGGCGATTCA CCGTCGCCGG CACCGACCGC AGCATCGGCA TCATCGAGCT GGCGCAGCGG CTGCGCGACA GCAAGATGCC CGACGGCGCG CCGTCGTCGC TCGACGTCGA CCACACCGTC AAGGAAGTGC CGTCGACCTT CCCGAACGGC TGCCACGTCG CCGAGGTCGA GATCGATCCC GACACCGGCG TCACCCAGGT GGTGCGCTAC GTCGCGGTCA ATGATTTCGG CGTCGTGGTC AATCCGATGA TCGTCGCGGG CCAGTTGCAC GGCGGCGTGG CGCAGGGCAT CGGCCAGGCG CTGATGGAGA AAGTGAGCTA CGACGCCGAC GGCCAGCCGA TCACCGGCTC GCTGCAGGAC TACGCCCTGC CGCGCGCCGA GGATATTCCG GCGATGACGA TCGGCGACCA TCCGGTGCCG GCCACCAACA ATCCGCTCGG CACCAAGGGC TGCGGCGAAG CCGGCTGCGC GGGTAGCATG TCCACCGTGA TCAATGCCGC GCTCGACGCC CTGCGAGATT TCGGCGTGAC CCATCTCGAC ATGCCGCTGA CCTCCGAGAA GGTCTGGCGG GCGATCAAGG ACGCCAAGGC GACGCAGGCG GCTTGA
|
Protein sequence | MQDHTSNASL DTAIALQKFG VGQPVRRKED DTLVRGKGHY TDDFNLPGQL FAVMVRSPHA HGVLRGINAE AARGMAGVRG VFTGADLASA NYAPFTCGLP LKNRDGTPLH QTNRHALATD KVRFVGDPVA CVVADTLAQA RDAAEAVELD IEPLPAVTEA DEATKPGAPQ LYDHIENNVA LDYHFGDAEA VNAAFASAAH VTRLDIENSR VAVVSMEPRA GLACYDRNDG RYTIQVPTQG VAGNRNGLAK LLGVPNDKVR LLTGHVGGSF GMKNINYPEY ICILHAAKAL GRPVKWTDER STAFLSDSHG RGQQIHAELA LDAAGKFLAI RISGTGNLGA YITGVAPLPL SLNIGKNIGS VYRTPLLSVD IKCVLTNVTL MGAYRGAGRP EANYYLERLI DRAADEMSID RLTLRKRNFI KPSQMPFKAC SGVTYDCGDF AGVFAHALEL ADYAGFSKRK KDSRKRGRLR GIAVGSYLEV TAPPNSELGK IVFEADGRVR LITGTLDYGQ GHATPFAQVM SAQLGVPFES VTLEQGDSDL VHTGNGTGGS RSITASGMAI VEAAAQVIAK GKAAASHLLE TSEADIEFSG GRFTVAGTDR SIGIIELAQR LRDSKMPDGA PSSLDVDHTV KEVPSTFPNG CHVAEVEIDP DTGVTQVVRY VAVNDFGVVV NPMIVAGQLH GGVAQGIGQA LMEKVSYDAD GQPITGSLQD YALPRAEDIP AMTIGDHPVP ATNNPLGTKG CGEAGCAGSM STVINAALDA LRDFGVTHLD MPLTSEKVWR AIKDAKATQA A
|
| |