Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPB_3198 |
Symbol | |
ID | 3910999 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3656894 |
End bp | 3659278 |
Gene Length | 2385 bp |
Protein Length | 794 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637885100 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_486805 |
Protein GI | 86750309 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.294023 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTCA TGCAGGACCG TCCTTCGAAC CTGTCGTCCG ACACCGCCAT CGCGCTGCAA AAATTCGGAA TTGGCCAGCC GGTGCGACGC AAGGAGGACG ACACGCTGCT GCGCGGCAAG GGCCGCTATA CCGACGACTG CAACCTGCCC GGCCAGCTCA CCGCGGTGAT GGTGCGCAGC CCGCACGCTC ACGGCATCAT CCGCGGCATC GACGCCGAAG CGGCGCGGGC GATGCCCGGC GTCGTCGGCG TCTACACCGG CGCCGATCTC GCCGCGGCCG GCTATGCCCC GTTCAGCTGC GGGCTGCCGA TGAAGAGCCG CGACGGCACG CCGCTGCTGC AGACCAACCG CCCGGCGCTG GCGACCGACA AGGTGCGCTT CGTCGGCGAT CCGGTGGCGT TCGTGGTCGC CGAGACGGCG ATTCAGGCGC GCGACGCCGC CGAATCGGTC GCGCTCGACA TCGCGCCGCT GCCGGCGGTG ACCGACGCCG ACGACGCGAT CAAGCCCGGC GCGCCGCAGC TCTACGATCA CATCCCGAAC AACATCGCGC TCGACTATCA CTTCGGCGAC GCCGCGGCCG TCGAGGCCGC CTTCGCCTCC GCCGCGCATG TCACCACGCT CGACATCGAG AACACCCGGG TCGCCGCGGT GCCGATGGAG CCGCGCACCG GGCTCGCCAG TTACGACCGG CAGAACGGCC GCTATACCAT CCAGCTCCCG ACCCAGGGCG TCGCCGGCAA CCGCAACACG CTGGCGAAGC TGCTCGGCGT GCCGACCGAC AAGGTGCGGG TGCTGACCGG CCAGGTCGGC GGCTCGTTCG GGATGAAGAA CATCTCCTAT CCCGAATACA TCTGCATCCT GCACGCGGCG AAAGCGCTCG GCCGGCCGGT GAAGTGGACC GACGAACGCT CGTCGGCGTT CCTGTCCGAC AGCCACGGCC GCGGCCAGCA GATCCGCGCC GAGCTGGCGC TCGATGCCGC CGGCAAGTTT CTGGCGATCC GCCTCAGTGG CACCGGCAAT CTCGGCGCCT ACATCACCGG CGTGGCGCCG CTGCCGCTGT CGCTCAACAC CGGCAAGAAC ATCGGCAGCG TGTATCGCAC GCCGCTGCTC GGCGTCGACA TCAAATGCGT CGTCACCAAC GTCACGCTGA TGGGCGCCTA TCGCGGCGCC GGCCGGCCCG AGGCGAACTA CTTCCTGGAG CGGCTGATCG ATCGCGCCGC CGACGAGATC GGCATCGACC GCCTCGCCTT GCGCAAGCGC AACTTCATCA AGCCGCAGCA ATTGCCGTTC ACCGCCTGCT CGGGCGTCAC CTATGACAGC GGCGATTTCG GCGGCGTGTT CGCGCAGGCG CTGGAGCTGT CGGACCATGC CGGCTTCGCC CAACGCAAGA AGGAGAGCCG CAAGCGCGGC AAACTGCGCG GCATCGCGGT CGGCTCCTAT CTCGAAGTCA CCGCCCCGCC GAGCGCCGAA CTCGGCAAGA TCGTGTTCGA GGAAGACGGC ACAATTCGGC TGATCACCGG CACGCTCGAC TACGGCCAGG GCCACGCCAC GCCGTTCGCG CAGGTGCTGA GTACGTATCT CGGCGTGCCG TTCGACCGCA TCCGGCTCGA ACAGGGCGAC AGCGACGTCG TCCACACCGG CAACGGCACC GGCGGCTCGC GCTCGATCAC CGCCAGCGGC ATGGCGATCG TCGAGGCGTC GCAGCAAGTG ATCGCCAAGG GCAAGGCCGC GGCGTCGCAT CTCTTGGAGA CCGCGGAGGC CGACATCGAA TTCGCCGATG GTCGCTTCAC CGTGGCGGGC ACCGATCGCA GCATCGGCAT CATGGAGCTG GCGCAGCGGC TGCGCGAGGC GAAACTCCCC GACGGCGTGC CGGCGTCGCT CGACGTCGAT CACACCGTCA AGGCGGTCCC CTCCGCCTTC CCCAATGGCT GCCACGTCGC CGAGGTCGAG ATCGATCCCG ACACCGGCGT CACCCGCGTG GTGCGCTACA CCGCGGTCAA TGATTTCGGC GTCGTGGTCA ATCCGATGAT CGTCGCAGGC CAGTTGCACG GCGGCGTCGC GCAAGGCATC GGCCAGGCGC TGATGGAGAA GATGTCCTAT GACGGCGACG GCCAGCCGAT CACCGGCTCG CTGCAGGACT ACGCGCTGCC GCGCGCCGAG GACATTCCGC CGATGGCGGT CGGCGATCAC CCCGTGCCTG CGCCCGGCAA TCCGCTCGGC ACCAAGGGCT GCGGCGAAGC CGGCTGCGCC GGCTCGCTGG CGAGCGTCGT CAATGCCGTG CTCGACGCGC TGAAAGACCA CGGCGTCAAA TCCCTCGACA TGCCGCTGAC CTCGGAGAAG GTCTGGCGCG CGATCCGGGA GGCGAAGGAG ACGGCGGCGG CGTGA
|
Protein sequence | MSFMQDRPSN LSSDTAIALQ KFGIGQPVRR KEDDTLLRGK GRYTDDCNLP GQLTAVMVRS PHAHGIIRGI DAEAARAMPG VVGVYTGADL AAAGYAPFSC GLPMKSRDGT PLLQTNRPAL ATDKVRFVGD PVAFVVAETA IQARDAAESV ALDIAPLPAV TDADDAIKPG APQLYDHIPN NIALDYHFGD AAAVEAAFAS AAHVTTLDIE NTRVAAVPME PRTGLASYDR QNGRYTIQLP TQGVAGNRNT LAKLLGVPTD KVRVLTGQVG GSFGMKNISY PEYICILHAA KALGRPVKWT DERSSAFLSD SHGRGQQIRA ELALDAAGKF LAIRLSGTGN LGAYITGVAP LPLSLNTGKN IGSVYRTPLL GVDIKCVVTN VTLMGAYRGA GRPEANYFLE RLIDRAADEI GIDRLALRKR NFIKPQQLPF TACSGVTYDS GDFGGVFAQA LELSDHAGFA QRKKESRKRG KLRGIAVGSY LEVTAPPSAE LGKIVFEEDG TIRLITGTLD YGQGHATPFA QVLSTYLGVP FDRIRLEQGD SDVVHTGNGT GGSRSITASG MAIVEASQQV IAKGKAAASH LLETAEADIE FADGRFTVAG TDRSIGIMEL AQRLREAKLP DGVPASLDVD HTVKAVPSAF PNGCHVAEVE IDPDTGVTRV VRYTAVNDFG VVVNPMIVAG QLHGGVAQGI GQALMEKMSY DGDGQPITGS LQDYALPRAE DIPPMAVGDH PVPAPGNPLG TKGCGEAGCA GSLASVVNAV LDALKDHGVK SLDMPLTSEK VWRAIREAKE TAAA
|
| |