Gene Information |
Locus tag | RPB_3096 |
Symbol | |
ID | 3910897 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris HaA2 |
Kingdom | Bacteria |
Replicon accession | NC_007778 |
Strand | + |
Start bp | 3527565 |
End bp | 3529913 |
Gene Length | 2349 bp |
Protein Length | 782 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637885000 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_486705 |
Protein GI | 86750209 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs |
TIGRFAM ID | [TIGR02416] carbon-monoxide dehydrogenase, large subunit |
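The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3527565
end_bp = 3529913
gene_length_bp = 2349
protein_length_aa = 782


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


# Both checks pass for the values listed in this record.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

Under those assumptions the listed gene length (2349 bp) and protein length (782 aa) agree with the coordinates.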
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCGAGA CGGGCGGATT CATTGGAAAG TCGGTTCCGC GGCGCGAAGA CAAGCGGCTT CTGACCGGCA AGGGCGAGTT CGTCGCGGAC CTCAAGCTGC CGTCGATGCT GCACGCGGCG TTCGTTCGCA GCCAGGTCGC CCATGGCCGG ATCAAATCGG TCGATCTGTC GCGCGCCCTG CGGTCACCGG GCGTCGTCTA TGCGATCTCC GGGCCGGATC TCGCCAAGCT GCTGCCACCC GTGCCCGACA CGCAACTGTC GCTGCCGAAG AAATGGACGA CGCGGGTCCA GCACACCTTC CTCAATCCGC AGCAACCGCT GCTCGCTTAC GACAAGGTCC GGCATGTCGG CGAGGCCGTC GCGGTCATTC TCGCAGAGAG CCGCTACCTC GCTGAAGACG CCGCCGAGCT GGTGACCATG GAGATCGAGC CGCTGCCGGC CGTGGTAGAT CCGGAAGCCG GGCTCACCCG CGACAGCGCC GTGTTGCACG AACAATACGA CACCAATCTG ATCGGCGATT TCGCCATTGC CAAAGGCGAC GTCGAGACAG CGCTGGCGAA CGCGCCGCAC CGCATGAAGC GCCGCTTCTA TCATCATCGC TACGCCGCGA TCCCGATGGA GGGCCGCGGC GTCGCCGCCA ACTACGACGC ACGCACGGAT TCGATCAACA TCTGGTCGGC GTGCCAGGTC ATTCACTGGC TCCGCCGCGA GGCGTCGACG GTGCTCGGCA TGCCCGAAGC TCGCATTCGC TGCGTCGCGC TCGACGTCGG CGGCGGCTTC GGCGTCAAGG GCCACGTCTA TCCCGAAGAA TTGCTGATTC CTTATCTTGC GCGCGAAGTC GGCCGCCCGG TGAAATGGAT CGAGGATCGG CACGAGCATT TCATGAGCGC CTGCCATTCC CGCGACCAGA CCCACGATGT AGAGTTCGGG TTCGACGACG ACGGTCGTCT GCTAGCGTTC CAGGACGAAT TCCTCGTCGA TTGCGGCGCC TGGAATCCGA TCGGCTCCGG CATCGCCTAC AACACTGCGG TGCATTTGCC CGGCCCCTAC AAATTCGAGC ACTTCGCGGT GCGATCGAAG ATCGTCGCCA CCAACAAAGT GCCCAACGCG CCCTATCGCG GTGCCGGCCG CCCCGAAGCG ACGTTCGCGA TGGAGCGGGT GATCGATCTG ATCGCGGCCG AACTCGGCCT CGATCCGGCC GACGTCCGCA TGCGCAACAT GATTCCGGCT TCCGAGATGC CGTATCGTCT CGGCCTTCCC TACCGGGACG GCGAGCCGAT CGTCTACGAC AGCGGCGACT ATCCGGAATC GCTGCGCCAG GCATTGGCGG CACTAGGAGG TGTCGACGCC TTCCGCGATC GGCAGCGGGC CGCACGGGCG CAAGGTCGAT ATTTCGGCCT CGGACTCGGA TGCTACGTCG AAGGCACCGG CGTCGGACCG TTCGAAAGCG CTACCGTCCG CGTAGACCCG ACCGGCAAGA TCTATCTCGC AGGCGGCGCC TGCCCGCAGG GACAAGGCAT GGAAACGATC TTCTCGCAGA TCGTCGCGGA TGCCTGGCAG GTTCAACCCG ATGACGTCGT CGTAGCATTG GCGGACACGA GCGTGATCTC GATCGGCTTC GGCACCATCG CGAGCCGCAG CACCGTGAAC TTGTCGGGCG CAATCCACAC CGCGAGCCAA TCGCTGCAGA AGAAGGTCTT CGCGATCGCA GCAGACATGC TCGAATGCTC GCCGGCCGAC CTCGAGCTGC GCAACGGAAC CGTCGGCCTC GTCGGCGTAC CGGGCAGAGA GATTCCCCTC 
GCCCGCATCG CCAAAGCGGC GATGCCCGGC TGGGACAACA AGCGACCCGC CGGTGTTTCA GCGGGGCTGG AGGAAACCGC CTACTTCGAG CCGCCGACCG TGACCTGGGC TTACGCGACG CACGCCGCGA TCGTCGAGCT CGATGTCGAA CTCGGCCGCG TCGAGATCGA GAAATACGTC ATCGTGCATG ATTGCGGCGT GGTGGTGAAT CCGATGCTGG TCGACGGGCA GATTAACGGC GGCGCCGTGC AGGGCCTCGG CGGCGCGCTG CTGGAGGAAC TAAGCTACGA TTCGGAAGGC CAGTTGCTGG TTGGATCGTT CATGGACTAT CTAGTACCGG GCGCGAGCGA CGTGCCGCAT TTCGAGCTGA AGCACATGCA CTTCCCCTCG CCGCTGAATC CCTATGGCGT GAAAGGCGTC GGAGAAGGTA GCGCGATCGC GCCGCCGGTG GTCATCGCCA ATGCGGTCTC CGATGCACTC TCTCACCTCA AGGTCGAATT CAATTCGACG CCGATCAGGC CTGAGCACAT CGTCACAGCG TTCGGATAA
|
Protein sequence | MAETGGFIGK SVPRREDKRL LTGKGEFVAD LKLPSMLHAA FVRSQVAHGR IKSVDLSRAL RSPGVVYAIS GPDLAKLLPP VPDTQLSLPK KWTTRVQHTF LNPQQPLLAY DKVRHVGEAV AVILAESRYL AEDAAELVTM EIEPLPAVVD PEAGLTRDSA VLHEQYDTNL IGDFAIAKGD VETALANAPH RMKRRFYHHR YAAIPMEGRG VAANYDARTD SINIWSACQV IHWLRREAST VLGMPEARIR CVALDVGGGF GVKGHVYPEE LLIPYLAREV GRPVKWIEDR HEHFMSACHS RDQTHDVEFG FDDDGRLLAF QDEFLVDCGA WNPIGSGIAY NTAVHLPGPY KFEHFAVRSK IVATNKVPNA PYRGAGRPEA TFAMERVIDL IAAELGLDPA DVRMRNMIPA SEMPYRLGLP YRDGEPIVYD SGDYPESLRQ ALAALGGVDA FRDRQRAARA QGRYFGLGLG CYVEGTGVGP FESATVRVDP TGKIYLAGGA CPQGQGMETI FSQIVADAWQ VQPDDVVVAL ADTSVISIGF GTIASRSTVN LSGAIHTASQ SLQKKVFAIA ADMLECSPAD LELRNGTVGL VGVPGREIPL ARIAKAAMPG WDNKRPAGVS AGLEETAYFE PPTVTWAYAT HAAIVELDVE LGRVEIEKYV IVHDCGVVVN PMLVDGQING GAVQGLGGAL LEELSYDSEG QLLVGSFMDY LVPGASDVPH FELKHMHFPS PLNPYGVKGV GEGSAIAPPV VIANAVSDAL SHLKVEFNST PIRPEHIVTA FG
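As an illustration of how the protein sequence relates to the gene sequence, the following minimal sketch translates the first 30 bases and the last 9 bases of the gene sequence using the codon assignments of translation table 11 (which assigns the same amino acids as the standard table; the two differ only in permitted start codons). The `translate` helper is hypothetical and written for this example; it is not the tool that produced this record.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Amino acid assignments are the same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    Any trailing bases that do not fill a codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGGCCGAGA CGGGCGGATT CATTGGAAAG"))  # MAETGGFIGK
# Last nine bases of the gene sequence, ending in the TAA stop codon.
print(translate("TTCGGATAA"))  # FG
```

The two outputs match the first ten and last two residues of the protein sequence listed above.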