Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | RPC_0848 |
Symbol | |
ID | 3969845 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhodopseudomonas palustris BisB18 |
Kingdom | Bacteria |
Replicon accession | NC_007925 |
Strand | + |
Start bp | 934436 |
End bp | 937429 |
Gene Length | 2994 bp |
Protein Length | 997 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637923964 |
Product | carbon-monoxide dehydrogenase |
Protein accession | YP_530737 |
Protein GI | 90422367 |
COG category | [C] Energy production and conversion [S] Function unknown |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG3427] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.682595 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAAC CCGCAGCAGC GGCAACGGAG ATCGATGTCG ACCGTCCATG GGTCGGCCGT TCGATCGAGC GCGTCGAAGA CGGCGCCCTG CTCACCGGGC GCGGCCGGTT CATCGATGAT CTCGGAACCC GGCCCGGCAC GCTCTACGCC GCGATCCTGC GCTCGCCGCA CGCCCACGCC GACATCGTCG CGATCCGCAC CGAGGCCGCA AAGCAGGCCG CGGGCGTCGT CGCGGTGCTC ACCGGCGAAG ATATCACCGC GCTGACCTCG AGCCTCGTGG TCGGCGTCAA GGCGCCGGTG CAATGCTGGC CGATCGCGGT CGGTCGCGTG CGCTACGTCG GCGAGGCGGT GGCGATCGTG GTTGCGACCG ATCGCTACGT CGCCGAGGAC GGCGTCGATC TGATCGAGGT CGACTACCAG GTGCGTGCGG CGGTGATCGA TCCGCTTGCC GCGCTGTCGG CCGACGCGCC GGTGCTGCAC GACGGCTTTG CCGGCAACGT CGCCAGCGAT CGCAGCTTCC GCTACGGCGA TCCGGAACGC GCCTTCGCCG AGGCGCCGCA TCGCATTTCC ATCGCCATCA AGTACCCGCG CAATTCCTGC ACCCCGATCG AAACCTACGG CGTGGTCGCC GACTACGATG CCGCGGAAGA CGCCTACGAC GTGCTGGCCA ATTTCCAGGG GCCGTTCAGC ATCCACGCGG TGATCTCGCG CGCCCTGAAG GTGCCGGGCA ATCGGCTGCG GCTGCGCACC CCGCCGGACT CCGGCGGCAG TTTTGGCATC AAGCAGGGAA TTTTCCCGTA CATCGTGCTG ATCGCTGCGG CCTCCCGTGT GGTCGGCCGC CCGGTGAAGT GGATCGAGGA CCGGCTCGAA CATCTCACCG CCTCGGTGTC GGCGACCAAC CGCGCCACCT GCATCGCCGC CGCCGTCGCC GCCGACGGCA AGATCATGGC GCTGGACTGG GATCAGGTCG AGGATTGCGG CGCGCATTTG CGCGCGCCGG AGCCGGCGAC GTTGTACCGG ATGCACGGCA ATCTCACCGG CGCCTATGCG ATCGACAACG TCGCGGTGCG CAACCGCGTC GTCGTCACCA ACAAGACCCC GACCGGGCTC AACCGCGGCT TCGGCGGCCC GCAGATGTAT TTCGCGCTAG AGAGGCTGCT GCAGCGCATC GCGGTCGAAC TCGAGCTCGA TCCGCTCGAC GTGATCAAAC GCAATCTGGT TCCGGCCGGA TCGTTTCCCT ATCGCACCGC GACCGGCGCG TTGCTGGACT CCGGCGACTA CCAAGAGGCG ATCGCGCGCG GCGTCGACGG CGGCGGGCTC GCCGCGTTGA AGGCGCGGCG CGATGCGGCG CGCGCTGAGG GCCGGCTCTA CGGCATCGGC TACACCGCGG TGGTCGAGCC CAGCGTCTCC AACATGGGCT ACATCACCAC GGTGTTGACC GCGGCGGAGC GCCGCAAGGC CGGGCCGAAG AACGGCGCGC AGGCGACCGC CACCGTGGCG CTCGATCCGG TCGGCGGCAT CACCGTGCAC GTCGCTTCGG TGCCGCAAGG CCAGGGCCAT CGCACCGTGC TGTCGCAGGT GGTCGCCGAC GTGTTCGGCG TTGCGCCCAC CGATGTCCGC GTCAACACCG AGATCGACAC CGCGAAGGAC GCCTGGTCGA TCGCATCCGG AAACTACGCG AGCCGCTTCG CCGCCGCGGT GGCCGGCACC GCCAAGCTCG CCGCGGGTCG GCTGGCGGGA CGGCTGGCGC GCGTTGCGGC GAGTCAATTG AACATCGACG TCGCCGACGT GGTGTTCCGC GGCGGTCGGG TCGGCTCCAA GTCCAACCCC GACAACAGCA TTGCGTTCAC GCGGCTCGCC GCGCTGAGCC ATTGGTCGCC GGGCTCGTTG CCGGACGATA TCGGCAACAC GCTGCGCGAA ACAGTGTTCT GGACGCCGCC GGAGCTGGCG GCGCCGGACG ACGCCGACCG GGTGAACTCC TCGCTGTGCC ACGGCTTCAT CTTCGATTTC TGCGGCGTCG AGATCGATCC GGTCACGCTG GAAGCTAAGA TCGATCGCTA CGTCACCATG CACGATTGCG GCACCATCCT GCATCCCGGC ATGGTCGACG GCCAGATCCG CGGCGGCTTC GCGCAGGCGA TCGGCGCCGC GCTGTACGAG GAATACGCCT ACGCGCCGGA CGGCAGCTTC CTCACCGGCA CGCTCGCCGA TTACTTGCTG CCGACCACCA TGGAAGTGCC GGAGCCTAAG ATCCTGCACA TGGAGACGCC GTCGCCGTTC ACCCCGCTCG GCGCCAAAGG CGTCGGCGAA GGCAATTGCA TGTCGACGCC GGTGTGCGTC GCCAACGCAG TCGCCGACGC GCTGGGCATC AAGGACATCA CCCTGCCGCT GGTGCCGGCG CGGTTGGCGC AGTTTTTACG CGGAGATGAG CGCGCGGCGC CGGCCGGCGG CCGTGCGCCG GCACCACCAC GCGCCGGCGG CACCGATCGT AAGCTGCGCG GCGAGGGGAG CGCGTCGGTC GGCGCGCCGC CGCAACAGGT CTGGACGATG CTGCTCGATC CGGAGACGCT GAAGACGGTG ATCCCCGGTT GCGAGCGGGT CGAGAAAATC TCCGATACGC ATTTCCGCGC CGAGGTGACG CTCGGCATTG GCCCGGTGAC CGGGCGCTAT CGGGCCGACG TCAAACTCTC CGATCTCGAT CCGCCGCGCG CGGTGACGCT CGGCGGCACC GCCGAGGGCG CGCTCGGCTT CGGCGGCGGC GAGGGCCGCA TCACGCTTGC GCCTGATAGC AACGGCGGCA CCACGATGAC TTACGTCTAT GAGGCGGCGA TCGGCGGCAA GGTCGCCAGC ATCGGCGGAC GCCTGCTCGA CGGCGCGACG CGCGTCATCA TCGGCCGGTT CTTCACCGCT CTCGCCGCCA CCGCCGGCGG CAAGCCGGTG CCGAGCGACT CCTGGCTGAC GCGGCTGCTG CGACTCGTGG GGTGGTCGCG ATGA
|
Protein sequence | MAQPAAAATE IDVDRPWVGR SIERVEDGAL LTGRGRFIDD LGTRPGTLYA AILRSPHAHA DIVAIRTEAA KQAAGVVAVL TGEDITALTS SLVVGVKAPV QCWPIAVGRV RYVGEAVAIV VATDRYVAED GVDLIEVDYQ VRAAVIDPLA ALSADAPVLH DGFAGNVASD RSFRYGDPER AFAEAPHRIS IAIKYPRNSC TPIETYGVVA DYDAAEDAYD VLANFQGPFS IHAVISRALK VPGNRLRLRT PPDSGGSFGI KQGIFPYIVL IAAASRVVGR PVKWIEDRLE HLTASVSATN RATCIAAAVA ADGKIMALDW DQVEDCGAHL RAPEPATLYR MHGNLTGAYA IDNVAVRNRV VVTNKTPTGL NRGFGGPQMY FALERLLQRI AVELELDPLD VIKRNLVPAG SFPYRTATGA LLDSGDYQEA IARGVDGGGL AALKARRDAA RAEGRLYGIG YTAVVEPSVS NMGYITTVLT AAERRKAGPK NGAQATATVA LDPVGGITVH VASVPQGQGH RTVLSQVVAD VFGVAPTDVR VNTEIDTAKD AWSIASGNYA SRFAAAVAGT AKLAAGRLAG RLARVAASQL NIDVADVVFR GGRVGSKSNP DNSIAFTRLA ALSHWSPGSL PDDIGNTLRE TVFWTPPELA APDDADRVNS SLCHGFIFDF CGVEIDPVTL EAKIDRYVTM HDCGTILHPG MVDGQIRGGF AQAIGAALYE EYAYAPDGSF LTGTLADYLL PTTMEVPEPK ILHMETPSPF TPLGAKGVGE GNCMSTPVCV ANAVADALGI KDITLPLVPA RLAQFLRGDE RAAPAGGRAP APPRAGGTDR KLRGEGSASV GAPPQQVWTM LLDPETLKTV IPGCERVEKI SDTHFRAEVT LGIGPVTGRY RADVKLSDLD PPRAVTLGGT AEGALGFGGG EGRITLAPDS NGGTTMTYVY EAAIGGKVAS IGGRLLDGAT RVIIGRFFTA LAATAGGKPV PSDSWLTRLL RLVGWSR
|
| |