Gene Mchl_2047 details


Gene Information

Locus tag: Mchl_2047
Symbol:
ID: 7118747
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacterium chloromethanicum CM4
Kingdom: Bacteria
Replicon accession: NC_011757
Strand:
Start bp: 2144655
End bp: 2147435
Gene Length: 2781 bp
Protein Length: 926 aa
Translation table: 11
GC content: 72%
IMG OID: 643524797
Product: aldehyde oxidase and xanthine dehydrogenase molybdopterin binding
Protein accession: YP_002420822
Protein GI: 218530006
COG category: [C] Energy production and conversion
COG ID: [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
        [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs
TIGRFAM ID:
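
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Number of residues encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2144655, 2147435)
print(length)                      # 2781
print(protein_length_aa(length))   # 926
```

Both values agree with the Gene Length and Protein Length fields reported for Mchl_2047.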


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.17987
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.825873
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTCA CGGTCAACGG GCAGGTTCAG GAGGCCGCAG CCCGGCCCGG CCAATGCCTG 
CGCACGCTGC TGCGGGATCT CGGCTGGTTC GGGGTCAAGA AGGGCTGCGA TGCGGGCGAT
TGCGGCGCCT GCACCGTCCA TCTCGACGGC GAGCCGGTCC ATTCCTGCCT CATGCCCGCC
TTCCTGGCGG AGGGGCGCGG CGTCACCACG ATCGAAGGCC TCGCCGGCCC CTGCGCGCCG
GGCGACGGGC CGCCCGAACA TCTCCACCCG ATGCAGGACG CGTTCTGCGC CGCGCAGGGC
TTCCAGTGCG GCTTCTGCAC CCCGGGCATG ATCATGACGG CGGCCGCCCT CGACCAGGGC
CAGCGGCAGG ATCTCGGCAC GGCGCTCAAG GGCAATCTCT GCCGCTGCAC CGGCTACCGG
GCGATCCGCG ACGCCGTGGC CGGGGTCGCC CATGCCGATG CTTCGCAGAC GGCGGAGGGC
GGCCCGGTCG GCCGCAGCCT GCCGGCTCCG GCGAGCCGCG CCCTCGTCTC CGGCCGGACC
GCCTACACCT TCGACACCGC CGTGCCGGGT CTGCTGCACC TGAAGGTGCT GCGCACCCCC
CACGCCCATG CCCGTATCCG CAGCATCGAC CGGGCGGCGG CGCTGGCCAT GCCCGGCGTG
GTCGCGGTGC TCACCCACGA GGACGCTCCG CGGCGCCGTT TCTCGACCGG GCGGCACGAG
AACCCTTTCG ATGACGCCGC CGACACGGGC GTGCTCGATT CCGTGATCCG CTTCCACGGC
CAGCGCGTCG CGGCGGTGGT CGCCGAGAGC ATCGCAGCGG CGGCCATGGG CGTGCAAGCG
CTGCGGGTGG AATACGACGT GCAGCCCGCC GTGTTCGATC CGGAGGCGGC CCTTCACCCG
GACGCGCCCC TCGTCCACGA TCCCGCCGAC CGCGACCCGC CGCGGCCCGG CGACGATGCG
CCCCCGCTCC TCGCGCACCC GAACCTCGCC GCCGAGGCGC ACGGCGCGAT CGGCGATGTC
GAGGCGGGCT TCGGCCAAGC CGACCGGATC CACGAGGCGG AATACGTCTC GCAGCGGGTG
CAGCATGTGC ATCTCGAAAC GCACGGCGCG CTGGGCTGGC TCGACGCGGA GGGCCGGCTG
ACCCTGCGCT CCTCGACGCA GGTGCCCTTC CTGACCCGCG ATGCCGTGTG CCGCCTGTTC
GATCTAGACC GGGACCGGGT GCGCGTGCTC TGCGGCCGGG TCGGCGGCGG CTTCGGCGGC
AAGCAGGAGA TGCTGACCGA GGACCTCGTG GCCCTCGCCG TGCTGCGGAC CGGCCGGCCG
GTCTCCTACG AGATGACCCG CGAGGAGAAT TTTTGCGCGG CGACGACGCG CCATCCGATG
CGGGTGCGGG TGAAGATCGG CGCGCGGGCT GACGGGCGCC TCACCGCGCT TTCGCTGAAA
GTGCTCTCGA ACACCGGGGC CTACGGCAAC CATGCCGGCG GCGTGCTGCA TCACGGCTGC
AACGAGAGCA TCGCCGCCTA TGCCTGCCCG AACAAGCGGG TGGAGGGCTA CGCCGTCTAC
ACCCATACCC TGCCGGCCGG GGCCTTTCGC GGGTACGGCC TGAGCCAGAC GATCTTCGCC
GTGGAATCGG CGCTGGACGA ACTGGCCCGC GATCTCGGGA TCGACCCGTT CGCGATGCGC
CGCCTCAACG CGGTGCGGCC CGGCGACCCG ATGGTGTCGA CGAGCCTGGA GCCGCACGAC
GTCGTCTACG GCTCCTACGG GCTCGACCAG TGCCTCGACC GCGCCCAGGC CGCGCTGGCG
GATGGCAGCG GCGAGGCGGC GCCGGGGCCG GACTGGCGCG TCGGCGAGGG CATGGCGATG
GCGATGATCG ATACGATCCC GCCGCGCGGC CACGTCGCCC ATGCCCGCAT CCGCCTCGAA
CCGGACGGGA CCTACGCCCT CGCGGTCGGC ACCGCCGAGT TCGGCAACGG CACCAGCACG
GTGCACGGGC AGATCGCCGC CGAGGTGCTC GGCACCACGC CCGAGCACAT CCGCCTGATC
CAGGCCGACA CCGATGCCGT CCGGCACGAT ACCGGCGCCT ATGGCAGCAC GGGCACCGTG
GTCGCCGGAC AGGCGAATTT CCGCGCGGCC TCGGCGCTCG CCGACCAGCT TCGCGCGGCC
GCCGCCGAGC GGTCCGGCGT TAGTGCGGAC GCGTGCCGCC TGACCCGCGA CGGCGTCGCG
ACGCCCACCG GCCTCGTCTC CCTGACCACC CTCGCGCAAG GGGCCATCTT CGAAGCCGAG
GGCGAGGCCG ACGGGACGCC GCGCTCGGTC GCCTTCAACG TCCAGGCCTT TCGCGTGGCC
GTCCATCCGC AGACCGGCGA GATCCGCATT CTCAAGAGCA TTCAGGCGGC GGATGCCGGC
CGGGTCATCA ACCCGATGCA GTGCCGCGGG CAGATCGAGG GCGGGGTGGC GCAGGCGCTC
GGCGCGGCCC TGCACGAGGA CTACCGCTTC GACGAAACGG GCGCGGTCGT CACGCGGACC
TTGCGCAACT ACCACATCCC GGCGATGGCC GACGTGCCGG TGACCGAGGT TCTGTTCGCC
GACACCCACG ACACGGTCGG TCCACTGGGC GCGAAATCGA TGAGCGAGGC GCCCTACAAT
CCCGTCGCCG CCGCCTTGGG CAATGCGATC CGGGACGCCA CCGGCGTACG GCTGACCGCG
ACGCCGTTCG CCCCCGACCG GATCTTTCGA CAGGTCATGG CGGCGCAAGA GAACGAGGAA
CAGGAGACGG CGGACGCATG A
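
A GC content figure such as the one reported in the Gene Information section can be computed from a sequence like this one. The following is a minimal sketch, assuming the reported percentage is the share of G and C bases in the gene sequence itself (the record does not state how the value was derived). The function name `gc_content` is illustrative, and only the first row of the sequence is used to keep the example short.

```python
def gc_content(sequence):
    """Return the percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence; the blocks of ten are separated by spaces.
first_row = "ATGAGGCTCA CGGTCAACGG GCAGGTTCAG GAGGCCGCAG CCCGGCCCGG CCAATGCCTG"
print(f"{gc_content(first_row):.1f}% GC over the first 60 bases")
```

Running the same function over the complete 2781 bp sequence would give the gene-wide value to compare against the reported figure.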
 
Protein sequence
MRLTVNGQVQ EAAARPGQCL RTLLRDLGWF GVKKGCDAGD CGACTVHLDG EPVHSCLMPA 
FLAEGRGVTT IEGLAGPCAP GDGPPEHLHP MQDAFCAAQG FQCGFCTPGM IMTAAALDQG
QRQDLGTALK GNLCRCTGYR AIRDAVAGVA HADASQTAEG GPVGRSLPAP ASRALVSGRT
AYTFDTAVPG LLHLKVLRTP HAHARIRSID RAAALAMPGV VAVLTHEDAP RRRFSTGRHE
NPFDDAADTG VLDSVIRFHG QRVAAVVAES IAAAAMGVQA LRVEYDVQPA VFDPEAALHP
DAPLVHDPAD RDPPRPGDDA PPLLAHPNLA AEAHGAIGDV EAGFGQADRI HEAEYVSQRV
QHVHLETHGA LGWLDAEGRL TLRSSTQVPF LTRDAVCRLF DLDRDRVRVL CGRVGGGFGG
KQEMLTEDLV ALAVLRTGRP VSYEMTREEN FCAATTRHPM RVRVKIGARA DGRLTALSLK
VLSNTGAYGN HAGGVLHHGC NESIAAYACP NKRVEGYAVY THTLPAGAFR GYGLSQTIFA
VESALDELAR DLGIDPFAMR RLNAVRPGDP MVSTSLEPHD VVYGSYGLDQ CLDRAQAALA
DGSGEAAPGP DWRVGEGMAM AMIDTIPPRG HVAHARIRLE PDGTYALAVG TAEFGNGTST
VHGQIAAEVL GTTPEHIRLI QADTDAVRHD TGAYGSTGTV VAGQANFRAA SALADQLRAA
AAERSGVSAD ACRLTRDGVA TPTGLVSLTT LAQGAIFEAE GEADGTPRSV AFNVQAFRVA
VHPQTGEIRI LKSIQAADAG RVINPMQCRG QIEGGVAQAL GAALHEDYRF DETGAVVTRT
LRNYHIPAMA DVPVTEVLFA DTHDTVGPLG AKSMSEAPYN PVAAALGNAI RDATGVRLTA
TPFAPDRIFR QVMAAQENEE QETADA
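
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The sketch below is one way to check this by hand, assuming the codon-to-amino-acid assignments of NCBI table 11, which match the standard code for internal codons. Table 11 also permits alternative start codons, but this gene begins with ATG, so no special handling is included. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate a coding sequence from its first base, stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 bases of the gene sequence and first 20 residues of the protein.
gene_start = "ATGAGGCTCA CGGTCAACGG GCAGGTTCAG GAGGCCGCAG CCCGGCCCGG CCAATGCCTG"
protein_start = "MRLTVNGQVQ EAAARPGQCL".replace(" ", "")

print(translate(gene_start))                   # MRLTVNGQVQEAAARPGQCL
print(translate(gene_start) == protein_start)  # True
```

The same function applied to the full gene sequence would stop at the terminal TGA codon.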