Gene M446_4793 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_4793 
Symbol 
ID6129406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp5266262 
End bp5269039 
Gene Length2778 bp 
Protein Length925 aa 
Translation table11 
GC content76% 
IMG OID641644930 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001771557 
Protein GI170742902 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs
[COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.515425 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0128393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCTGA CGGTCAACGG CCTCTCCCAG GAGGCGGCGC CGCGCCCCGG CCAGTGCCTG 
CGCACCCTGC TGCGCGACCT CGGCTGGTTC GGCGTGAAGA AGGGCTGCGA CGCCGGCGAT
TGCGGCGCCT GCACGGTCCA TCTCGACGGG GAGCCGGTGC ATAGCTGCCT CGTCCCGGCC
CTGCGGGCGG AGGGGCGCGC CGTCACGACG ATCGAGGGCC TGTCCGGCCC CTGCGGGCCC
GATGCGGCGG TGCCCGACCG GCTGCACCCG GTCCAGGAGG CCTTCCTGGC CGCGCAGGGC
TTCCAGTGCG GCTTCTGCAC GCCTGGCATG GTGATGACCG CCGCCGCCCT CGACCAGGGC
CAGCGGCGCG ACCTCGGCGC CGCCCTCAAG GGCAATCTCT GCCGCTGCAC CGGCTACCGG
GCGATCGCGG ACGCGATCGC CGGGATCGCC GACGCGATCG CCGGCAGCGC GGACGCGAGT
GCGGGCAGCG TCGCGGCCGA GGGCGACGGG GCGGGGGGTG ATCCCTGCGG CCGCAGCCTG
CCCGCCCCGG CCGGGCCGGC GGTGGTGAGC GGGCGGGCGC GCTACACGTT CGACCTCGCG
GTCGAGGGGT TGCTCCACCT CAAGGTGCTG CGCTCGCCCC ACGCCCACGC GGAGATCCTG
CAGGTCGACC GCGCCGCCGC CCTCGCGGTG CCGGGGGTGG TGGCGGTATT CACCCACGAG
GACGTGCCGG CATCCCGCTA CTCGACCGGG CGCCACGAGG ATCCGCGCGA CGACGCGCCC
GACACCCTGA TGCTCGACCG GATCGTGCGC TTCGTCGGGC AGCGGGTCGC CGCGGTGGTG
GCGGAGAGCG AGGCGGCGGC CGAGGCGGGC GTGGCCGCCC TCGTCGTGAC CTACGCGCCG
CGCCCGGCCC TGCTCGATGC CGAGCGGGCC CTCGATCCGG ACGCGCCGCG CGTCCACGAT
CCGGGGCCGC CCGGCGCCGA CGCCCCGCCG CTCCACCCTC ATCCGAACAT CGCGGCCGAG
GTGCACGGAC AGGTCGGCGA CGTCGAGGCC GGCTTCGCCG AGGCCGACCT GGTGATCGAG
GGCACGTACC GCTCGCAGCG CATCCAGCAC GCCCACCTGG AGACGCACGG GGCGCTGGGC
TGGCGCGACG AGGCGGGCCG CCTCGTCCTT CGCACCAGCA GCCAGGTGCC CTTCCTGACC
CGCGACGCCC TCGCGGCGCT GCTGGGCCTC GACCGGGCGC AGGTGCGGGT GCTGTGCGGG
CGGGTCGGCG GCGGCTTCGG CGGCAAGCAG GAGATGCTGA CCGAGGACTT GGTGGCCCTG
GCGGTGCTGC GCCTCGGGCG GCCGGTGAAG TGGGAATTCA CGCGCGGCGA GCAGTTCACC
GGGGCGACGA CCCGCCACCC GATGCGGGTC CGCGTCAAGC TCGGCGCGCG GCGCGACGGC
ACGCTCACCG CCATCGCCCT CGACGTGCTC GCCGAGACCG GCGCCTACGC CAACCACGCG
GGCGGCGTGC TCCACCACGG CTGCAACGAG GTGATCGGCG TCTATCGCTG CCCGAACAAG
CGGGTGGACG GCGTCTCGGT CTACACCCAC ACGGTGCCGG CGGGGGCCTT CCGGGGCTAC
GGCCTGAGCC AGACCATCTT CGCGGTCGAA TCGGCGATGG ACGACCTCGC GCGCGGCCTC
GGCCTCGACC CCTACCTCCT GCGCCGCCGC AACGCGGTGC GGCCGGGCGA TCCCCTGGTC
TCGACCAGCC TGGAGCCCCA CGACGTCGCC TACGGCTCCT ACGGGCTCGA CCAATGCCTC
GACCGCGCCG AGGCGGCGAT GCGGGAGCCC GGCGGCGAGG CCCCGCCCGG GCCCGGCTGG
CGCGTCGGCG AGGGCATGGC GATGGCGATG ATCGACACGA TCCCGCCCCG CGGGCACCGC
GCCGAGGCCC GCCTCTCGCT CACCGGGGCG GGCACCTACG CGCTCGCGGT CGGCACCGCC
GAGTTCGGCA ACGGCACCGC GACGGTGCAC GGGCAGATCG CCGCCTCGGT GCTCGGGACG
CGGCCGGGGC GGGTGCGCCT GCACGCCTCC GACACCGACG CGGTCGGCCA CGACACCGGC
GCCTACGGCA GCACCGGCAC GGTGGTGGCC GGGCAGGCGA CGTTGCGGGC GGCGGAGGAC
CTGGCGCGGG CGATCCGCGC CGCGGCGGCG GCCCGCACCG GCACGGACCC GGCGGCGTGC
CGGCTCGCGG GCGAGGCGGT CGAGACCCCG GCCGGCCCGG TGCCGCTCGC GGATCTCGCG
CCGCTCGACG CGGTCGGGCG CGCGGATGGC AGCCCGCGCT CCGTGGCCTT CAACGTGCAG
GCTTTCCGGG TCGCCGTGCA TCCGGGGACC GGCGAGGTGC GGATCCTGCG CAGCGTGCAC
GCGGCGGATG CGGGCCGGGT CATCAACCCG ATGCAGTGCC GCGGCCAGAT CGAGGGCGGC
GTGGCGCAGG CGCTCGGCGC GGCGCTCTAC GAGGAGGTGC GCCTCGACGG CGCGGGCCGC
GTGGAGACCC AGAGCTTCCG CAGCTACCAT ATCCCGGCCT TCGCGGACGT GCCGCGCACC
GAGGTCCTCT TCGCCGACAC CTACGACCGG ATCGGTCCGC TCGGCGCGAA ATCGATGAGC
GAGAGCCCGT TCAACCCGGT GGCGGCGGCG CTCGGCAACG CCATCCGCGA CGCGACCGGT
GCCCGGCTCA CCGAGACTCC CTTCGCGCCG GACCGGATCT ACCGGGCGGT CGCGGCGGCG
CGCACCTGCG GGTCCTGA
 
Protein sequence
MMLTVNGLSQ EAAPRPGQCL RTLLRDLGWF GVKKGCDAGD CGACTVHLDG EPVHSCLVPA 
LRAEGRAVTT IEGLSGPCGP DAAVPDRLHP VQEAFLAAQG FQCGFCTPGM VMTAAALDQG
QRRDLGAALK GNLCRCTGYR AIADAIAGIA DAIAGSADAS AGSVAAEGDG AGGDPCGRSL
PAPAGPAVVS GRARYTFDLA VEGLLHLKVL RSPHAHAEIL QVDRAAALAV PGVVAVFTHE
DVPASRYSTG RHEDPRDDAP DTLMLDRIVR FVGQRVAAVV AESEAAAEAG VAALVVTYAP
RPALLDAERA LDPDAPRVHD PGPPGADAPP LHPHPNIAAE VHGQVGDVEA GFAEADLVIE
GTYRSQRIQH AHLETHGALG WRDEAGRLVL RTSSQVPFLT RDALAALLGL DRAQVRVLCG
RVGGGFGGKQ EMLTEDLVAL AVLRLGRPVK WEFTRGEQFT GATTRHPMRV RVKLGARRDG
TLTAIALDVL AETGAYANHA GGVLHHGCNE VIGVYRCPNK RVDGVSVYTH TVPAGAFRGY
GLSQTIFAVE SAMDDLARGL GLDPYLLRRR NAVRPGDPLV STSLEPHDVA YGSYGLDQCL
DRAEAAMREP GGEAPPGPGW RVGEGMAMAM IDTIPPRGHR AEARLSLTGA GTYALAVGTA
EFGNGTATVH GQIAASVLGT RPGRVRLHAS DTDAVGHDTG AYGSTGTVVA GQATLRAAED
LARAIRAAAA ARTGTDPAAC RLAGEAVETP AGPVPLADLA PLDAVGRADG SPRSVAFNVQ
AFRVAVHPGT GEVRILRSVH AADAGRVINP MQCRGQIEGG VAQALGAALY EEVRLDGAGR
VETQSFRSYH IPAFADVPRT EVLFADTYDR IGPLGAKSMS ESPFNPVAAA LGNAIRDATG
ARLTETPFAP DRIYRAVAAA RTCGS