Gene M446_3047 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagM446_3047 
Symbol 
ID6135042 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium sp. 4-46 
KingdomBacteria 
Replicon accessionNC_010511 
Strand
Start bp3373481 
End bp3375682 
Gene Length2202 bp 
Protein Length733 aa 
Translation table11 
GC content74% 
IMG OID641643238 
Productaldehyde oxidase and xanthine dehydrogenase molybdopterin binding 
Protein accessionYP_001769892 
Protein GI170741237 
COG category[C] Energy production and conversion 
COG ID[COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.866626 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0234093 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCAGG ACCTGAACCG AGCCCCGTCC CGCCGCGGCG TCCTGGCCGC CGCCGCCGCC 
GTCGCCGGCG GCTTCAGCCT CGGCTTCCAC GTTCCCGGCG CGGGCGCCGA GGAGGCCGCG
CCGGGCGCCG CCGCGGCCTC CGCGCCCGAG ATCAATGCCT GGGTGGTGGT GAGGCCCGAC
GAGACCGTGG TGATCCGCAT CGCCCGCTCC GAGATGGGCC AGGGCACCCT GACGGGCCTC
GCCCAGCTCG TGGCCGAGGA GCTCGGCTGC GACTGGGACC GGGTCACGAC CGAGTACCCG
ACGCCGGGCC AGAACCTCGC CCGCCGCCGC GTCTGGGGCG ATTTCTCGAC CGGCGGCAGC
CGCGGCATCC GCGAATCCTA CGTCGCGGTC CGCCAGGGCG GCGCCGCCGC CCGCCTGATG
CTGGTCTCCG CGGCGGCGGC CGGATGGGGC GTGCCGGCCG GAGAGTGCCG GGTCGAGAAG
GGCGTGATCG CGCATCCGGG CTCGGGCCGC ACCACGACCT TCGGAGCGGT CGCCTCCCGC
GCCGCCGCCA TGCCGCCGCC CAAGGACGTT CCCCTCAAGG ATCCGAAGGA CTGGACCATC
GCCGGCAAGC CGCTGAAGCG GCTCGACACC GCCGCCAAGC TCGACGGCAG CCAGGTCTAC
GGCATGGACC TGCGGCTCGA CGGCATGCTC AACGCCGCGA TCCGCGACTG CCCGGTGGTG
GGCGGCCGGG TCAGGAGCTT CGACGCCGCC GCGGTGGAGC GGATGCCCGG CGTCAGGAAG
GTGGTGCCGG TCGGGGAGAC CGCCGTCGCG GTGGTGGCCG ACACCTTCTG GCGGGCCAGG
ACCGCCGTCG AGGCGCTGCC GATCGTCTGG GACGAGGGCC CGAACGCCGC CGCGTCGAGC
GCGACCATCG CCGCCATGCT GCGGGAGGGC CTCGACGCCG AGGAGGCCTT CGTCGGCAAC
AAGACCGGCG ACGCCAAGGC GGCCCTGCGG GACGCCGCCA GGGTGGTCGA GGCGACCTAC
GCGGTCCCGT TCCAGAACCA CGCCACCATG GAGCCGATGA ACGCCACCGC CCGCTGGACG
CCGGAGCGGT GCGAGGTCTG GACGCCGACC CAGAACGGCG AGGCGGCGCT GGCCGCGGCG
GCCGAGGCGG CCGGCCTCTC CCCGCGGCAA TGCGAGGTCT ACAAGATCCA CCTCGGCGGC
GGCTTCGGGC GGCGCGGCGC GACGCAGGAC TGGGTGCGGC AGGCGGTGCT GATCGCCCGG
GAGATGCCCG GGACGCCGAT CAAGCTGATC TGGACCCGCG AGGAGGACAT GACGCACGGC
CGCTACCACC CGGTCACGCA GTGCCGGATG CGCGCCGCCC TCGACCGGGA CGGACAGCTC
ACGGGCCTGC ACATGCGCAT CTCCGGCCAG TCGATCCTGG CGGCGATCGT GCCGGGCCGG
CTCGGGCCGG ACGGCAAGGA TCCGGTGACC TTCCAGGGCC TCAATCCCGG CGGGGCCGAG
GCAGCGATCG GCTACACGAT CCCCAACCTC CTGATCGACC ACGCGATGCG CAACCCGCAC
ATCCTGCCGG GCTTCTGGCG GGGCGTGAAC ACCAACCCGA ACGCCATCTA CCTCGAGTGC
TTCATGGACG AGGTGGCGCA CGCGGCCGGG CAGGATCCCC TCGCCTTCCG CCGCAAGCTG
ATGGCGAACC ATCCCAAGCA CCTCGCGGTG CTGAACGCGG TCGCGGAGCG GATCGGCTGG
GATCGCCCGC CGCCCGCGGG CGTGCATCGC GGCCTCGCCC AGATCATGGG CTTCGGCAGC
TACGTGGCGG GCGCCGCCGA GGTGTCGCTG CGCGACGACG GCGGCGTCCG GATCCACCGG
ATCGTCGCGG CGACGGATCC GGGCGTGGCG GTCAACCCGC AGCAGATCGC CGCCCAGGTC
GAGGGCTCCT TCGTGTACGG GCTCTCGGCG GCCCTCCACG GCGAGTGCAC GGTCAAGGAC
GGGCGCATCG AGCAGACCAA CTTCGACACC TACCCGGTGA TGCGGATGGA CGAGATGCCG
GCGGTCGAGG CGATCCTGAT GCCCTCGGGC GGCTTCATCG GCGGCGTCGG CGAGCCGACC
ATCGCGGTCG CGGCGCCGGC GGTCCTCAAC GCCGTCTTCG CGGCCACGGG CAAGCGGGTG
CGCCAGCTCC CGCTGCGCAA CACGGACCTG CGGCGCGCGT GA
 
Protein sequence
MTQDLNRAPS RRGVLAAAAA VAGGFSLGFH VPGAGAEEAA PGAAAASAPE INAWVVVRPD 
ETVVIRIARS EMGQGTLTGL AQLVAEELGC DWDRVTTEYP TPGQNLARRR VWGDFSTGGS
RGIRESYVAV RQGGAAARLM LVSAAAAGWG VPAGECRVEK GVIAHPGSGR TTTFGAVASR
AAAMPPPKDV PLKDPKDWTI AGKPLKRLDT AAKLDGSQVY GMDLRLDGML NAAIRDCPVV
GGRVRSFDAA AVERMPGVRK VVPVGETAVA VVADTFWRAR TAVEALPIVW DEGPNAAASS
ATIAAMLREG LDAEEAFVGN KTGDAKAALR DAARVVEATY AVPFQNHATM EPMNATARWT
PERCEVWTPT QNGEAALAAA AEAAGLSPRQ CEVYKIHLGG GFGRRGATQD WVRQAVLIAR
EMPGTPIKLI WTREEDMTHG RYHPVTQCRM RAALDRDGQL TGLHMRISGQ SILAAIVPGR
LGPDGKDPVT FQGLNPGGAE AAIGYTIPNL LIDHAMRNPH ILPGFWRGVN TNPNAIYLEC
FMDEVAHAAG QDPLAFRRKL MANHPKHLAV LNAVAERIGW DRPPPAGVHR GLAQIMGFGS
YVAGAAEVSL RDDGGVRIHR IVAATDPGVA VNPQQIAAQV EGSFVYGLSA ALHGECTVKD
GRIEQTNFDT YPVMRMDEMP AVEAILMPSG GFIGGVGEPT IAVAAPAVLN AVFAATGKRV
RQLPLRNTDL RRA