Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | M446_4793 |
Symbol | |
ID | 6129406 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 5266262 |
End bp | 5269039 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 641644930 |
Product | aldehyde oxidase and xanthine dehydrogenase molybdopterin binding |
Protein accession | YP_001771557 |
Protein GI | 170742902 |
COG category | [C] Energy production and conversion |
COG ID | [COG1529] Aerobic-type carbon monoxide dehydrogenase, large subunit CoxL/CutL homologs [COG2080] Aerobic-type carbon monoxide dehydrogenase, small subunit CoxS/CutS homologs |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.515425 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0128393 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTGA CGGTCAACGG CCTCTCCCAG GAGGCGGCGC CGCGCCCCGG CCAGTGCCTG CGCACCCTGC TGCGCGACCT CGGCTGGTTC GGCGTGAAGA AGGGCTGCGA CGCCGGCGAT TGCGGCGCCT GCACGGTCCA TCTCGACGGG GAGCCGGTGC ATAGCTGCCT CGTCCCGGCC CTGCGGGCGG AGGGGCGCGC CGTCACGACG ATCGAGGGCC TGTCCGGCCC CTGCGGGCCC GATGCGGCGG TGCCCGACCG GCTGCACCCG GTCCAGGAGG CCTTCCTGGC CGCGCAGGGC TTCCAGTGCG GCTTCTGCAC GCCTGGCATG GTGATGACCG CCGCCGCCCT CGACCAGGGC CAGCGGCGCG ACCTCGGCGC CGCCCTCAAG GGCAATCTCT GCCGCTGCAC CGGCTACCGG GCGATCGCGG ACGCGATCGC CGGGATCGCC GACGCGATCG CCGGCAGCGC GGACGCGAGT GCGGGCAGCG TCGCGGCCGA GGGCGACGGG GCGGGGGGTG ATCCCTGCGG CCGCAGCCTG CCCGCCCCGG CCGGGCCGGC GGTGGTGAGC GGGCGGGCGC GCTACACGTT CGACCTCGCG GTCGAGGGGT TGCTCCACCT CAAGGTGCTG CGCTCGCCCC ACGCCCACGC GGAGATCCTG CAGGTCGACC GCGCCGCCGC CCTCGCGGTG CCGGGGGTGG TGGCGGTATT CACCCACGAG GACGTGCCGG CATCCCGCTA CTCGACCGGG CGCCACGAGG ATCCGCGCGA CGACGCGCCC GACACCCTGA TGCTCGACCG GATCGTGCGC TTCGTCGGGC AGCGGGTCGC CGCGGTGGTG GCGGAGAGCG AGGCGGCGGC CGAGGCGGGC GTGGCCGCCC TCGTCGTGAC CTACGCGCCG CGCCCGGCCC TGCTCGATGC CGAGCGGGCC CTCGATCCGG ACGCGCCGCG CGTCCACGAT CCGGGGCCGC CCGGCGCCGA CGCCCCGCCG CTCCACCCTC ATCCGAACAT CGCGGCCGAG GTGCACGGAC AGGTCGGCGA CGTCGAGGCC GGCTTCGCCG AGGCCGACCT GGTGATCGAG GGCACGTACC GCTCGCAGCG CATCCAGCAC GCCCACCTGG AGACGCACGG GGCGCTGGGC TGGCGCGACG AGGCGGGCCG CCTCGTCCTT CGCACCAGCA GCCAGGTGCC CTTCCTGACC CGCGACGCCC TCGCGGCGCT GCTGGGCCTC GACCGGGCGC AGGTGCGGGT GCTGTGCGGG CGGGTCGGCG GCGGCTTCGG CGGCAAGCAG GAGATGCTGA CCGAGGACTT GGTGGCCCTG GCGGTGCTGC GCCTCGGGCG GCCGGTGAAG TGGGAATTCA CGCGCGGCGA GCAGTTCACC GGGGCGACGA CCCGCCACCC GATGCGGGTC CGCGTCAAGC TCGGCGCGCG GCGCGACGGC ACGCTCACCG CCATCGCCCT CGACGTGCTC GCCGAGACCG GCGCCTACGC CAACCACGCG GGCGGCGTGC TCCACCACGG CTGCAACGAG GTGATCGGCG TCTATCGCTG CCCGAACAAG CGGGTGGACG GCGTCTCGGT CTACACCCAC ACGGTGCCGG CGGGGGCCTT CCGGGGCTAC GGCCTGAGCC AGACCATCTT CGCGGTCGAA TCGGCGATGG ACGACCTCGC GCGCGGCCTC GGCCTCGACC CCTACCTCCT GCGCCGCCGC AACGCGGTGC GGCCGGGCGA TCCCCTGGTC TCGACCAGCC TGGAGCCCCA CGACGTCGCC TACGGCTCCT ACGGGCTCGA CCAATGCCTC GACCGCGCCG AGGCGGCGAT GCGGGAGCCC GGCGGCGAGG CCCCGCCCGG GCCCGGCTGG CGCGTCGGCG AGGGCATGGC GATGGCGATG ATCGACACGA TCCCGCCCCG CGGGCACCGC GCCGAGGCCC GCCTCTCGCT CACCGGGGCG GGCACCTACG CGCTCGCGGT CGGCACCGCC GAGTTCGGCA ACGGCACCGC GACGGTGCAC GGGCAGATCG CCGCCTCGGT GCTCGGGACG CGGCCGGGGC GGGTGCGCCT GCACGCCTCC GACACCGACG CGGTCGGCCA CGACACCGGC GCCTACGGCA GCACCGGCAC GGTGGTGGCC GGGCAGGCGA CGTTGCGGGC GGCGGAGGAC CTGGCGCGGG CGATCCGCGC CGCGGCGGCG GCCCGCACCG GCACGGACCC GGCGGCGTGC CGGCTCGCGG GCGAGGCGGT CGAGACCCCG GCCGGCCCGG TGCCGCTCGC GGATCTCGCG CCGCTCGACG CGGTCGGGCG CGCGGATGGC AGCCCGCGCT CCGTGGCCTT CAACGTGCAG GCTTTCCGGG TCGCCGTGCA TCCGGGGACC GGCGAGGTGC GGATCCTGCG CAGCGTGCAC GCGGCGGATG CGGGCCGGGT CATCAACCCG ATGCAGTGCC GCGGCCAGAT CGAGGGCGGC GTGGCGCAGG CGCTCGGCGC GGCGCTCTAC GAGGAGGTGC GCCTCGACGG CGCGGGCCGC GTGGAGACCC AGAGCTTCCG CAGCTACCAT ATCCCGGCCT TCGCGGACGT GCCGCGCACC GAGGTCCTCT TCGCCGACAC CTACGACCGG ATCGGTCCGC TCGGCGCGAA ATCGATGAGC GAGAGCCCGT TCAACCCGGT GGCGGCGGCG CTCGGCAACG CCATCCGCGA CGCGACCGGT GCCCGGCTCA CCGAGACTCC CTTCGCGCCG GACCGGATCT ACCGGGCGGT CGCGGCGGCG CGCACCTGCG GGTCCTGA
|
Protein sequence | MMLTVNGLSQ EAAPRPGQCL RTLLRDLGWF GVKKGCDAGD CGACTVHLDG EPVHSCLVPA LRAEGRAVTT IEGLSGPCGP DAAVPDRLHP VQEAFLAAQG FQCGFCTPGM VMTAAALDQG QRRDLGAALK GNLCRCTGYR AIADAIAGIA DAIAGSADAS AGSVAAEGDG AGGDPCGRSL PAPAGPAVVS GRARYTFDLA VEGLLHLKVL RSPHAHAEIL QVDRAAALAV PGVVAVFTHE DVPASRYSTG RHEDPRDDAP DTLMLDRIVR FVGQRVAAVV AESEAAAEAG VAALVVTYAP RPALLDAERA LDPDAPRVHD PGPPGADAPP LHPHPNIAAE VHGQVGDVEA GFAEADLVIE GTYRSQRIQH AHLETHGALG WRDEAGRLVL RTSSQVPFLT RDALAALLGL DRAQVRVLCG RVGGGFGGKQ EMLTEDLVAL AVLRLGRPVK WEFTRGEQFT GATTRHPMRV RVKLGARRDG TLTAIALDVL AETGAYANHA GGVLHHGCNE VIGVYRCPNK RVDGVSVYTH TVPAGAFRGY GLSQTIFAVE SAMDDLARGL GLDPYLLRRR NAVRPGDPLV STSLEPHDVA YGSYGLDQCL DRAEAAMREP GGEAPPGPGW RVGEGMAMAM IDTIPPRGHR AEARLSLTGA GTYALAVGTA EFGNGTATVH GQIAASVLGT RPGRVRLHAS DTDAVGHDTG AYGSTGTVVA GQATLRAAED LARAIRAAAA ARTGTDPAAC RLAGEAVETP AGPVPLADLA PLDAVGRADG SPRSVAFNVQ AFRVAVHPGT GEVRILRSVH AADAGRVINP MQCRGQIEGG VAQALGAALY EEVRLDGAGR VETQSFRSYH IPAFADVPRT EVLFADTYDR IGPLGAKSMS ESPFNPVAAA LGNAIRDATG ARLTETPFAP DRIYRAVAAA RTCGS
|
| |