Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_3072 |
Symbol | |
ID | 7118350 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 3253109 |
End bp | 3255079 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643525823 |
Product | putative molybdopterin biosynthesis protein MoeA/LysR substrate binding-domain-containing protein |
Protein accession | YP_002421838 |
Protein GI | 218531022 |
COG category | [H] Coenzyme transport and metabolism [P] Inorganic ion transport and metabolism |
COG ID | [COG0303] Molybdopterin biosynthesis enzyme [COG1910] Periplasmic molybdate-binding protein/domain |
TIGRFAM ID | [TIGR00177] molybdenum cofactor synthesis domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0205551 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.985751 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGCGG ATCGCGAGAC GGATTTTACC CGGCGCCTCG CCGCGGCCGC CCGGCAGGAG CAGTTCCTGA CGGTGATGAG CCGCGAGGAC GCGCATGCGG CCTTCCGCGC GGCCCTTCCC CACGCGACGC TTCCACCGGA GACCGTTCCG CTGGCCCAGG CGCTCGGGCG GGTGCTCGCC GGCGACATCG CCTCGCCGAT CGACGTGCCC CCCTTCGACC GCGCCTTGGT GGACGGCTTT GCGCTGCGGG CCGCCGACAC CGAGGGCGCC AACGCCGCGC GGCCCCGCCG GCTCGCCCTC AACCGCGAGA TCCTGGCCTG CGGCGTCGCC CCAACGGGTT CGGTCGCCAC AGACACCGCG ACGCCGATCG CCACCGGCGG GATGATCCCG CGCGGGGCCG ACGCCGTGGT GATGGTCGAG CAGACCGAGT TCTTCGAAGA CGCCATCGAC GTGACGGGTC CGGTCCGGCC GGGGCAGTTC GTCGGCTATG CCGGCGCCGA CATGGCCTCG GGCGAAACTG TCCTGCGTAA GGGAGCGGTG GTGACCGCCC GCGAGATCGG CATGCTCGCC GCCTGTGGGC TCGCCGAGAT CGCGGTGGTG CGCCGGCCCC GCGTCGCCGT ACTCTCGACG GGCGACGAGC TGGTCGCGCC GGGCGGCGAA CTACGACCGG GCGCCATCTA CGATTCGAAC GGCGCCATCG TCGCGGCCTC GGTCGCCGAG AACGGCGGCG AGCCCGTGCC GCTCGGCATC GTCCGCGACG ACGAGGCGGC GCTCGACGCC GCCCTGCGCG ATGCGCTGAC CAGGAGCGAC CTCGTCGTGC TCTCCGGCGG CACCTCGAAG GGCGCGGGCG ATGTCTCGCA CCGCATCCTG TCGCGGCTGG GCCCGCCCGG CATCCTCGTC CACGGCGTCG CGCTGAAACC CGGCAAGCCG CTCTGCCTCG CCGTCACCGA GGGCAAGGCG GTGGTGGTGC TACCAGGCTT CCCGACCTCG GCGATGTTTA CCTTCCACGA ATTCGTAGTG CCGCTGGTGC GCGCGCTGGC CGGCCTGCCG CCGCGGGAGG AGGAGGCAGT GTCGGCGCGC CTGCCGCAGC GGCTGACCTC CGAACTCGGC CGCACCGAAT TCGTGATGGC CTCGCTCGCT CAGGCAGCGG ACGGCTTGGT CGCCCTGCCG CTGCCGAAAG GCTCCGGCTC CGTTACCGCC TTCTCGCAGG CCGATGGCTT CTTCGCCGTG CCGGCCGCGC GCTCAGGCGT CGAGGCGGGA GAAACGGTCT CGGTGGTGCG CCTCGGGGCC GGCGTGCGGC CGCCGGACCT CACCATCATC GGCAGCCACT GCATCGGGCT CGACCGGGTG GTCGGACTGC TGGCCGAGCA AGGCTTTCGC GCTCGCACCG TGTGGGTCGG CTCGGCCGGG GGGCTCGCGG CCCTGCGCCG GGGCGAATGC GACCTGGCCG CCATGCACCT GCTCGACCCC GAGACCGGCC GCTACAACGC GCCCTTCCTC GAACCCGGCA TGACACTCGC CCTCGGCTGG CGCCGGCTCC AGGGCGTGGT GTTCCGTAAG AACGATGCCC GCTTCGAGGG CCGAAGCGCC GCCGACGCGG TGAACGCGGC GCTGGCCGAC CCCGACGCGG TGATGGTCAA CCGCAACGCC GGCTCCGGCA CCCGCCTCCT CGTCGATGGC CTGATCGGCG CGGCCCGGCG GGCCGGCTTC TGGAACCAGC CGCGCTCGCA CAACGCCGTC GCGGCAGCGG TGGCACAGGG CCGAGCCGAT TGGGGTGTCG CGATCGCGAG CGTCGCAACG GCCTATGGCC TCGGCTTCCT GCCGCTGGCG CAGGAGCATT ACGACTTCGC CTATCGCCAG GCGGACCGCG AGAAGCCCGC GCTCGCCGCC TTCCTGGCGC TGCTCGGCAC ACGCGCGGCG GATGCAGCTC TGAACGAACT CGGGTTCGAA CCCGGCGGAG GCGAATCGTG A
|
Protein sequence | MSADRETDFT RRLAAAARQE QFLTVMSRED AHAAFRAALP HATLPPETVP LAQALGRVLA GDIASPIDVP PFDRALVDGF ALRAADTEGA NAARPRRLAL NREILACGVA PTGSVATDTA TPIATGGMIP RGADAVVMVE QTEFFEDAID VTGPVRPGQF VGYAGADMAS GETVLRKGAV VTAREIGMLA ACGLAEIAVV RRPRVAVLST GDELVAPGGE LRPGAIYDSN GAIVAASVAE NGGEPVPLGI VRDDEAALDA ALRDALTRSD LVVLSGGTSK GAGDVSHRIL SRLGPPGILV HGVALKPGKP LCLAVTEGKA VVVLPGFPTS AMFTFHEFVV PLVRALAGLP PREEEAVSAR LPQRLTSELG RTEFVMASLA QAADGLVALP LPKGSGSVTA FSQADGFFAV PAARSGVEAG ETVSVVRLGA GVRPPDLTII GSHCIGLDRV VGLLAEQGFR ARTVWVGSAG GLAALRRGEC DLAAMHLLDP ETGRYNAPFL EPGMTLALGW RRLQGVVFRK NDARFEGRSA ADAVNAALAD PDAVMVNRNA GSGTRLLVDG LIGAARRAGF WNQPRSHNAV AAAVAQGRAD WGVAIASVAT AYGLGFLPLA QEHYDFAYRQ ADREKPALAA FLALLGTRAA DAALNELGFE PGGGES
|
| |