Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mchl_1891 |
Symbol | |
ID | 7113619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium chloromethanicum CM4 |
Kingdom | Bacteria |
Replicon accession | NC_011757 |
Strand | + |
Start bp | 1952708 |
End bp | 1954600 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643524655 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_002420682 |
Protein GI | 218529866 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0976553 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.536126 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CCGTCCGTCC CAAAGATCTT CCCAACAGCA ATCCCGCCAG CGTGACCACC GGCCCCGTGC AGGGCTCGCG CAAGGTCTAT GCGGAGGCGC CCGGCCGCCC CGACATCCGC GTGCCCTACC GGGAGATCGC CCTCTCCGAC CCGAAGGAGG AGCCGGTGCG AGTCTACGAC CCGTCGGGCC CCTACACCGA GACCGATGCG GCCATCGACC TCGAGAAGGG TCTGGCCCCG GTCCGCGAGC CGTGGATCGT CGGGCGCGGC TACGCGGCGG TGAAGCCGCG CGAAGTGAAG CCGGAGGACA ACGGATTCGC GGCTGCCGAC AAGCTCGTGG CGCCGTGCCC CGCCGAGCGG ACGATCCGCC GGGCCGAGCC GGGGCAGTTG GTCACGCAGT ACGAATTCGC CCGCGCCGGG ATCATCACGG AAGAGATGAT CTACGTGGCG CATCGCGAGA ACGCCTGTCG CGCGCAGATG CTGGAGCGGG CGGAAGCCGC GCTCGCCGAC GGCGACAGCT TCGGCGCGGC GGTGCCGCCC TTCATCACGC CCGAATTCGT CCGCGACGAG GTGGCCCGCG GCCGCGCGAT CATCCCGGCC AACATCAACC ACCTCGAACT CGAGCCGATG GCGATCGGCC GCAATTTTTT GGTGAAAATC AACGCCAATA TCGGCAACTC GGCGGTGACG TCTTCCGCGG CTGAGGAAGT CGAAAAACTG GTCTGGTCGA TCCGCTGGGG CGCGGACACG GTCATGGACC TCTCGACGGG CCGCAACATC CACAACATCC GCTCGTGGAT CGTGCGCAAC TCGCCCGTGC CGATCGGCAC CGTGCCGATC TATCAGGCGC TGGAAAAGGT CGGCGGCGAC CCGCTGAAGC TCGATTGGGA GGTGTTCAAG GACACGCTCA TCGAGCAGGC CGAGCAGGGC ATCGACTACT TCACGATCCA TGCCGGCGTG CGGCTGGCTC ACGTGCCGCT GACCGCGCGG CGCACCACCG GCATCGTGTC GCGCGGCGGC TCGATCATGG CGCGCTGGTG CCTCGCCGGG CACCGCGAAT CGTTCCTCTA TGAGCGGTTC GACGAGATCT GCGACATCAT GCGGGCCTAC GACGTGTCGT TCTCGCTCGG CGACGGCCTG CGCCCGGGCT CGATCGCGGA TGCCAACGAC GCGGCCCAGT TCGCCGAGCT GGAGACCCTG GGCGAACTCA CCAAGATCGC CTGGGACAAG GGCTGCCAGA CCATGATCGA GGGCCCCGGC CACGTGCCGA TGCACAAGAT CAAGGTCAAC ATGGAGAAGC AGCTGCGCGA GTGCGGCGAG GCGCCGTTCT ACACCCTCGG CCCGCTGACC ACCGACATCG CTCCGGGCTA CGACCACATC ACCTCGGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGCA CGGCGATGCT CTGCTACGTC ACGCCGAAGG AGCATCTCGG CCTGCCGAAC CGCGACGACG TGAAGACCGG CGTCATCACC TACAAGATCG CCGCGCACGC CGCCGACCTC GCCAAGGGTC ACCCCGCCGC GCAGCTCCGC GACGACGCCC TCAGCCGCGC CCGGTTCGAC TTCCGTTGGG AGGACCAGTT CAACCTCTCG CTGGATCCCG ACACGGCGCG CGCCTACCAC GACGAGACCC TGCCGAAGGA CGCGCACAAG GTCGCCCATT TCTGCTCGAT GTGCGGCCCG AAATTCTGCT CGATGAAGAT CACGCAGGAT CTGCGCGCCG ACGTGCTCGC CATGGAGGAG GCCGGTATCG TCATCGGCCA AGCCCAGCCG ATGAGCGACG CCGAGCGCCA GGCCGGCATG GCGGCCAAGT CGCAGGAGTT CCTGGAAGAG GGCGGCAAGC TCTACGTCGA CGCGGCGGAG TAA
|
Protein sequence | MNAPVRPKDL PNSNPASVTT GPVQGSRKVY AEAPGRPDIR VPYREIALSD PKEEPVRVYD PSGPYTETDA AIDLEKGLAP VREPWIVGRG YAAVKPREVK PEDNGFAAAD KLVAPCPAER TIRRAEPGQL VTQYEFARAG IITEEMIYVA HRENACRAQM LERAEAALAD GDSFGAAVPP FITPEFVRDE VARGRAIIPA NINHLELEPM AIGRNFLVKI NANIGNSAVT SSAAEEVEKL VWSIRWGADT VMDLSTGRNI HNIRSWIVRN SPVPIGTVPI YQALEKVGGD PLKLDWEVFK DTLIEQAEQG IDYFTIHAGV RLAHVPLTAR RTTGIVSRGG SIMARWCLAG HRESFLYERF DEICDIMRAY DVSFSLGDGL RPGSIADAND AAQFAELETL GELTKIAWDK GCQTMIEGPG HVPMHKIKVN MEKQLRECGE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV TPKEHLGLPN RDDVKTGVIT YKIAAHAADL AKGHPAAQLR DDALSRARFD FRWEDQFNLS LDPDTARAYH DETLPKDAHK VAHFCSMCGP KFCSMKITQD LRADVLAMEE AGIVIGQAQP MSDAERQAGM AAKSQEFLEE GGKLYVDAAE
|
| |