Gene Information |
Locus tag | Mext_1610 |
Symbol | |
ID | 5834179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium extorquens PA1 |
Kingdom | Bacteria |
Replicon accession | NC_010172 |
Strand | + |
Start bp | 1798088 |
End bp | 1799980 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641367408 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001639080 |
Protein GI | 163851037 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
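The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length_bp = gene_length(1798088, 1799980)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1893 630
```

Under these assumptions the result agrees with the Gene Length (1893 bp) and Protein Length (630 aa) fields.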
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0307755 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.0806317 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAC CCGTCCGTCC CAAAGATCTT CCCAACAGCA ATCCCGCCAG CGTGACCACC GGCCCCGTGC AGGGCTCGCG CAAGGTCTAT GCGGAGGCGC CCGGCCGCCC CGACATTCGC GTGCCCTACC GGGAGATCGC CCTCTCCGAC CCGAAGGAGG AGCCGGTGCG GGTCTACGAC CCGTCGGGCC CCTACACCGA GACCGATGCG GCCATCGACC TCGAGAAGGG TCTGGCCCCG GTCCGCGAGC CGTGGATCGT CGGGCGCGGC TACGCCGCCG TGAAGCCGCG CGAGGTGAAG CCGGAGGACA ACGGATTCGC GGCTGCCGAC AAGCTCGTGG CGCCATGCCC CGCCGAGCGG ACGATCCGCC GGGCCGAGCC GGGGCAGTTG GTGACGCAGT ACGAATTCGC CCGCGCCGGG ATCATCACGG AAGAGATGAT CTATGTGGCG CATCGCGAGA ACGCCTGCCG CGCGCAGATG CTGGAGCGGG CGCAAGCCGC GCTCGCCGAC GGCGACAGTT TCGGCGCAGC GGTGCCGCCC TTTATCACGC CCGAATTCGT CCGCGACGAG GTGGCCCGCG GGCGTGCCAT CATCCCGGCC AACATCAACC ACCTCGAACT CGAGCCGATG GCGATCGGCC GCAATTTTTT GGTGAAAATC AACGCCAATA TCGGCAACTC GGCGGTGACG TCTTCCGCGG CTGAGGAAGT CGAAAAACTG GTCTGGTCGA TCCGCTGGGG TGCGGACACG GTCATGGACC TCTCGACGGG CCGCAACATC CACAACATCC GCTCGTGGAT CGTGCGCAAC TCGCCCGTCC CGATCGGCAC CGTGCCGATC TATCAGGCGC TGGAAAAGGT CGGCGGCGAC CCGCTGAAGC TCGATTGGGA GGTGTTCAAG GACACGCTCA TCGAACAGGC CGAGCAGGGC ATCGACTACT TCACGATCCA TGCCGGCGTG CGGCTGGCCC ATGTGCCGCT GACCGCGCGG CGCACCACCG GCATCGTGTC GCGCGGCGGC TCGATCATGG CGCGCTGGTG CCTCGCCGGG CACCGCGAAT CGTTCCTCTA CGAGCGGTTC GACGAGATCT GCGACATCAT GCGGGCCTAC GACGTGTCGT TCTCGCTCGG CGACGGCCTG CGTCCCGGCT CGATTGCGGA TGCCAACGAC GCGGCCCAGT TCGCCGAACT GGAGACCCTG GGCGAACTCA CCAAGATCGC CTGGGACAAG GGCTGCCAGA CCATGATCGA GGGCCCCGGC CACGTGCCGA TGCACAAGAT CAAGGTCAAC ATGGAGAAGC AGCTGCGCGA GTGCGGCGAG GCGCCGTTCT ACACCCTCGG CCCGCTGACC ACCGACATCG CGCCGGGCTA CGACCATATC ACCTCGGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGCA CGGCGATGCT CTGCTACGTC ACGCCGAAGG AGCATCTCGG GCTGCCGAAC CGCGACGACG TGAAGACCGG CGTCATCACC TACAAGATCG CCGCGCACGC CGCCGACCTC GCCAAGGGCC ACCCCGCCGC GCAGCTCCGC GACGACGCCC TCAGCCGCGC CCGGTTCGAC TTCCGCTGGG AGGACCAGTT CAACCTCTCG CTGGATCCCG ACACGGCGCG CGCCTACCAC GACGAGACCC TGCCGAAGGA CGCGCACAAG GTCGCCCATT TCTGCTCGAT GTGCGGCCCG AAATTCTGCT CGATGAAGAT CACGCAGGAT CTGCGCGCCG ACGTGCTCGC CATGGAAGAG GCCGGCATCG TCATCGGCCA AGCCCAGCCG 
ATGAGCGACG CCGAGCGCCA GGCCGGCATG GCGGCCAAGT CGCAGGAGTT CCTGGAAGAG GGCGGCAAGC TCTACGTCGA CGCGGCGGAG TAA
|
Protein sequence | MNAPVRPKDL PNSNPASVTT GPVQGSRKVY AEAPGRPDIR VPYREIALSD PKEEPVRVYD PSGPYTETDA AIDLEKGLAP VREPWIVGRG YAAVKPREVK PEDNGFAAAD KLVAPCPAER TIRRAEPGQL VTQYEFARAG IITEEMIYVA HRENACRAQM LERAQAALAD GDSFGAAVPP FITPEFVRDE VARGRAIIPA NINHLELEPM AIGRNFLVKI NANIGNSAVT SSAAEEVEKL VWSIRWGADT VMDLSTGRNI HNIRSWIVRN SPVPIGTVPI YQALEKVGGD PLKLDWEVFK DTLIEQAEQG IDYFTIHAGV RLAHVPLTAR RTTGIVSRGG SIMARWCLAG HRESFLYERF DEICDIMRAY DVSFSLGDGL RPGSIADAND AAQFAELETL GELTKIAWDK GCQTMIEGPG HVPMHKIKVN MEKQLRECGE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV TPKEHLGLPN RDDVKTGVIT YKIAAHAADL AKGHPAAQLR DDALSRARFD FRWEDQFNLS LDPDTARAYH DETLPKDAHK VAHFCSMCGP KFCSMKITQD LRADVLAMEE AGIVIGQAQP MSDAERQAGM AAKSQEFLEE GGKLYVDAAE
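The relationship between the gene sequence and the protein sequence can be illustrated on the first 30 bases of the record. This is a sketch only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is deliberately partial, covering just the codons in this excerpt (their assignments are the same in the standard code and in translation table 11). A full translation would need the complete table, for example from a sequence-analysis library.

```python
# Partial codon table: only the codons that occur in the excerpt below.
CODONS = {
    "ATG": "M", "AAC": "N", "GCA": "A", "CCC": "P", "GTC": "V",
    "CGT": "R", "AAA": "K", "GAT": "D", "CTT": "L",
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; raises KeyError for codons not in CODONS."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODONS[seq[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base blocks of the Gene sequence field.
excerpt = "ATGAACGCAC CCGTCCGTCC CAAAGATCTT"
print(translate(excerpt))  # MNAPVRPKDL
```

The output matches the first block of the protein sequence. Applying `gc_fraction` to the full gene sequence, rather than this short excerpt, would be the way to compare against the GC content field.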