Gene Mext_1610 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMext_1610 
Symbol 
ID5834179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium extorquens PA1 
KingdomBacteria 
Replicon accessionNC_010172 
Strand
Start bp1798088 
End bp1799980 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content67% 
IMG OID641367408 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001639080 
Protein GI163851037 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0307755 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.0806317 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CCGTCCGTCC CAAAGATCTT CCCAACAGCA ATCCCGCCAG CGTGACCACC 
GGCCCCGTGC AGGGCTCGCG CAAGGTCTAT GCGGAGGCGC CCGGCCGCCC CGACATTCGC
GTGCCCTACC GGGAGATCGC CCTCTCCGAC CCGAAGGAGG AGCCGGTGCG GGTCTACGAC
CCGTCGGGCC CCTACACCGA GACCGATGCG GCCATCGACC TCGAGAAGGG TCTGGCCCCG
GTCCGCGAGC CGTGGATCGT CGGGCGCGGC TACGCCGCCG TGAAGCCGCG CGAGGTGAAG
CCGGAGGACA ACGGATTCGC GGCTGCCGAC AAGCTCGTGG CGCCATGCCC CGCCGAGCGG
ACGATCCGCC GGGCCGAGCC GGGGCAGTTG GTGACGCAGT ACGAATTCGC CCGCGCCGGG
ATCATCACGG AAGAGATGAT CTATGTGGCG CATCGCGAGA ACGCCTGCCG CGCGCAGATG
CTGGAGCGGG CGCAAGCCGC GCTCGCCGAC GGCGACAGTT TCGGCGCAGC GGTGCCGCCC
TTTATCACGC CCGAATTCGT CCGCGACGAG GTGGCCCGCG GGCGTGCCAT CATCCCGGCC
AACATCAACC ACCTCGAACT CGAGCCGATG GCGATCGGCC GCAATTTTTT GGTGAAAATC
AACGCCAATA TCGGCAACTC GGCGGTGACG TCTTCCGCGG CTGAGGAAGT CGAAAAACTG
GTCTGGTCGA TCCGCTGGGG TGCGGACACG GTCATGGACC TCTCGACGGG CCGCAACATC
CACAACATCC GCTCGTGGAT CGTGCGCAAC TCGCCCGTCC CGATCGGCAC CGTGCCGATC
TATCAGGCGC TGGAAAAGGT CGGCGGCGAC CCGCTGAAGC TCGATTGGGA GGTGTTCAAG
GACACGCTCA TCGAACAGGC CGAGCAGGGC ATCGACTACT TCACGATCCA TGCCGGCGTG
CGGCTGGCCC ATGTGCCGCT GACCGCGCGG CGCACCACCG GCATCGTGTC GCGCGGCGGC
TCGATCATGG CGCGCTGGTG CCTCGCCGGG CACCGCGAAT CGTTCCTCTA CGAGCGGTTC
GACGAGATCT GCGACATCAT GCGGGCCTAC GACGTGTCGT TCTCGCTCGG CGACGGCCTG
CGTCCCGGCT CGATTGCGGA TGCCAACGAC GCGGCCCAGT TCGCCGAACT GGAGACCCTG
GGCGAACTCA CCAAGATCGC CTGGGACAAG GGCTGCCAGA CCATGATCGA GGGCCCCGGC
CACGTGCCGA TGCACAAGAT CAAGGTCAAC ATGGAGAAGC AGCTGCGCGA GTGCGGCGAG
GCGCCGTTCT ACACCCTCGG CCCGCTGACC ACCGACATCG CGCCGGGCTA CGACCATATC
ACCTCGGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGCA CGGCGATGCT CTGCTACGTC
ACGCCGAAGG AGCATCTCGG GCTGCCGAAC CGCGACGACG TGAAGACCGG CGTCATCACC
TACAAGATCG CCGCGCACGC CGCCGACCTC GCCAAGGGCC ACCCCGCCGC GCAGCTCCGC
GACGACGCCC TCAGCCGCGC CCGGTTCGAC TTCCGCTGGG AGGACCAGTT CAACCTCTCG
CTGGATCCCG ACACGGCGCG CGCCTACCAC GACGAGACCC TGCCGAAGGA CGCGCACAAG
GTCGCCCATT TCTGCTCGAT GTGCGGCCCG AAATTCTGCT CGATGAAGAT CACGCAGGAT
CTGCGCGCCG ACGTGCTCGC CATGGAAGAG GCCGGCATCG TCATCGGCCA AGCCCAGCCG
ATGAGCGACG CCGAGCGCCA GGCCGGCATG GCGGCCAAGT CGCAGGAGTT CCTGGAAGAG
GGCGGCAAGC TCTACGTCGA CGCGGCGGAG TAA
 
Protein sequence
MNAPVRPKDL PNSNPASVTT GPVQGSRKVY AEAPGRPDIR VPYREIALSD PKEEPVRVYD 
PSGPYTETDA AIDLEKGLAP VREPWIVGRG YAAVKPREVK PEDNGFAAAD KLVAPCPAER
TIRRAEPGQL VTQYEFARAG IITEEMIYVA HRENACRAQM LERAQAALAD GDSFGAAVPP
FITPEFVRDE VARGRAIIPA NINHLELEPM AIGRNFLVKI NANIGNSAVT SSAAEEVEKL
VWSIRWGADT VMDLSTGRNI HNIRSWIVRN SPVPIGTVPI YQALEKVGGD PLKLDWEVFK
DTLIEQAEQG IDYFTIHAGV RLAHVPLTAR RTTGIVSRGG SIMARWCLAG HRESFLYERF
DEICDIMRAY DVSFSLGDGL RPGSIADAND AAQFAELETL GELTKIAWDK GCQTMIEGPG
HVPMHKIKVN MEKQLRECGE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV
TPKEHLGLPN RDDVKTGVIT YKIAAHAADL AKGHPAAQLR DDALSRARFD FRWEDQFNLS
LDPDTARAYH DETLPKDAHK VAHFCSMCGP KFCSMKITQD LRADVLAMEE AGIVIGQAQP
MSDAERQAGM AAKSQEFLEE GGKLYVDAAE