Gene Mchl_1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMchl_1891 
Symbol 
ID7113619 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylobacterium chloromethanicum CM4 
KingdomBacteria 
Replicon accessionNC_011757 
Strand
Start bp1952708 
End bp1954600 
Gene Length1893 bp 
Protein Length630 aa 
Translation table11 
GC content68% 
IMG OID643524655 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002420682 
Protein GI218529866 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0976553 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.536126 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCAC CCGTCCGTCC CAAAGATCTT CCCAACAGCA ATCCCGCCAG CGTGACCACC 
GGCCCCGTGC AGGGCTCGCG CAAGGTCTAT GCGGAGGCGC CCGGCCGCCC CGACATCCGC
GTGCCCTACC GGGAGATCGC CCTCTCCGAC CCGAAGGAGG AGCCGGTGCG AGTCTACGAC
CCGTCGGGCC CCTACACCGA GACCGATGCG GCCATCGACC TCGAGAAGGG TCTGGCCCCG
GTCCGCGAGC CGTGGATCGT CGGGCGCGGC TACGCGGCGG TGAAGCCGCG CGAAGTGAAG
CCGGAGGACA ACGGATTCGC GGCTGCCGAC AAGCTCGTGG CGCCGTGCCC CGCCGAGCGG
ACGATCCGCC GGGCCGAGCC GGGGCAGTTG GTCACGCAGT ACGAATTCGC CCGCGCCGGG
ATCATCACGG AAGAGATGAT CTACGTGGCG CATCGCGAGA ACGCCTGTCG CGCGCAGATG
CTGGAGCGGG CGGAAGCCGC GCTCGCCGAC GGCGACAGCT TCGGCGCGGC GGTGCCGCCC
TTCATCACGC CCGAATTCGT CCGCGACGAG GTGGCCCGCG GCCGCGCGAT CATCCCGGCC
AACATCAACC ACCTCGAACT CGAGCCGATG GCGATCGGCC GCAATTTTTT GGTGAAAATC
AACGCCAATA TCGGCAACTC GGCGGTGACG TCTTCCGCGG CTGAGGAAGT CGAAAAACTG
GTCTGGTCGA TCCGCTGGGG CGCGGACACG GTCATGGACC TCTCGACGGG CCGCAACATC
CACAACATCC GCTCGTGGAT CGTGCGCAAC TCGCCCGTGC CGATCGGCAC CGTGCCGATC
TATCAGGCGC TGGAAAAGGT CGGCGGCGAC CCGCTGAAGC TCGATTGGGA GGTGTTCAAG
GACACGCTCA TCGAGCAGGC CGAGCAGGGC ATCGACTACT TCACGATCCA TGCCGGCGTG
CGGCTGGCTC ACGTGCCGCT GACCGCGCGG CGCACCACCG GCATCGTGTC GCGCGGCGGC
TCGATCATGG CGCGCTGGTG CCTCGCCGGG CACCGCGAAT CGTTCCTCTA TGAGCGGTTC
GACGAGATCT GCGACATCAT GCGGGCCTAC GACGTGTCGT TCTCGCTCGG CGACGGCCTG
CGCCCGGGCT CGATCGCGGA TGCCAACGAC GCGGCCCAGT TCGCCGAGCT GGAGACCCTG
GGCGAACTCA CCAAGATCGC CTGGGACAAG GGCTGCCAGA CCATGATCGA GGGCCCCGGC
CACGTGCCGA TGCACAAGAT CAAGGTCAAC ATGGAGAAGC AGCTGCGCGA GTGCGGCGAG
GCGCCGTTCT ACACCCTCGG CCCGCTGACC ACCGACATCG CTCCGGGCTA CGACCACATC
ACCTCGGGCA TCGGCGCGGC GATGATCGGC TGGTTCGGCA CGGCGATGCT CTGCTACGTC
ACGCCGAAGG AGCATCTCGG CCTGCCGAAC CGCGACGACG TGAAGACCGG CGTCATCACC
TACAAGATCG CCGCGCACGC CGCCGACCTC GCCAAGGGTC ACCCCGCCGC GCAGCTCCGC
GACGACGCCC TCAGCCGCGC CCGGTTCGAC TTCCGTTGGG AGGACCAGTT CAACCTCTCG
CTGGATCCCG ACACGGCGCG CGCCTACCAC GACGAGACCC TGCCGAAGGA CGCGCACAAG
GTCGCCCATT TCTGCTCGAT GTGCGGCCCG AAATTCTGCT CGATGAAGAT CACGCAGGAT
CTGCGCGCCG ACGTGCTCGC CATGGAGGAG GCCGGTATCG TCATCGGCCA AGCCCAGCCG
ATGAGCGACG CCGAGCGCCA GGCCGGCATG GCGGCCAAGT CGCAGGAGTT CCTGGAAGAG
GGCGGCAAGC TCTACGTCGA CGCGGCGGAG TAA
 
Protein sequence
MNAPVRPKDL PNSNPASVTT GPVQGSRKVY AEAPGRPDIR VPYREIALSD PKEEPVRVYD 
PSGPYTETDA AIDLEKGLAP VREPWIVGRG YAAVKPREVK PEDNGFAAAD KLVAPCPAER
TIRRAEPGQL VTQYEFARAG IITEEMIYVA HRENACRAQM LERAEAALAD GDSFGAAVPP
FITPEFVRDE VARGRAIIPA NINHLELEPM AIGRNFLVKI NANIGNSAVT SSAAEEVEKL
VWSIRWGADT VMDLSTGRNI HNIRSWIVRN SPVPIGTVPI YQALEKVGGD PLKLDWEVFK
DTLIEQAEQG IDYFTIHAGV RLAHVPLTAR RTTGIVSRGG SIMARWCLAG HRESFLYERF
DEICDIMRAY DVSFSLGDGL RPGSIADAND AAQFAELETL GELTKIAWDK GCQTMIEGPG
HVPMHKIKVN MEKQLRECGE APFYTLGPLT TDIAPGYDHI TSGIGAAMIG WFGTAMLCYV
TPKEHLGLPN RDDVKTGVIT YKIAAHAADL AKGHPAAQLR DDALSRARFD FRWEDQFNLS
LDPDTARAYH DETLPKDAHK VAHFCSMCGP KFCSMKITQD LRADVLAMEE AGIVIGQAQP
MSDAERQAGM AAKSQEFLEE GGKLYVDAAE