Gene Msil_2999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2999 
Symbol 
ID7093494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3312191 
End bp3314020 
Gene Length1830 bp 
Protein Length609 aa 
Translation table11 
GC content64% 
IMG OID643466310 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002363272 
Protein GI217979125 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value0.548936 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCC ACGACAAGCC GAAGAGCCTT CGTCCGGAAA GCGTGACCCA AGGGCCCATG 
GCCGGCTCGC GCAAGATCTA TGTGCATCCG GACGGTCGCT CCGACATCGC CGTGCCGCTG
CGTGAGATCA CCCTCAATCC GTCCGCAAAC GAGCCGCCGG TGCGCGTCTA CGACGCCTCG
GGACCCTATA CGGAGTCCGA GCCCGCGATC GACCTTTCCG CCGGCTTGCC GCGCATCCGG
GACCCTTGGC TGGCGAGGCG CGCGGGGCTC GACTTCTACT CCGGCCGCCC GGTTCAGCCG
GAAGACAATG GTTCGGTGTC GCCGGACCGG CTCGCCCCGC CCTGCCCCGC CAACACCTCG
CCGCGCAAGG GCCGGGACGG CGCGCTGGTG ACGCAATATG AATTCGCCCG CGCAGGCGTG
ATCACCGAAG AAATGATCTA TGTCGCGGCC CGCGAAAACC TTGGCCGAGA GGCGGTGCTG
GAGGGCGCCG AAGCGAAAAT CGCCGACGGC GAAAGTTTCG GCGCGTCGAT CCCGGCCTTC
ATCACGCCGG AATTCGTGCG CGACGAGATC GCCCGCGGCC GCGCCATCAT CCCCGCCAAC
ATCAACCATC CCGAACTCGA GCCGATGATC ATCGGCCGCA ATTTTCTCGT GAAGGTCAAC
GCCAACATCG GCAATTCGGC GGTGACGTCG TCCGTCGCCG AAGAGGTCGA AAAGATGGTG
TGGGCGATCC GCTGGGGCGC CGATACGGTG ATGGATCTCT CGACGGGACG CAACATCCAC
AACATCCGCG ACTGGATCAT CCGCAACTCG CCAGCGCCGA TCGGCACCGT GCCGATCTAC
CAGGCGCTGG AGAAGGTCAA TGGCGATCCG ACCAAACTGA CATTCGAATT GTTCCGCGAC
ACGCTGATCG AACAGGCCGA ACAGGGCGTC GATTATTTCA CGATCCACGC CGGCGTGCGG
CTCGGCTATA TTCCGCTGAC GGCGAAGCGG ACCACGGGCA TCGTCTCGCG CGGCGGCTCG
ATCATGGCCC GCTGGTGCCT TGCCCATCAC AAGGAAAGCT TCCTCTACGA GCGCTTCGAC
GAGCTCTGCG ACGTCATGCG CGCCTATGAC GTCTCCTTCT CGCTCGGCGA CGGCTTGCGC
CCGGGCTCGA TCGCCGACGC CAACGACCGC GCCCAATTCG CCGAATTGGA GACGCTCGGC
GAGCTGACCA AAATCGCCTG GGACAAGGGC TGCCAGGTGA TGATCGAAGG GCCCGGCCAT
GTGCCGATGC ACAAGATCAA GATCAACATG GAAAAGCAGC TGAAGGAATG TCACGAGGCG
CCGTTCTACA CGCTCGGGCC GCTGACAACG GACATTGCGC CAGGCTATGA CCACATCACC
TCCGGCATCG GCGCTGCGAT GATCGGCTGG TTCGGCTGCG CGATGTTGTG TTATGTGACG
CCGAAAGAAC ATCTCGGCCT GCCGGACCGC GACGACGTCA AGACAGGCGT CATTACCTAC
CGCATCGCCG CCCATGCCGG CGACCTCGCC AAGGGCCATC CGGCGGCGCA GCTGCGCGAC
GACGCGCTGT CGCGCGCGAG ATTCGATTTC CGCTGGGAGG ATCAGTTCAA TCTCGGCCTC
GATCCCGACA CGGCGCGGCA GTTCCACGAC GAGACGCTGC CGAAGGATGC GCATAAGACT
GCGCATTTCT GCTCGATGTG CGGTCCGCAA TTCTGTTCGA TGAAGATCAC GCAGGATATT
CGCGACGAGG TCGCCGCGAT GGAACGCGAG AAGGGCATGG CCGACAAGAG CGCCGAATTC
CTGGACGGCG GCGGCAAGCT TTACGTTTAA
 
Protein sequence
MNIHDKPKSL RPESVTQGPM AGSRKIYVHP DGRSDIAVPL REITLNPSAN EPPVRVYDAS 
GPYTESEPAI DLSAGLPRIR DPWLARRAGL DFYSGRPVQP EDNGSVSPDR LAPPCPANTS
PRKGRDGALV TQYEFARAGV ITEEMIYVAA RENLGREAVL EGAEAKIADG ESFGASIPAF
ITPEFVRDEI ARGRAIIPAN INHPELEPMI IGRNFLVKVN ANIGNSAVTS SVAEEVEKMV
WAIRWGADTV MDLSTGRNIH NIRDWIIRNS PAPIGTVPIY QALEKVNGDP TKLTFELFRD
TLIEQAEQGV DYFTIHAGVR LGYIPLTAKR TTGIVSRGGS IMARWCLAHH KESFLYERFD
ELCDVMRAYD VSFSLGDGLR PGSIADANDR AQFAELETLG ELTKIAWDKG CQVMIEGPGH
VPMHKIKINM EKQLKECHEA PFYTLGPLTT DIAPGYDHIT SGIGAAMIGW FGCAMLCYVT
PKEHLGLPDR DDVKTGVITY RIAAHAGDLA KGHPAAQLRD DALSRARFDF RWEDQFNLGL
DPDTARQFHD ETLPKDAHKT AHFCSMCGPQ FCSMKITQDI RDEVAAMERE KGMADKSAEF
LDGGGKLYV