Gene Msil_2229 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_2229 
Symbol 
ID7091351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp2411946 
End bp2413562 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content72% 
IMG OID643465550 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_002362525 
Protein GI217978378 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.355484 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTTTC AAAACCTTCC GCCCGAATTG TTGACCAATG CGGAAATGGG CGAAGCCGAT 
CGGCTCACCA TCGCCTCCGG CACGCCGGGC TATCAACTGA TGGAAAACGC CGGCGCCGCT
GTCGCCGCCG AGGCCGCGCG GCTGTCGCCG AAGGGCGGCC GGATCGCCGT GTTGTGCGGC
CCCGGCAACA ATGGCGGCGA CGGTTTCGTC GCGGCGCGGC TCCTCAAGGC GCGCGGCTTT
TCCGTGACGC TCGGCTTGCT TGGGCCGCGC GAGGCGCTGC ATGGCGACGC CGCGACCGCT
GCGGCTGCGT GGGATGGCGA CGTCTCGGCG CTTGAGGCGC TCGATCTCGA AAGCGCAGAT
GTTGCGATTG ACGCGTTGTT TGGCGCCGGC ATCGCGCGCG ATCTCGACGG GGCGGCGCGC
GACGCCGTGC TCCGCCTCAA TGAGTGGTCG GCGCGGCGCA GGAAGCCGGC GCTCGCGGTC
GACGTGCCCT CGGGCCTCGA TGGAACCAGC GGTGAGATTC GCGGCGTCGC CGTGCGTGCG
GCGCGCACCA TCACCTTCTT CCGCCGCAAG CCCGGCCATC TACTTTTGCC CGGACGCATC
TGCTGCGGCG AGACCGTCGT GGCCGATATC GGCATCAGGG CGGAGGCGCT CGCGGCGATC
GCCCCGAAAA CGGCGGCGAA TGGACCGCAG CTCTGGGGCC GCCTGCTGCC GTTTCCGTCC
ATCGAGGGAC ATAAATATTC GCGCGGCCAT GCGCTTGTCC TGTCCGGCTC GCTGGCGCAC
ACCGGGGCGG CGCGGCTTGC GGCAAGGGGC GCGCTGCGCG CCGGGGCGGG GCTCGTCACG
GTCGCGACCC CGCGCGACGC GCTGGCGGTC CACGCCGCGG CGCTGACCGC TATCATGACA
ACGCCCTGTG ACGGGCCCGA GGAACTGGCG GCGATTCTCG CCGACAGGCG CAAGAACGCG
CTCGTGCTCG GTCCTGGACT TGGCGTCGGC GCCGCGACGC GGGCCCTCGT GACGACCGCG
CTCGCGGCCG CAACGGCCGA TCCCTCGCCC CGCGCGATCG TGCTCGACGC CGACGCTCTG
TCGAGCTTCA AGGGCGCGGC GGCCGAACTC GGGCAGGCGA TCCGCGCCTC AGGCGCGCCG
GTCGTTCTGA CGCCCCATGA CGGCGAATTC GCGCGGCTGT TTGACGGCGC CTCGCCCGAC
GACGCCGATC GCTATGCCGG GCCCCGCCTC CAGCCCGAGG CCGCGTGCGA GGCGCTCAAA
AACCTGCGCT CCGGCTCGAA GCTCACGCGG GCGCGGGCTG CGGCGGTGCT GACCGGCGCC
GTCGTGCTGC TGAAAGGCCC CGACACCGTC GTCGCCGATC CGGATGGGCG CGCGACGATC
GACGATCTCT CGCCGCCCTG GCTCGCCACC GCCGGCTCGG GCGATGTTCT CGCCGGCATG
ATCGGCGGCC TGTGCGCGCA AGCTATGCCG CCCTTCGAGG CGGCCTCCGC CGCCGTCTGG
CTGCATGGGG CGGCGGCGCG CCAATTCGGC GTCGGGCTGA TCTCCGAGGA TTTGCCCGAA
TCGCTGCCCG CCGTCCTGCG CGCTCTCTAC GACAGTCTGG GTCTCGGCCC GCTCTAA
 
Protein sequence
MSFQNLPPEL LTNAEMGEAD RLTIASGTPG YQLMENAGAA VAAEAARLSP KGGRIAVLCG 
PGNNGGDGFV AARLLKARGF SVTLGLLGPR EALHGDAATA AAAWDGDVSA LEALDLESAD
VAIDALFGAG IARDLDGAAR DAVLRLNEWS ARRRKPALAV DVPSGLDGTS GEIRGVAVRA
ARTITFFRRK PGHLLLPGRI CCGETVVADI GIRAEALAAI APKTAANGPQ LWGRLLPFPS
IEGHKYSRGH ALVLSGSLAH TGAARLAARG ALRAGAGLVT VATPRDALAV HAAALTAIMT
TPCDGPEELA AILADRRKNA LVLGPGLGVG AATRALVTTA LAAATADPSP RAIVLDADAL
SSFKGAAAEL GQAIRASGAP VVLTPHDGEF ARLFDGASPD DADRYAGPRL QPEAACEALK
NLRSGSKLTR ARAAAVLTGA VVLLKGPDTV VADPDGRATI DDLSPPWLAT AGSGDVLAGM
IGGLCAQAMP PFEAASAAVW LHGAAARQFG VGLISEDLPE SLPAVLRALY DSLGLGPL