Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msil_2229 |
Symbol | |
ID | 7091351 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylocella silvestris BL2 |
Kingdom | Bacteria |
Replicon accession | NC_011666 |
Strand | - |
Start bp | 2411946 |
End bp | 2413562 |
Gene Length | 1617 bp |
Protein Length | 538 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 643465550 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_002362525 |
Protein GI | 217978378 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.355484 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTTTC AAAACCTTCC GCCCGAATTG TTGACCAATG CGGAAATGGG CGAAGCCGAT CGGCTCACCA TCGCCTCCGG CACGCCGGGC TATCAACTGA TGGAAAACGC CGGCGCCGCT GTCGCCGCCG AGGCCGCGCG GCTGTCGCCG AAGGGCGGCC GGATCGCCGT GTTGTGCGGC CCCGGCAACA ATGGCGGCGA CGGTTTCGTC GCGGCGCGGC TCCTCAAGGC GCGCGGCTTT TCCGTGACGC TCGGCTTGCT TGGGCCGCGC GAGGCGCTGC ATGGCGACGC CGCGACCGCT GCGGCTGCGT GGGATGGCGA CGTCTCGGCG CTTGAGGCGC TCGATCTCGA AAGCGCAGAT GTTGCGATTG ACGCGTTGTT TGGCGCCGGC ATCGCGCGCG ATCTCGACGG GGCGGCGCGC GACGCCGTGC TCCGCCTCAA TGAGTGGTCG GCGCGGCGCA GGAAGCCGGC GCTCGCGGTC GACGTGCCCT CGGGCCTCGA TGGAACCAGC GGTGAGATTC GCGGCGTCGC CGTGCGTGCG GCGCGCACCA TCACCTTCTT CCGCCGCAAG CCCGGCCATC TACTTTTGCC CGGACGCATC TGCTGCGGCG AGACCGTCGT GGCCGATATC GGCATCAGGG CGGAGGCGCT CGCGGCGATC GCCCCGAAAA CGGCGGCGAA TGGACCGCAG CTCTGGGGCC GCCTGCTGCC GTTTCCGTCC ATCGAGGGAC ATAAATATTC GCGCGGCCAT GCGCTTGTCC TGTCCGGCTC GCTGGCGCAC ACCGGGGCGG CGCGGCTTGC GGCAAGGGGC GCGCTGCGCG CCGGGGCGGG GCTCGTCACG GTCGCGACCC CGCGCGACGC GCTGGCGGTC CACGCCGCGG CGCTGACCGC TATCATGACA ACGCCCTGTG ACGGGCCCGA GGAACTGGCG GCGATTCTCG CCGACAGGCG CAAGAACGCG CTCGTGCTCG GTCCTGGACT TGGCGTCGGC GCCGCGACGC GGGCCCTCGT GACGACCGCG CTCGCGGCCG CAACGGCCGA TCCCTCGCCC CGCGCGATCG TGCTCGACGC CGACGCTCTG TCGAGCTTCA AGGGCGCGGC GGCCGAACTC GGGCAGGCGA TCCGCGCCTC AGGCGCGCCG GTCGTTCTGA CGCCCCATGA CGGCGAATTC GCGCGGCTGT TTGACGGCGC CTCGCCCGAC GACGCCGATC GCTATGCCGG GCCCCGCCTC CAGCCCGAGG CCGCGTGCGA GGCGCTCAAA AACCTGCGCT CCGGCTCGAA GCTCACGCGG GCGCGGGCTG CGGCGGTGCT GACCGGCGCC GTCGTGCTGC TGAAAGGCCC CGACACCGTC GTCGCCGATC CGGATGGGCG CGCGACGATC GACGATCTCT CGCCGCCCTG GCTCGCCACC GCCGGCTCGG GCGATGTTCT CGCCGGCATG ATCGGCGGCC TGTGCGCGCA AGCTATGCCG CCCTTCGAGG CGGCCTCCGC CGCCGTCTGG CTGCATGGGG CGGCGGCGCG CCAATTCGGC GTCGGGCTGA TCTCCGAGGA TTTGCCCGAA TCGCTGCCCG CCGTCCTGCG CGCTCTCTAC GACAGTCTGG GTCTCGGCCC GCTCTAA
|
Protein sequence | MSFQNLPPEL LTNAEMGEAD RLTIASGTPG YQLMENAGAA VAAEAARLSP KGGRIAVLCG PGNNGGDGFV AARLLKARGF SVTLGLLGPR EALHGDAATA AAAWDGDVSA LEALDLESAD VAIDALFGAG IARDLDGAAR DAVLRLNEWS ARRRKPALAV DVPSGLDGTS GEIRGVAVRA ARTITFFRRK PGHLLLPGRI CCGETVVADI GIRAEALAAI APKTAANGPQ LWGRLLPFPS IEGHKYSRGH ALVLSGSLAH TGAARLAARG ALRAGAGLVT VATPRDALAV HAAALTAIMT TPCDGPEELA AILADRRKNA LVLGPGLGVG AATRALVTTA LAAATADPSP RAIVLDADAL SSFKGAAAEL GQAIRASGAP VVLTPHDGEF ARLFDGASPD DADRYAGPRL QPEAACEALK NLRSGSKLTR ARAAAVLTGA VVLLKGPDTV VADPDGRATI DDLSPPWLAT AGSGDVLAGM IGGLCAQAMP PFEAASAAVW LHGAAARQFG VGLISEDLPE SLPAVLRALY DSLGLGPL
|
| |