Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Msed_0758 |
Symbol | |
ID | 5103447 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Metallosphaera sedula DSM 5348 |
Kingdom | Archaea |
Replicon accession | NC_009440 |
Strand | - |
Start bp | 690041 |
End bp | 691537 |
Gene Length | 1497 bp |
Protein Length | 498 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640506663 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001190857 |
Protein GI | 146303541 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0354907 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCACCT CTTCTCAGAT GAGAGTCCTG GAGATTAACT CCGAGGCGTT TGGCGTTTCC ACACTACAAC TCATGGAAAA CGCTGGCAGA TCGGTCGCAG ACGAGATAGA GAGGGAGATG GGGACAAGTT CCCTGAGCGT CATTGTGTTT GTGGGCCACG GTGGGAAGGG TGGTGATGGG CTGGTTACGG CGAGACATCT GGCCGATAGG GGAGCCAACG TGACAGTGAT TACCATGGGC GAAATCAAGC ACAGGGACGC TCTAGTGAAC TATGGGGCTC TGGAAGAAAT GGACTTCTCT GTGAGGGTAT TAAGGATAGA CGACCTAGAT TCCCCACTAA AGGCGGATGT GCTCGTAGAT GCCATGCTGG GGACGGGAGT GAGGGGAAAG GTGAGATATC CATTTAATCA TGCCATCTCG CTTTTCAATG CGTCCAAGGG CTTCAAGGTG GCAATAGATG TTCCCTCGGG GATAGATCCA GATACTGGAG AGGCCCTAGG AGAGTTTGTC TCGCCAGATC TCGTGGTTAC ATTCCACGAC GTGAAACCAG GACTTTTGAA GTACAACTTT AAGTACGTGG TTAAGAAGAT AGGCATTCCT CCAGAGGCAT CAATTTACAT GGGACCGGGA GATCTTCTGA CGCTTAAGCA AAGAGACATG AGAAGCAGAA AAGGTGTAGG AGGGAGGGTT CTAATTGTGG GGGGAAGCTC AACCTTTTCG GGTGCGCCAG CCCTATCGGC GTTAGCTAGC TTGAGGACTG GGGCAGACCT GGTATACGTG GCCTCTCCCG AGAGAACGGC GGAGGCTATC TCCAGCTACT CTCCGGATCT AATTGCGGTT AAGCTCTCTG GGAGGAACTT TAACGAGAGT AACATCAAGG AGCTAGGACC GTGGGTGGAG AAGGCCAACG CTGTGGTTTT CGGGCCTGGC CTGGGCCTAG AGGAGGAGAC TGTCAAGGCA ACCCCAACAT TCGTGGAAAT GGTAATGAGA CTTGGGAAAC CCCTCGTGCT AGACGCTGAT GGCCTGAAGA TAATGAAGGG TTCAAAGCTT TCAAAGAACG TGGTCATTAC CCCTCATCCA GGGGAATTTA AGATCTTCTT TGGCGAGGAA CAGAAGGAGA ACGAAAGAGA AAGGATTAAC CAGGTCGTGG AGAAGGCTAG AACCTGTAAC TGCGTTGTGC TCCTGAAGGG TTATCTAGAC ATCATAAGTG ATGGGTATTC CTTTAGGCTT AACAAGGCTG GAAATCCTGG AATGACTGCA GGAGGAACAG GGGACACACT TACGGGGATC ATTGCGACCT TTATGGCACA GGGGTATTCA CCCTACATTT CAGCTGGATT AGGAGCGCTT GTGAATAGTC TCTCTGGCAC CCTAGCCTAT AGGGAACTAG GGGCACACCT GACAGCGTCT GACGTAGTAT CTAGAATTCC CAAGGTACTA AATGACCCGA TTACAGCCTT TAAGGAGAGG CCGTACAGAA GGGTTATTTC TAGTTGA
|
Protein sequence | MITSSQMRVL EINSEAFGVS TLQLMENAGR SVADEIEREM GTSSLSVIVF VGHGGKGGDG LVTARHLADR GANVTVITMG EIKHRDALVN YGALEEMDFS VRVLRIDDLD SPLKADVLVD AMLGTGVRGK VRYPFNHAIS LFNASKGFKV AIDVPSGIDP DTGEALGEFV SPDLVVTFHD VKPGLLKYNF KYVVKKIGIP PEASIYMGPG DLLTLKQRDM RSRKGVGGRV LIVGGSSTFS GAPALSALAS LRTGADLVYV ASPERTAEAI SSYSPDLIAV KLSGRNFNES NIKELGPWVE KANAVVFGPG LGLEEETVKA TPTFVEMVMR LGKPLVLDAD GLKIMKGSKL SKNVVITPHP GEFKIFFGEE QKENERERIN QVVEKARTCN CVVLLKGYLD IISDGYSFRL NKAGNPGMTA GGTGDTLTGI IATFMAQGYS PYISAGLGAL VNSLSGTLAY RELGAHLTAS DVVSRIPKVL NDPITAFKER PYRRVISS
|
| |