Gene Msed_0758 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsed_0758 
Symbol 
ID5103447 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMetallosphaera sedula DSM 5348 
KingdomArchaea 
Replicon accessionNC_009440 
Strand
Start bp690041 
End bp691537 
Gene Length1497 bp 
Protein Length498 aa 
Translation table11 
GC content51% 
IMG OID640506663 
Productcarbohydrate kinase, YjeF related protein 
Protein accessionYP_001190857 
Protein GI146303541 
COG category[G] Carbohydrate transport and metabolism
[S] Function unknown 
COG ID[COG0062] Uncharacterized conserved protein
[COG0063] Predicted sugar kinase 
TIGRFAM ID[TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
[TIGR00197] yjeF N-terminal region 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0354907 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCACCT CTTCTCAGAT GAGAGTCCTG GAGATTAACT CCGAGGCGTT TGGCGTTTCC 
ACACTACAAC TCATGGAAAA CGCTGGCAGA TCGGTCGCAG ACGAGATAGA GAGGGAGATG
GGGACAAGTT CCCTGAGCGT CATTGTGTTT GTGGGCCACG GTGGGAAGGG TGGTGATGGG
CTGGTTACGG CGAGACATCT GGCCGATAGG GGAGCCAACG TGACAGTGAT TACCATGGGC
GAAATCAAGC ACAGGGACGC TCTAGTGAAC TATGGGGCTC TGGAAGAAAT GGACTTCTCT
GTGAGGGTAT TAAGGATAGA CGACCTAGAT TCCCCACTAA AGGCGGATGT GCTCGTAGAT
GCCATGCTGG GGACGGGAGT GAGGGGAAAG GTGAGATATC CATTTAATCA TGCCATCTCG
CTTTTCAATG CGTCCAAGGG CTTCAAGGTG GCAATAGATG TTCCCTCGGG GATAGATCCA
GATACTGGAG AGGCCCTAGG AGAGTTTGTC TCGCCAGATC TCGTGGTTAC ATTCCACGAC
GTGAAACCAG GACTTTTGAA GTACAACTTT AAGTACGTGG TTAAGAAGAT AGGCATTCCT
CCAGAGGCAT CAATTTACAT GGGACCGGGA GATCTTCTGA CGCTTAAGCA AAGAGACATG
AGAAGCAGAA AAGGTGTAGG AGGGAGGGTT CTAATTGTGG GGGGAAGCTC AACCTTTTCG
GGTGCGCCAG CCCTATCGGC GTTAGCTAGC TTGAGGACTG GGGCAGACCT GGTATACGTG
GCCTCTCCCG AGAGAACGGC GGAGGCTATC TCCAGCTACT CTCCGGATCT AATTGCGGTT
AAGCTCTCTG GGAGGAACTT TAACGAGAGT AACATCAAGG AGCTAGGACC GTGGGTGGAG
AAGGCCAACG CTGTGGTTTT CGGGCCTGGC CTGGGCCTAG AGGAGGAGAC TGTCAAGGCA
ACCCCAACAT TCGTGGAAAT GGTAATGAGA CTTGGGAAAC CCCTCGTGCT AGACGCTGAT
GGCCTGAAGA TAATGAAGGG TTCAAAGCTT TCAAAGAACG TGGTCATTAC CCCTCATCCA
GGGGAATTTA AGATCTTCTT TGGCGAGGAA CAGAAGGAGA ACGAAAGAGA AAGGATTAAC
CAGGTCGTGG AGAAGGCTAG AACCTGTAAC TGCGTTGTGC TCCTGAAGGG TTATCTAGAC
ATCATAAGTG ATGGGTATTC CTTTAGGCTT AACAAGGCTG GAAATCCTGG AATGACTGCA
GGAGGAACAG GGGACACACT TACGGGGATC ATTGCGACCT TTATGGCACA GGGGTATTCA
CCCTACATTT CAGCTGGATT AGGAGCGCTT GTGAATAGTC TCTCTGGCAC CCTAGCCTAT
AGGGAACTAG GGGCACACCT GACAGCGTCT GACGTAGTAT CTAGAATTCC CAAGGTACTA
AATGACCCGA TTACAGCCTT TAAGGAGAGG CCGTACAGAA GGGTTATTTC TAGTTGA
 
Protein sequence
MITSSQMRVL EINSEAFGVS TLQLMENAGR SVADEIEREM GTSSLSVIVF VGHGGKGGDG 
LVTARHLADR GANVTVITMG EIKHRDALVN YGALEEMDFS VRVLRIDDLD SPLKADVLVD
AMLGTGVRGK VRYPFNHAIS LFNASKGFKV AIDVPSGIDP DTGEALGEFV SPDLVVTFHD
VKPGLLKYNF KYVVKKIGIP PEASIYMGPG DLLTLKQRDM RSRKGVGGRV LIVGGSSTFS
GAPALSALAS LRTGADLVYV ASPERTAEAI SSYSPDLIAV KLSGRNFNES NIKELGPWVE
KANAVVFGPG LGLEEETVKA TPTFVEMVMR LGKPLVLDAD GLKIMKGSKL SKNVVITPHP
GEFKIFFGEE QKENERERIN QVVEKARTCN CVVLLKGYLD IISDGYSFRL NKAGNPGMTA
GGTGDTLTGI IATFMAQGYS PYISAGLGAL VNSLSGTLAY RELGAHLTAS DVVSRIPKVL
NDPITAFKER PYRRVISS