Gene Smed_1387 details

Gene Information

Locus tag: Smed_1387
Symbol:
ID: 5322238
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 1461949
End bp: 1463421
Gene Length: 1473 bp
Protein Length: 490 aa
Translation table: 11
GC content: 65%
IMG OID: 640790329
Product: carbohydrate kinase, YjeF related protein
Protein accession: YP_001327068
Protein GI: 150396601
COG category: [G] Carbohydrate transport and metabolism
              [S] Function unknown
COG ID: [COG0062] Uncharacterized conserved protein
        [COG0063] Predicted sugar kinase
TIGRFAM ID: [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related
            [TIGR00197] yjeF N-terminal region
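
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminating stop codon. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1461949
end_bp = 1463421
gene_length_bp = 1473
protein_length_aa = 490

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(span_bp == gene_length_bp)                      # coordinate span vs. gene length
print(gene_length_bp % 3 == 0)                        # whole number of codons
print(expected_protein_length == protein_length_aa)   # codons minus stop vs. protein length
```

Under those two assumptions all three checks agree with the record: 1463421 - 1461949 + 1 = 1473 bp, and 1473 / 3 - 1 = 490 aa.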


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.470651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAGA CACTTTCAAA TATCTTTGTG ACTCCGGCCG AAATGACATC GGTCGACAAG 
GATGCCGCCC ATTCGGGAAT CGACAGCTTC TCGCTCATGC GTTCAGCCGG AACGGCCGTG
TCGGCCGCCG CCTTGCGACT TTACCCGGGA GCCCTGCGGT TCGTGACGCT TTGCGGCCCC
GGCAACAACG GTGGCGATGG CTATGTCGCC GCAGCCGCGC TGGCGGAAAG CGGCGCATGC
GTCACGGTGT TCGCGCTTGG CGAGCCGGCG AAGTTGAAGG GCGATGCCGC AAGGGCGCGC
GAACGGTGCG CGCTCGCGCC TGGACCGCTC GACGGTTACG AGCCGCAACC GGGCGACGTC
GTTATCGATG CGCTTTTCGG GGCAGGGCTT GCGCGTGAGG TGCCCAAGCA GGCGAGAAAC
GTCATCGGCC GGGTGAACGC CAGTGGAGTA CCGGTAGTCG CCGTCGACCT GCCATCCGGC
ATAGACGGGC GGACGGGTGA GATCCGTGGC GCAAGCTTCG TTGCGGCCCA CACCGTAACG
TTCATGGCTC CAAAGCCCGG CCATCTGCTG ATGCCGGGTC GCGCCCGTTG CGGCACAATC
GAAGTCTTCG ATATCGGCAT CCCGGGGAGG TTCGTTGTCG GCAGGGCCGG TGATTTGCAC
ATCAACACCC CCACGCTCTG GGAAGAGCAC CTCGGCGCGC TCGATCCGGA GGCCCATAAA
TACAAGCGCG GTCATCTCGC AGTATTTTGC GGCGGATCGG CCTCGACCGG GGCCGCGCGA
TTGTCGGCGG CAGCCGGCCT GCGCGCTGGT GCCGGGCTCG TGACGCTTGC CTCTCCAGTG
GAAGCGCTCG CTACGAACGC GTCGCATCTC ACGGCCGTCA TGCTCAAGGA GATCGACGGC
ACAGCCGACC TTAGCGCCTG GCTCAAGGAC AAGCGGCTGA GCACTTTCGT TCTTGGCCCC
GGTTTCGGGG TTGGCAAGAC GGCACGGGAC TTCGTGCTCA TGCTTTGCGA CCGGGCGCTG
GTCCTCGATG CCGACGGCAT CACTTCGTTC AAAGAGGCGG AAGAGCAACT CTTCGACAGG
ATCGCGGAGG AGGGCGGCGA AGTGGTGATG ACGCCGCACG ACGGTGAGTT TGGGCGTATT
TTCCCCGGCA TTGCTGCGGA TACGGCTTTG TCAAAGATCG AGAAGGCACA GGCGGCAGCA
AAACTCAGTC ATTCCGTGAT CGTCTACAAG GGCCCGGATA CCGTCATTGC CGCGCCTTCC
GGCCGGGCGG CCGTCAATGT CAATGCGCCC CCCTGGCTTG CGACGGCAGG CTCCGGCGAC
GTGCTTGCCG GGATCATCGG AGCGCACCTG GCACAGGGCA TGCCTTCATT CGAGGCAGCG
GCGGCGGCAG TCTGGCGCCA CGGCGAAGCA GGTGTCGCAG CTGGCCGTAC GGCGACGGCC
GAAATGCTCA TCGAAAGCAT GCCGCCGCTT TGA
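
The gene sequence above is printed in blocks of ten bases, six blocks per row. A minimal sketch of how such a block might be joined into one string and its GC fraction computed is shown here. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first row of the sequence is used as input, so the printed fraction describes that row alone and is not expected to match the 65% reported for the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not dna:
        return 0.0
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from the record above.
first_row = "ATGTCCCAGA CACTTTCAAA TATCTTTGTG ACTCCGGCCG AAATGACATC GGTCGACAAG"

dna = clean_sequence(first_row)
print(len(dna))                    # number of bases in the row
print(round(gc_fraction(dna), 3))  # GC fraction of this row only
```

Passing the full sequence block to `clean_sequence` in the same way would give the string needed to compare against the reported gene length and GC content.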
 
Protein sequence
MSQTLSNIFV TPAEMTSVDK DAAHSGIDSF SLMRSAGTAV SAAALRLYPG ALRFVTLCGP 
GNNGGDGYVA AAALAESGAC VTVFALGEPA KLKGDAARAR ERCALAPGPL DGYEPQPGDV
VIDALFGAGL AREVPKQARN VIGRVNASGV PVVAVDLPSG IDGRTGEIRG ASFVAAHTVT
FMAPKPGHLL MPGRARCGTI EVFDIGIPGR FVVGRAGDLH INTPTLWEEH LGALDPEAHK
YKRGHLAVFC GGSASTGAAR LSAAAGLRAG AGLVTLASPV EALATNASHL TAVMLKEIDG
TADLSAWLKD KRLSTFVLGP GFGVGKTARD FVLMLCDRAL VLDADGITSF KEAEEQLFDR
IAEEGGEVVM TPHDGEFGRI FPGIAADTAL SKIEKAQAAA KLSHSVIVYK GPDTVIAAPS
GRAAVNVNAP PWLATAGSGD VLAGIIGAHL AQGMPSFEAA AAAVWRHGEA GVAAGRTATA
EMLIESMPPL
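
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first codons. The sketch below builds a codon table from the standard genetic code. It assumes that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may serve as start codons. The `translate` helper is hypothetical and handles only unambiguous A, C, G, T input.

```python
BASES = "TCAG"

# Amino acids of the standard genetic code, ordered by first, second, third base
# over T, C, A, G. Assumption: table 11 uses the same codon-to-amino-acid mapping.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]  # KeyError on non-ACGT bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record above.
gene_start = "ATGTCCCAGA CACTTTCAAA TATCTTTGTG"
print(translate(gene_start))  # prints MSQTLSNIFV
```

The result matches the first ten residues of the protein sequence listed above.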