Gene Smed_4228 details

Gene Information

Locus tag: Smed_4228
Symbol:
ID: 5319302
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 711076
End bp: 712794
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 68%
IMG OID: 640776033
Product: dihydroxyacetone kinase
Protein accession: YP_001312966
Protein GI: 150376370
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2376] Dihydroxyacetone kinase
TIGRFAM ID:
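
The start and end coordinates, gene length, and protein length in this record are mutually consistent, which a little arithmetic confirms. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon (neither is stated in the record); the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(711076, 712794)
print(length_bp)                  # 1719
print(protein_length(length_bp))  # 572
```

Both results match the Gene Length and Protein Length values given in the record.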


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGA TCTTTGACGC TCCGGAGGAT TTCGCGACGA CCGCGCTCGC GGGTTTCGCG 
GCGATCTATC AGCGCAACGT CCGCCTGGTG AAGGGCGGCG TCGTGCGCTC GACCAAGGTG
CCGAAGGGCA AGGTCGCCGT CGTCGTCGGC GGCGGCTCGG GGCACTATCC GGCCTTCGCC
GGATATGTCG GTCCGGGGTT GGCCGACGCG GCGGTGGCAG GCGATGTCTT TGCCTCGCCC
TCGACGGCCG CCGTAGCTCG CGTCTGCCGC CATGCCGACC AGGGCGGCGG CGTACTTCTC
GGCTTCGGCA ATTATGCCGG CGACGTTCTA AATTTCGGGG TGGCGGCCGA GCGGCTTCGC
TCCGAAGGCA TCGACGTGCG CATCCTGCCG GTGACGGACG ACGTGGCGAG CGCACCCGTG
GACAGCTCGG CCAAACGCCG CGGGATTGCC GGCGACCTCG TCGTCTTCAA GATCGCCGGC
GCGGCAGCGG AAGCCGGCAA ATCGCTCGAC GAAGTGGAAC GGCTCGCGCG CTACGCCAAT
GATCGCACCT TTTCCTTCGG TGTCGCCTTC GGCGGCTGCA CGCTTCCGGG CGCCGCCGGG
CCGCTCTTCA CGGTCCCTGA TGGCCAGATG GCGCTGGGCC TCGGCATCCA TGGGGAGCCG
GGTGTCAGCG AGGAGCCGAT TGCGACGGCA AGCGATCTCG CAAAGCTCCT CACCGGCAAG
CTTCTCGCCG AACGTCCCGC AGGCGCCAGC AAGGCGGCCG CCGTGCTGAA CGGGCTCGGC
TCGACCAAAT ATGAAGAACT CTTCGTGCTG TGGACGGCGG TCGCAAAAGA ACTGGCGGAG
GCGTGTGTTG AGGTGGTCGA CCCCGAATGC GGGGAATTCG TCACCAGCCT CGACATGCAG
GGCTGCTCGC TCACCCTTCT CTGGCTGAAT GAAGAGCTGG AGGCGCTCTG GCGTGCGCCC
GCGGATGCAC CGGTGCTGCG CAAGGGCGTG ATCATTGCCG CCGAGCCGGC GACCGATGAG
ATCGCGGACG CAGACGGTCC ACAATCTTTC GCGCCCGCTT CGGAAGCCTC GAAGGCAAGC
GGCAAGTGCA TCGCCCGGCT GATCGGCAAT ATAGCCGATG CGCTGAAGGA GGCGGAAGAG
GAACTCGGCC GTATCGACGC TTTTGCGGGA GACGGCGATC ACGGTCAGGG GATGCGCCGC
GGCTCGGCTG CCGCATCCGA CGCAGCACAG ACGGCAGTCG CGGCGGGGGC GGGTGCTGCA
AGCGTTCTGG CTGCCGCGGG CGACGCCTGG GCCGATCGCG CCGGCGGCAC GTCGGGTGCG
ATATGGGGGC TGGCGCTCCG CTCCTGGAGC AATGCATTCA GCGATGAGGA CAAGGCGAGC
GACGCGGCGA CCGTGGAGGG CTCCCGCCTT GCGCTCGAGG GCGTGACGCG CCTTGGCCGC
GCGCGTGTGG GCGACAAGAC GCTGGTTGAC GCGCTGGTGC CCTTCGTCGA GACGCTGGAA
AGGGAATCCG CTGCCGGAAG GCCGCTCGTA GAAGCCTGGA ACGCGGCGGC AACAGCCGCG
CAGGAGGCAG CGGATGCGAC CTCGGCGCTG ACCCCGAAAC TCGGCCGCGC GCGGCCGCTG
GCCGAGAAGA GCATCGGCCA TCCGGACGCC GGCGCAGTTT CGCTGGCGCT GGTGGCACGG
GTTGCAGGCG CGTTTTTGAA GGAAGCTGCG GCCGGTTGA
 
Protein sequence
MTTIFDAPED FATTALAGFA AIYQRNVRLV KGGVVRSTKV PKGKVAVVVG GGSGHYPAFA 
GYVGPGLADA AVAGDVFASP STAAVARVCR HADQGGGVLL GFGNYAGDVL NFGVAAERLR
SEGIDVRILP VTDDVASAPV DSSAKRRGIA GDLVVFKIAG AAAEAGKSLD EVERLARYAN
DRTFSFGVAF GGCTLPGAAG PLFTVPDGQM ALGLGIHGEP GVSEEPIATA SDLAKLLTGK
LLAERPAGAS KAAAVLNGLG STKYEELFVL WTAVAKELAE ACVEVVDPEC GEFVTSLDMQ
GCSLTLLWLN EELEALWRAP ADAPVLRKGV IIAAEPATDE IADADGPQSF APASEASKAS
GKCIARLIGN IADALKEAEE ELGRIDAFAG DGDHGQGMRR GSAAASDAAQ TAVAAGAGAA
SVLAAAGDAW ADRAGGTSGA IWGLALRSWS NAFSDEDKAS DAATVEGSRL ALEGVTRLGR
ARVGDKTLVD ALVPFVETLE RESAAGRPLV EAWNAAATAA QEAADATSAL TPKLGRARPL
AEKSIGHPDA GAVSLALVAR VAGAFLKEAA AG
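
The GC content field in the Gene Information section is the percentage of G and C bases in the gene sequence. The following is a minimal sketch for recomputing it from sequence text laid out in space-separated blocks, as in this record. The helper name `gc_content` is hypothetical, and the input shown is a toy string, not this gene.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input: 12 bases, 6 of which are G or C.
print(gc_content("ATGC GGCC\nTTAA"))  # 50.0
```

Pasting the full gene sequence into the call in place of the toy string gives a figure that can be compared with the value listed in the record.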