Gene Smed_5903 details


Gene Information

Locus tag: Smed_5903
Symbol:
ID: 5320205
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 868905
End bp: 869891
Gene Length: 987 bp
Protein Length: 328 aa
Translation table: 11
GC content: 63%
IMG OID: 640777598
Product: D-isomer specific 2-hydroxyacid dehydrogenase NAD-binding
Protein accession: YP_001314530
Protein GI: 150377935
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0111] Phosphoglycerate dehydrogenase and related dehydrogenases
TIGRFAM ID:
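
The coordinates and lengths listed above agree with each other, which a short calculation can confirm. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS includes its stop codon. The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(868905, 869891)
print(length)                  # 987
print(protein_length(length))  # 328
```

Both results match the Gene Length and Protein Length fields of the record.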


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.22808
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGGCCAA AGGTTCTGAT GACCGCAAAG ACGCTTGCTA CGCCGGGGCT GGCCCTGCTC 
GAGCAAGCCG GCTGCGCGGT CTCCTTTCTT AAGGAAGGCA CGGAAGCGGA ACTCGCAGAA
AGTCTGCGGT CGACTCCCTT CGATGCCGTC ATCTCCCGCA CCTTGGCGCT ACCGGCAATG
ATGATCGAGA CGGCACCTGC CCTGCGCGTC ATCTCCCGCC ACGGTGTCGG CTATAATAAT
GTCGATATCG AAAGCGCCAC CCGGCGCGGA GTGCCGGTGC TGATTGCCGA TGGCGCGAAT
GGCAAATCGG TCGCCGAACT TGCCGTCGGC CTCGCCCTTT CGGTGGCCCG CAAAATCACG
ACGCAAGACG CCTCGATTCG CGCCCGCCAG TGGAATCGCT CTGCCTACGG CCTGCAATTT
GCCGGCAAGA CGGCAGGGAT CGTCGCCTTC GGTGCGATCG GCCGGCGGGT AGCGGAAATT
CTGAGGGCAA TGGACATGCG GATCATCGCC TTCGACCCCC ATGCGCGCGA CCGTTCCACG
ACCGGGGTCG ATTGGACCGA GACGCTGGAC GAACTCCTGC AGGAAAGCGA TCTCGTTTCG
CTTCATTGCC CGTTGACGCC GGAGACCCGC AACATGATCA CCGCGCCGCG GCTGGCGCGG
ATGAAGCCGG GCGCAATCCT GATCAATACC GCGCGTGGCG GCCTGATCGA CGAAAAGGCA
TTGGCCGAGG CCGTTCTTTC CGGACATCTT GCCGGTGCAG GTCTCGACAC CTTCGCCGAT
GAACCCCTCC CCGCCGACCA TCCGTTCCTT TCTCTGCCGC AGATCGTGAT GACTCCGCAT
ATGGGCGGAA GCACCGACGT CGCGCTTGAT GGCGTTGCGA TCAGCGCAGC GCGCAACGTG
CTCGACGTCC TGATCGACGG CAAGGTCGAT CGCCGTCTTC TCGTCAACCC GGCGGTTCTC
GAACACCGCA CCGTCGAAGC AAAGTGA
 
Protein sequence
MGPKVLMTAK TLATPGLALL EQAGCAVSFL KEGTEAELAE SLRSTPFDAV ISRTLALPAM 
MIETAPALRV ISRHGVGYNN VDIESATRRG VPVLIADGAN GKSVAELAVG LALSVARKIT
TQDASIRARQ WNRSAYGLQF AGKTAGIVAF GAIGRRVAEI LRAMDMRIIA FDPHARDRST
TGVDWTETLD ELLQESDLVS LHCPLTPETR NMITAPRLAR MKPGAILINT ARGGLIDEKA
LAEAVLSGHL AGAGLDTFAD EPLPADHPFL SLPQIVMTPH MGGSTDVALD GVAISAARNV
LDVLIDGKVD RRLLVNPAVL EHRTVEAK
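
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that could be done, along with a GC-content calculation like the one behind the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line of the gene sequence is used here for brevity.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display spacing and line breaks from a sequence block."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ATGGGGCCAA AGGTTCTGAT GACCGCAAAG ACGCTTGCTA CGCCGGGGCT GGCCCTGCTC"
seq = clean_sequence(first_line)
print(len(seq))              # 60
print(gc_percent(seq[:10]))  # 60.0
```

Passing the full 987 bp gene sequence through the same two functions would give a figure to compare against the reported GC content.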