Gene Smed_4034 details


Gene Information

Locus tag: Smed_4034
Symbol:
ID: 5318334
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 494181
End bp: 495254
Gene Length: 1074 bp
Protein Length: 357 aa
Translation table: 11
GC content: 64%
IMG OID: 640775842
Product: alcohol dehydrogenase
Protein accession: YP_001312775
Protein GI: 150376179
COG category: [E] Amino acid transport and metabolism; [R] General function prediction only
COG ID: [COG1063] Threonine dehydrogenase and related Zn-dependent dehydrogenases
TIGRFAM ID:
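
The length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record.
start_bp = 494181
end_bp = 495254

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1074 0 357
```

Under these assumptions the computed values agree with the listed gene length (1074 bp) and protein length (357 aa).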


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAGCGC TCAGATTTCA TGCGGCAAGG GATCTCAGGA TCGAGGACAT TCAGGCGCCG 
GGCGAACCCG CAGCGGGGCA GGTGCTGGTC CGCAACCGCT TCGTCGGAAT CTGCGGAACG
GACCTGCACG AATATGCCTA CGGTCCGATT TTCGTTCCAA AGGAACCACA TCCGTTTACC
GGCGCCCATG GCCCACAGGT CCTTGGCCAC GAGTTCGGCG GTGTGGTGGA GGCCGTCGGT
GAGGGTGTGA CTTCGGTCCG TGCCGGCGAC CGGGTATCCG TGCAGCCGCT GATCATGCCG
CGCGCCGGCG ACTTCTTCGC CGATCGCGGC CTCTTCCACC TGAGTACCGA TCTGGCCCTT
GCCGGTCTGA GCTGGGCGTG GGGAGGCATG GCGGAATACG CCCTTCTCAA CGAATACAAT
GTCGAGAAGA TTCCCGAAGA GATGAGCGAC GAGGAGGCGG CGCTTGTCGA GCCGAGCGCG
GTCGCCGTCT ACGCTTGCGA CCGGGGCGGG GTCACGGCGG GCAGCAGCGT GCTCGTGACC
GGTGCCGGGC CGATCGGAGT CCTGACCCTT CTTGCCGCAC GCGCGGCCGG TGCTGCGCAG
CTCTTCGTCT CCGACATCAA CGACGCCCGC CTTGAATTCG CTTCATCCAT TCTTCCGGAC
ATAACCCCGA TCAATCCGGG TCGGAGCAAC CCGGGCGATG TGGTGCGCGC GGCTACGGAG
GGCAAAGTCG GGTGCGACGT CGCTATCGAA TGCGTCGGAA ACGAGCACGC GCTCAAGGGC
TGCGTCGACG CCGTGCGCAA GCAGGGTGTC GTGGTTCAGA CCGGGCTGCA TCCGCACGAG
AACCCGATCG ACTGGTTTCA GGTCACGTTC AAGGACATCG ATCTGCGCGG CTCCTGGGCC
TACCCGACAC ACTACTGGCC GCGTGTGATC CGCTTGATAG CCTCAGGACA TTTGCCCGCA
AAACAGGTCG TCACCGGTCG TATTGGTCTC GATCGGGCCG TGGCGGACGG CTTCGATGCG
CTTCTCGATC CGGGTGGCAG GCACCTGAAG ATCCTGATCG ACCTCACGAA CTGA
 
Protein sequence
MRALRFHAAR DLRIEDIQAP GEPAAGQVLV RNRFVGICGT DLHEYAYGPI FVPKEPHPFT 
GAHGPQVLGH EFGGVVEAVG EGVTSVRAGD RVSVQPLIMP RAGDFFADRG LFHLSTDLAL
AGLSWAWGGM AEYALLNEYN VEKIPEEMSD EEAALVEPSA VAVYACDRGG VTAGSSVLVT
GAGPIGVLTL LAARAAGAAQ LFVSDINDAR LEFASSILPD ITPINPGRSN PGDVVRAATE
GKVGCDVAIE CVGNEHALKG CVDAVRKQGV VVQTGLHPHE NPIDWFQVTF KDIDLRGSWA
YPTHYWPRVI RLIASGHLPA KQVVTGRIGL DRAVADGFDA LLDPGGRHLK ILIDLTN
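
The gene and protein sequences can be spot-checked against each other by translating a few codons. The sketch below is an assumption-laden illustration, not the database's own method: it builds a codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs only in permitted start codons), and the `translate` helper is a hypothetical name introduced here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (last base varies fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing used in the record
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence, copied from the record.
print(translate("ATGCGAGCGC TCAGATTTCA TGCGGCAAGG"))  # MRALRFHAAR

# Last 12 nt of the gene sequence (in frame, ending in the TGA stop codon).
print(translate("CTCACGAACTGA"))  # LTN
```

Both results match the start (MRALRFHAAR) and end (LTN) of the listed protein sequence.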