Gene Smed_3695 details

Gene Information

Locus tag: Smed_3695
Symbol: eutB
ID: 5318304
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 136614
End bp: 137618
Gene Length: 1005 bp
Protein Length: 334 aa
Translation table: 11
GC content: 65%
IMG OID: 640775508
Product: threonine dehydratase
Protein accession: YP_001312441
Protein GI: 150375845
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1171] Threonine dehydratase
TIGRFAM ID: [TIGR02991] ectoine utilization protein EutB
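
The coordinates and lengths listed above are consistent with one another, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record; variable names are illustrative only.
start_bp = 136614
end_bp = 137618

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1005 334
```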


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGCGTTT CTCTTCCTGT GACGATAGAC GACATCGAGG CGGCGGCGCG GCGCATTTCC 
GGCCGCGTAC GGACCACGCC GCTGGTAGAC TCGGCATCCT TGAGTGAACG CATTGGTGTG
CCGGTCGGCC TCAAGCTCGA GCACCACCAG ACGACCGGCA GCTTCAAGCT GCGCGGCGCG
ACCAATGCGG TGCTTTCGCT TTCTCCCGGC GACCGGGCAC TCGGCGTAAT CGCGGCGTCG
ACGGGCAATC ATGGCCGGGC ACTGGCCCAT GCGGCCAAGG CGGAGGGCTC GGTCGCGACG
ATTTGCATGT CGCATCTGGT GCCGCTGAAC AAGGTTTCCG AAATCAGGCG CCTTGGCGCC
CATGTGCGGA TAATAGGCAA TTCCCAGGAC GAGGCGCAGG TAGAAGTCGA ACGGCTGGTC
GCCGAAAATG GTCTCGTGAT GGTGCCACCT TTCGACAATG CGGCGGTCGT TGCGGGGCAG
GGGACGCTCG GACTTGAGAT CGTTACGCAG ATGCCGGATG TATCGACGGT ACTTGTGCCG
GTCTCGGGTG GCGGGCTTGC CGCCGGTGTG GCGGCAGCCG TCAAGGCGCG CAAACCGGCG
ACGCGCGTCA TCGGGTTGAC TATGGAGCGC GGCGCGGCGA TGCAGGCGAG CATCGCTGCC
GGCGGGCCGG TGCTCGTCGA CGAATATCCG AGCCTGGCGG ATTCGCTTGG CGGCGGCATA
GGGCTGGACA ATCGCGTCAC GTTCCGCATG TGCCGGGAAC TCCTCGACGA GATCATCCTT
CTGACGGAGG ATGAAATCGC TGCCGGCATG CGTCATGCCT ATGCTGAGGA GCGCGAGATC
GTGGAAGGGG CGGGCGCCGT CGGCATCGCC GCGCTTCTTG CCGGAAAGAT CAGGGACATC
GACGGCCCGA TCGCGATCAT CCTGTCGGGC CGCAATGTCG ACATGGACCT GCACCTGCGC
GTGATGAACG GCGAAGCGGA TCCGTTTCGC GGGGAGGCGG CATGA
 
Protein sequence
MSVSLPVTID DIEAAARRIS GRVRTTPLVD SASLSERIGV PVGLKLEHHQ TTGSFKLRGA 
TNAVLSLSPG DRALGVIAAS TGNHGRALAH AAKAEGSVAT ICMSHLVPLN KVSEIRRLGA
HVRIIGNSQD EAQVEVERLV AENGLVMVPP FDNAAVVAGQ GTLGLEIVTQ MPDVSTVLVP
VSGGGLAAGV AAAVKARKPA TRVIGLTMER GAAMQASIAA GGPVLVDEYP SLADSLGGGI
GLDNRVTFRM CRELLDEIIL LTEDEIAAGM RHAYAEEREI VEGAGAVGIA ALLAGKIRDI
DGPIAIILSG RNVDMDLHLR VMNGEADPFR GEAA
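
The protein sequence can be reproduced from the gene sequence by reading it codon by codon under translation table 11. The following is a minimal sketch, not the tool used to build this record: the `translate` function and the constant names are hypothetical, and the start-codon set is a deliberate simplification limited to ATG, GTG, and TTG. The gene here begins with GTG, which is read as methionine when it initiates translation.

```python
import itertools

# Codon order T, C, A, G with the third base varying fastest,
# matching the conventional layout of genetic code tables.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplifying assumption: only these initiator codons are recognised.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, in the spaced layout used above.
print(translate("GTGAGCGTTT CTCTTCCTGT GACGATAGAC"))  # MSVSLPVTID
```

The spaced 10-base blocks can be passed in unchanged because whitespace is stripped before the sequence is split into codons.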