Gene Smed_2488 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2488 
Symbol 
ID5323349 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2572435 
End bp2574213 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content63% 
IMG OID640791426 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001328155 
Protein GI150397688 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGACA AACTCCAGCG CCGCTTGCGC TCGCAGGACT GGTTCGACAA TCCGGACCAT 
ATCGATCTTA CGGCACTCTA TCTCGAGCGC TTCATGAACT ACGGCGTAAC GCCGGAGGAG
TTGCGTTCCG GCAAGCCCGT CATCGGCATT GCCCAGAGCG GCAGCGATCT CACCCCATGC
AATCGGGTCC ACATGGATCT GGCGAAGCGT GTTCGCGACG GCATCCGCGA TGCAGGCGGC
ATTCCGATCG AATTCCCGAC GCACCCGATC TTCGAGAACT GCAAGCGTCC GACCGCCGCC
CTCGACCGCA ACCTCGCCTA TCTGGGCCTG GTGGAAATTC TCTACGGCTA TCCGCTGGAT
GGCGTGGTGC TGACCACCGG CTGCGACAAG ACCACGCCGT CCGCCCTGAT GGCGGCTTCG
ACGGTCGATA TTCCGGCGAT CGTGCTCTCC GGCGGACCCA TGCTCGATGG CTGGCACGAA
GGCGACCTCG TCGGCTCGGG CACGGTGATC TGGCGCATGC GCCGCAAACT GGCGGCAGGC
GAGATCGATC GCGAGGAATT CATGCAGGCC GCGCTCGATT CGGCGCCTTC GGTCGGCCAT
TGCAACACCA TGGGCACGGC TTCGACTATG AACGCCATGG CCGAAGCGCT CGGCATGTCG
CTCACCGGCT GCGGCGCCAT CCCCGCTGCC TATCGCGAGC GTGGGCAGAT GGCCTATCGT
ACCGGACGCC GCGCCGTGGA ACTCGTCATG GAGGACCTGA AGCCCTCCGA CATCCTGACC
CGCGAAGCCT TTCTCAACGC TATCCGAGTC AACTCGGCGA TCGGCGGGTC GACCAATGCC
CAGCCACATC TTGCGGCCAT GGCGAAGCAT GCGGGCGTGG AACTTTATCC GGACGACTGG
CAGGTTCATG GCTTCGATAT CCCGCTGCTC GCCAACATCC AGCCGGCGGG TGCCTATCTC
GGCGAACGCT ACCACCGCGC GGGCGGCACA CCCGCCATCA TGTGGGAACT GCTGCAGGCC
GGAAAGCTCG ACGGTAGCTG TCGCACCGTC ACGGGAAAAT CCATGGCCGA GAACCTCGAG
GGACGCGAAT CGACGGACCG TGAGGTCATC AGGCGCTTTG AGGAGCCGCT CAGGGAAAAG
GCGGGCTTCC TGGTGCTGAA GGGCAATCTC TTCGACTTCG CGATCATGAA GATGAGCGTG
GTCTCAGACG ATTTCAGGAA GCGGTATCTT CAGGAGCCGG GGCGCGAGGG TGTTTTCGAA
GGCAAGGCCG TGGTCTTCGA TGGTTCGGAA GACTATCACA AGCGCATCAA CGACCCCCAA
CTCGACATCG ATGAAGACAC CATCCTGGTG ATCCGCGGTG CGGGGCCGCT CGGCTGGCCG
GGATCGGCGG AAGTCGTGAA CATGCAGCCG CCGGATCACC TCCTGAAGCG TGGCATCAAG
AGCCTGCCCA CCATCGGTGA CGGTCGCCAG TCCGGCACGG CGGACAGCCC CTCCATCCTG
AACGCCTCGC CGGAGAGTGC GGCCGGCGGC GGTCTCGCCT GGTTGCGGAA CGGCGATGTG
ATCCGGATCG ATTTCAACCT GGGCCTTTGC GACATGCTGG TTTCCGACGG CGAGATCGAA
AGGCGAAAGG CGGATGGCAT ACCGGCCGTG CCGTCCGACG CCACGCCCTG GCAGCGTATC
TATCGCAAAT CCGTCACCCA GCTTTCCGAT GGAGCGGTGC TCGAGGGTGC CGCAGACTTC
CGGCAAATCG CCAAAAACAT GCCGCGCCAC AATCACTAG
 
Protein sequence
MADKLQRRLR SQDWFDNPDH IDLTALYLER FMNYGVTPEE LRSGKPVIGI AQSGSDLTPC 
NRVHMDLAKR VRDGIRDAGG IPIEFPTHPI FENCKRPTAA LDRNLAYLGL VEILYGYPLD
GVVLTTGCDK TTPSALMAAS TVDIPAIVLS GGPMLDGWHE GDLVGSGTVI WRMRRKLAAG
EIDREEFMQA ALDSAPSVGH CNTMGTASTM NAMAEALGMS LTGCGAIPAA YRERGQMAYR
TGRRAVELVM EDLKPSDILT REAFLNAIRV NSAIGGSTNA QPHLAAMAKH AGVELYPDDW
QVHGFDIPLL ANIQPAGAYL GERYHRAGGT PAIMWELLQA GKLDGSCRTV TGKSMAENLE
GRESTDREVI RRFEEPLREK AGFLVLKGNL FDFAIMKMSV VSDDFRKRYL QEPGREGVFE
GKAVVFDGSE DYHKRINDPQ LDIDEDTILV IRGAGPLGWP GSAEVVNMQP PDHLLKRGIK
SLPTIGDGRQ SGTADSPSIL NASPESAAGG GLAWLRNGDV IRIDFNLGLC DMLVSDGEIE
RRKADGIPAV PSDATPWQRI YRKSVTQLSD GAVLEGAADF RQIAKNMPRH NH