Gene Smed_0434 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0434 
Symbol 
ID5321268 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp470311 
End bp472035 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content62% 
IMG OID640789369 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001326126 
Protein GI150395659 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.325257 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.619807 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGACA AGAAAAAAGA ACTGAGAAGC CGTCATTGGT ATGGTGGCAC GCACAAAGAC 
GGCTTCATTC ATCGTTCCTG GATGAAGAAC CAGGGCTTTC CCGATCATGT TTTCGACGGA
CGGCCGATCA TCGGCATCTG CAACACCTGG TCGGAGCTCA CGCCCTGCAA CAGCCATCTG
CGCATTCTTG CCGAAGGTGT GAAGCGTGGC GTTTGGGAAG CGGGAGGCTT TCCGGTGGAG
TTTCCGGTGT CGTCGCTCGG GGAGACGCAG ATGCGCCCGA CCGCGATGCT CTTCCGCAAT
CTGCTCGCAA TGGACGTCGA AGAGGCGATC CGCGCCTATG ACATCGACGG GGTCGTGCTG
CTCGGCGGCT GCGACAAGAC CACCCCGGGC CAACTGATGG GCGCGGCCTC GGTCGATCTC
CCGACGATCG TGGTGTCCTC CGGCCCTATG CTGAACGGCA AGTGGAAGGG AAAGGACATC
GGCTCGGGCA CGGATGTCTG GAAATTCTCC GAAGCCGTGC GCGCCGGTGA AATGAGCCTG
CAGGAATTCA TGGCCGCCGA AAGCGGCATG TCGCGTTCGC CGGGTGTCTG CATGACCATG
GGCACCGCGA CCACTATGGC TTCAGTCGTG GAAGCCATGG GCTTATCGCT GCCGACAAAC
GCCGCCCTGC CCGCAGTCGA CGCTCGCCGC ATGGCGCTCG CGCATATGAC CGGCAAGCGC
ATCGTCGAAA TGGTGCATGA GGATCTGAGG CTGTCGAAGA TCCTGACGAA GGAGAACTTC
GAGAACGGCA TTATCGCCAA TGCCGCCGTG GGCGGCTCGA CCAACGCGGT AGTACACATG
CTGGCGATCG CCGGGCGTGC GGGTATCGAT CTCTGTCTTG AGGATTTCGA TAGGGTGGGC
GGCCAGGTGC CTTGCATCGT CAACTGCATG CCATCGGGAA AGTATCTGAT CGAAGATCTC
GCTTATGCGG GCGGCCTGCC CGCCGTGATG AGTCGTATCC AGCACCTGCT TCATGCCGAC
GCGCCAACCG TTTTCGGCGT TCCGATCAGT AAATACTGGG AGGGTGCAGA GGTCTATAAC
GACGACGTCA TCCGCCCGCT GGACAACCCG CTGCGCGCCG CGGCCGGCAT TCGCGTCCTG
AAGGGCAATC TCGCGCCCAA CGGCGCGGTG ATCAAGCCGT CGGCAGCGAG CGAACACCTT
CTGACCCACG AAGGACCCGC CTTTGTCTTC GAGACAATCG AAGACCTTAG GGCCAGGATC
GACGATCCTG ACCTGCCGGT GACCGAAAAC ACGATCCTCG TTCTCAAGGG TTGCGGCCCG
AAGGGATATC CAGGCATGGC CGAGGTCGGC AACATGCCGA TTCCGCGAAG GCTCGTCGAA
AGGGGCGTGC GCGACATGGT ACGCATCTCG GATGCACGCA TGTCCGGCAC CGCTTTCGGC
ACGGTGGTTC TCCATGTAAG CCCGGAAGCC GATGCGGGCG GCCCGCTGGC GATCGTCCGG
ACCGGAGACC TGATCCGTCT CGACGCAATG AAGGGCGAAT TGAACCTGCT CATCGGCGAG
GAAGAGCTGG CGGCCCGCAT GGCGGCCTGG CGGCCGCCGG AAAAGAAATG GCAGCGAGGC
TATTACAAAC TCTATCACGA CACCGTGCTG CAGGCCGACA AGGGTGCCGA CCTCGATTTC
CTCGTCGGCA AGAGCGGCAG CGAGGTGCTC CGTGAAAGTC ACTGA
 
Protein sequence
MSDKKKELRS RHWYGGTHKD GFIHRSWMKN QGFPDHVFDG RPIIGICNTW SELTPCNSHL 
RILAEGVKRG VWEAGGFPVE FPVSSLGETQ MRPTAMLFRN LLAMDVEEAI RAYDIDGVVL
LGGCDKTTPG QLMGAASVDL PTIVVSSGPM LNGKWKGKDI GSGTDVWKFS EAVRAGEMSL
QEFMAAESGM SRSPGVCMTM GTATTMASVV EAMGLSLPTN AALPAVDARR MALAHMTGKR
IVEMVHEDLR LSKILTKENF ENGIIANAAV GGSTNAVVHM LAIAGRAGID LCLEDFDRVG
GQVPCIVNCM PSGKYLIEDL AYAGGLPAVM SRIQHLLHAD APTVFGVPIS KYWEGAEVYN
DDVIRPLDNP LRAAAGIRVL KGNLAPNGAV IKPSAASEHL LTHEGPAFVF ETIEDLRARI
DDPDLPVTEN TILVLKGCGP KGYPGMAEVG NMPIPRRLVE RGVRDMVRIS DARMSGTAFG
TVVLHVSPEA DAGGPLAIVR TGDLIRLDAM KGELNLLIGE EELAARMAAW RPPEKKWQRG
YYKLYHDTVL QADKGADLDF LVGKSGSEVL RESH