Gene Smed_2723 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2723 
Symbol 
ID5323593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2835974 
End bp2837812 
Gene Length1839 bp 
Protein Length612 aa 
Translation table11 
GC content64% 
IMG OID640791668 
Productdihydroxy-acid dehydratase 
Protein accessionYP_001328388 
Protein GI150397921 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGCCT ATCGCTCCCG CACCACCACC CACGGCCGCA ACATGGCCGG CGCCCGCGGC 
CTCTGGCGCG CGACGGGCAT GAAGGACAGC GACTTCGGCA AACCCATTAT TGCGGTGGTG
AACTCCTTTA CGCAGTTCGT ACCGGGCCAC GTCCATCTCA AGGACCTTGG CCAGCTCGTC
GCGCGGGAAA TCGAGGCGGC GGGCGGCGTC GCCAAGGAAT TCAACACGAT TGCCGTCGAC
GACGGCATCG CCATGGGCCA TGACGGCATG CTCTATTCGC TGCCCTCGCG CGAGATCATC
GCCGACAGCG TCGAATATAT GGTCAACGCC CATTGCGCCG ACGCCATGGT CTGCATCTCC
AATTGCGACA AGATCACCCC CGGCATGCTG ATGGCGGCAC TGCGCCTCAA TATTCCGGCC
GTCTTCGTTT CCGGCGGCCC CATGGAGGCC GGCAAGGTGG TTCTGCACGG AAAGACGCAT
GCGCTCGACC TCGTCGACGC GATGGTCGCG GCCGCCGACG ACAAGGTGTC CGACGAGGAC
GTCCAGATCA TCGAGCGCTC CGCCTGCCCG ACCTGCGGCT CCTGCTCCGG TATGTTCACA
GCCAACTCGA TGAACTGTCT GACCGAAGCT CTCGGCCTGT CGCTGCCCGG CAATGGCTCG
ACGCTTGCGA CCCATGCCGA CCGCAAGCGT CTGTTCGTCG AAGCCGGTCA CCTCATCGTC
GACATTGCCC GCCGCTATTA CGAGCAGGAG GACGAGCGCG TTCTGCCGCG CTCGATCGCC
AGCAAACAGG CCTTCGAAAA TGCCATGGCG CTCGACATCG CCATGGGCGG CTCGACCAAC
ACGGTGCTCC ACATCCTTGC CGCGGCTTAC GAAGGCGAGA TCGACTTCAC CATGGACGAC
ATCGACCGGC TGTCGCGCAA GGTGCCGTGC TTGTCAAAAG TCGCGCCGGC AAAAGCCGAC
GTGCATATGG AAGACGTGCA CCGGGCCGGC GGCATCATGT CGATCCTCGG CGAACTGGAC
AAAGGCGGCC TGATCAACCG CGACTGCCCG ACGGTACATG CCGAAACACT CGGCGACGCC
ATCGACCGCT GGGACATCAC CCGCACGTCG AGCGAAACGG TCCGAAACTT CTTCCGCGCC
GCTCCTGGCG GCATTCCGAC GCAGACCGCC TTCAGCCAGG CGGCCCGCTG GGACGAACTC
GATACCGATC GCCAGAACGG CGTCATCCGT TCGGTGGAAC ATCCCTTCTC CAAGGATGGC
GGCCTTGCGG TGCTCAAGGG CAACATCGCG CTTGACGGCT GCATCGTGAA AACCGCGGGC
GTCGACGAGA GTATCCTGAA ATTCTCCGGT CCGGCACGAG TCTTCGAAAG CCAGGACTCG
GCCGTGAAGG GGATCCTCGC CAATGAGGTC AAGGCCGGCG ACGTCGTCGT CATCCGCTAT
GAAGGACCGA AGGGCGGGCC GGGCATGCAG GAAATGCTCT ATCCGACGAG CTACCTGAAG
TCGAAGGGTC TCGGCAAGGC CTGCGCATTG ATCACCGACG GCCGCTTCTC CGGCGGTACG
TCCGGTCTTT CGATCGGCCA TGTCTCGCCC GAGGCGGCGA ATGGCGGAAC GATCGGGCTC
GTGCGCGAAG GTGACATGAT CGACATCGAC ATTCCGAACC GCACCATCGT TCTCCGGGTC
GACGAGGCCG AACTTGCCGC TCGCCGCAAG GAGCAGGATG CCAAGGGCTG GAAGCCGGTC
GAGCAGCGCA AGCGCCGGGT GACGACCGCG CTGAAGGCCT ATGCGGCTTT CGCGACCTCC
GCCGACCGCG GCGCCGTGAG AGATCTGGGC GACCGCTAA
 
Protein sequence
MPAYRSRTTT HGRNMAGARG LWRATGMKDS DFGKPIIAVV NSFTQFVPGH VHLKDLGQLV 
AREIEAAGGV AKEFNTIAVD DGIAMGHDGM LYSLPSREII ADSVEYMVNA HCADAMVCIS
NCDKITPGML MAALRLNIPA VFVSGGPMEA GKVVLHGKTH ALDLVDAMVA AADDKVSDED
VQIIERSACP TCGSCSGMFT ANSMNCLTEA LGLSLPGNGS TLATHADRKR LFVEAGHLIV
DIARRYYEQE DERVLPRSIA SKQAFENAMA LDIAMGGSTN TVLHILAAAY EGEIDFTMDD
IDRLSRKVPC LSKVAPAKAD VHMEDVHRAG GIMSILGELD KGGLINRDCP TVHAETLGDA
IDRWDITRTS SETVRNFFRA APGGIPTQTA FSQAARWDEL DTDRQNGVIR SVEHPFSKDG
GLAVLKGNIA LDGCIVKTAG VDESILKFSG PARVFESQDS AVKGILANEV KAGDVVVIRY
EGPKGGPGMQ EMLYPTSYLK SKGLGKACAL ITDGRFSGGT SGLSIGHVSP EAANGGTIGL
VREGDMIDID IPNRTIVLRV DEAELAARRK EQDAKGWKPV EQRKRRVTTA LKAYAAFATS
ADRGAVRDLG DR