Gene Information |
Locus tag | Smed_4478 |
Symbol | |
ID | 5318343 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 960818 |
End bp | 962563 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776279 |
Product | dihydroxy-acid dehydratase |
Protein accession | YP_001313211 |
Protein GI | 150376615 |
COG category | [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism |
COG ID | [COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase |
TIGRFAM ID | [TIGR00110] dihydroxy-acid dehydratase |
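
The start, end, gene length, and protein length reported above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming 1-based inclusive coordinates and that the reported span includes the terminal stop codon (consistent with the gene sequence ending in TGA). The function name `expected_lengths` is illustrative and not part of any database API.

```python
from typing import Tuple


def expected_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the span
    includes one terminal stop codon that is not counted as a residue.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_len, gene_len // 3 - 1


gene_len, protein_len = expected_lengths(960818, 962563)
print(gene_len, protein_len)  # 1746 581
```

Both values match the Gene Length (1746 bp) and Protein Length (581 aa) rows of the record.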
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.724246 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAAGA AAGCTGAGTG GCCGCGCAAG CTCCGTTCGC AGGATTGGTT CGGCGGTACT GGCAAGAATG CCATCATGCA TCGCTCCTGG ATGAAAAATC AGGGGCTTCC CGCTGACACT TTTGACGGGC GGCCGATCAT CGGCATCTGC AATACCTGGT CCGAGCTCAC GCCGTGCAAC GCGCATCTTC GCGACCTTGC AGAGCGCGTG AAGCGCGGGG TCTACGAGGC GGGGGGCTTC CCGGTGGAAT TCCCCGTCTT TTCGACCGGG GAAAGCACAC TTCGCCCGAC GGCGATGATG TTCCGCAACC TCGCGGCCAT GGATGTCGAG GAATCGATTC GCGGCAATCC GATCGACGGC GTCGTGCTGC TCGGCGGCTG CGACAAGACG ACCCCCAGCC TCCTGATGGG AGCGGCCAGC GTCGACATTC CGGCGATCGT CGTTTCCGGC GGTCCCATGC TCAACGGAAA GTGGCGCGGC AAGGATGTCG GTTCGGGCAC CGCGATCTGG CAATTCTCGG AAATGGTCAA ATCCGGTGAG ATGACGCTGG AGGAGTTCAT GGACGCCGAG CAGGGCATGG CCCGTTCCGC GGGAAGCTGT ATGACCATGG GTACGGCCTC GACCATGGCG TCGATGGCCG AAGCGCTCGG CATGACCCTT TCGGGCAACG CCGCCATTCC GGCGGTGGAT GCGCGCCGTC GGGTTATTTC GCAGCTTACC GGACGCCGCA TCGTCGAGAT GGTCAAGGAG GATCTGAAGC CCTCCGACAT ACTGACCAAG GAGGCATTCG AGAACGCTAT CCGCGTCAAC GGCGCCGTCG GCGGCTCCAC CAATGCGGTG CTGCATCTTC TCGCGCTGGC AGGTCGCGTC GGCGTTGATC TATCGCTCGA CGACTGGGAC AGGCTCGGCC GCGACGTGCC CACCATCGTC AACCTCCAGC CCTCCGGCAA GTATCTGATG GAGGAATTCT ATTATGCCGG CGGCCTGCCG GTCGTGATCA AGGCGGTCGC GGAGATGGGC CTGCTGCACA ATGATGCCAT CACCGTCAGC GGCGACACGA TCTGGAACGA CGTCAAGGGC GTGGTCAACT ACAACGACGA CGTGATCCTG CCGCGGGAAA GGGCGCTCAC GAAATCGGGT GGTATCGCAG TGCTGCGCGG CAATCTCGCG CCGCGCGGCG CCGTCCTGAA GCCATCAGCT GCCTCGCCGA ACCTGATGCA GCACAAGGGC CGCGCGGTCG TTTTCGAGAG CATCGAGGAT TATCACGCAC GCATCAACCG TGAGGACCTC GACATCGACG AGACCTGCAT CATGGTGCTG AAATATTGCG GTCCCAAAGG CTATCCCGGC ATGGCCGAGG TCGGCAATAT GGGTCTGCCG CCCAAGGTTC TGAAAAAGGG AGTGACGGAC ATGATCCGCA TTTCGGATGC GCGTATGTCG GGCACGGCCT ACGGTACGGT CATACTCCAT ACCGCGCCGG AAGCGGCGGA AGGCGGTCCC TTGGCGCTCG TGGAGAACGG CGATCTGATC GAGGTCGACA TCCCGAACCG CACGCTGCAT CTCCATGTTT CCGACGAGGA ACTTGCGCGT CGCCGCGCGG CCTGGGTGTC GCCGGTCAAG CCGCTGAGCG GCGGCTATGG CAGCCTCTAT ATGAAGACGG TCATGCAGGC CGACGCCGGC GCCGATCTGG ACTTCCTTGT CGGCCCACGT GGCGATCGCA TTCCGATGGA TCGAGACAGC CATTGA
|
Protein sequence | MKKKAEWPRK LRSQDWFGGT GKNAIMHRSW MKNQGLPADT FDGRPIIGIC NTWSELTPCN AHLRDLAERV KRGVYEAGGF PVEFPVFSTG ESTLRPTAMM FRNLAAMDVE ESIRGNPIDG VVLLGGCDKT TPSLLMGAAS VDIPAIVVSG GPMLNGKWRG KDVGSGTAIW QFSEMVKSGE MTLEEFMDAE QGMARSAGSC MTMGTASTMA SMAEALGMTL SGNAAIPAVD ARRRVISQLT GRRIVEMVKE DLKPSDILTK EAFENAIRVN GAVGGSTNAV LHLLALAGRV GVDLSLDDWD RLGRDVPTIV NLQPSGKYLM EEFYYAGGLP VVIKAVAEMG LLHNDAITVS GDTIWNDVKG VVNYNDDVIL PRERALTKSG GIAVLRGNLA PRGAVLKPSA ASPNLMQHKG RAVVFESIED YHARINREDL DIDETCIMVL KYCGPKGYPG MAEVGNMGLP PKVLKKGVTD MIRISDARMS GTAYGTVILH TAPEAAEGGP LALVENGDLI EVDIPNRTLH LHVSDEELAR RRAAWVSPVK PLSGGYGSLY MKTVMQADAG ADLDFLVGPR GDRIPMDRDS H
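
The sequences above are printed in space-separated blocks of 10 characters, so the spacing has to be stripped before any computation. As a minimal sketch (the helper names `clean_sequence` and `gc_percent` are hypothetical, not from the source database), the GC content reported in the Gene Information table could be recomputed from the gene sequence like this:

```python
def clean_sequence(spaced_sequence: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(spaced_sequence.split()).upper()


def gc_percent(spaced_sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(spaced_sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in ("G", "C"))
    return 100.0 * gc_count / len(seq)


# Demo on the first three blocks of the gene sequence only;
# paste the full "Gene sequence" field to check the reported value.
demo = "ATGAAGAAGA AAGCTGAGTG GCCGCGCAAG"
print(len(clean_sequence(demo)), round(gc_percent(demo), 1))
```

Applied to the full gene sequence, the cleaned length should equal the reported gene length, and the percentage can be compared with the rounded GC content in the record.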