Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Smed_4867 |
Symbol | |
ID | 5318852 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009620 |
Strand | + |
Start bp | 1368718 |
End bp | 1370493 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640776652 |
Product | sulfoacetaldehyde acetyltransferase |
Protein accession | YP_001313584 |
Protein GI | 150376988 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] |
TIGRFAM ID | [TIGR03457] sulfoacetaldehyde acetyltransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.562185 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGA CCACCGAAGA AGCCTTCGTC AAAGTCCTCC AGATGCACGG CATCGAGCAT GCTTTCGGCA TCATCGGCTC GGCGATGATG CCCGTATCGG ATCTCTTTCC GAAGGCGGGC ATCAAATTCT GGGACTGCGC CCATGAGACC AATGCCGGCA TGATGGCGGA TGGCTTCAGC CGGGCCACCG GGACCATGTC GATGGCGATC GGCCAGAACG GTCCCGGCGT CACCGGCTTC ATCACCGCCA TGAAGACGGC CTACTGGAAT CACACGCCGC TGCTGATGGT TACGCCGCAG GCGGCCAACA AGACGATCGG GCAGGGCGGC TTCCAGGAAG TCGACCAGAT GGCCATGTTC GAGGAGATGG TCTGCTATCA GGAAGAGGTG CGGGACCCGA GCCGCATTCC TGAAGTCCTG AACCGAGTCA TCGAGAAGGC CTGGCGGGGT TGTGCGCCGG CGCAGATCAA CATCCCCAGG GATTTCTGGA CGCATGTGAT CGACGTGGAT CTGCCGCGTA TCGTCCGCTT CGAGCGGCCG GCCGGCGGAC CGGCGGCGAT CGCGCAGGCG GCCCGGCTGC TTTCGGAGGC GAAATTCCCG GTCATTCTCA ATGGCGCCGG TGTCGTCATC GGCAATGCAA TTCAGGAATC GATGGCGCTC GCCGAACGGC TGAATGCACC GGTGTGCTGC GGATATCAGC ATAACGACGC CTTTCCGGGC AGCCATCGCC TGTCGGTCGG GCCGCTTGGC TATAACGGCT CGAAGGCGGC GATGGAACTG ATCAGCAAGG CCGATGTCGT GCTGGCATTG GGCACGCGGC TCAACCCTTT CTCGACGCTG CCGGGCTATG GCATCGACTA CTGGCCGAAG AATGCGTCCA TCATACAGGT CGACATCAAT CCCGACCGGA TCGGCTTGAC GAAGAAGGTG ACGGTCGGGA TTTGCGGCGA CGCGAAACAG GTGGCGCAGC AGATCCTTCA GCAGCTTGCC CCCGTCGCCG GCGAAGCAAG CCGCGAGGAA CGCAAGGCGC TGGTCCACCA GACGCGCTCG GCGTGGCTGC AGCAGCTCTC TTCGATGGAC CACGAGGATG ACGATCCGGG TACGGAGTGG AACGTTGGGG CGCGCCAACG GGAGCCCGAT CGCATGTCGC CGCGCCAGGT CTGGCGCGCG ATCCAGGCCG TGCTTCCGAA AGAGGCGATC ATCTCCACCG ACATCGGCAA CAACTGCGCC ATCGGCAACG CCTATCCGAG TTTCGAGCAG GGCAGAAAGT ATCTGGCTCC CGGCATGTTC GGCCCCTGCG GCTATGGCTT CCCGTCGATC GTGGGCGCCA AGATCGGCTG CCCGGATGTG CCGGTCGTCG GCTTTGCCGG CGACGGCGCC TTCGGCATCT CGATGAACGA GATGACGTCG ATCGGCCGCG AGGGATGGCC GGCCATCACC ATGGTAATCT TCCGGAACTA CCAGTGGGGT GCCGAGAAAC GCAATACGAC GCTCTGGTAC GACAACAACT TCGTAGGCAC CGAGCTCAAT CCGAACCTGA GCTACGCCAA GGTTGCGGAC GGCTCCGGGC TCAAAGGGAT CACCGTCGAC ACGCCGGCCG CACTCAAGGA GGCGCTTTCG AAGGCGATCG AGGACCAGGC CAGGGGTATC ACCACCTTCG TCGAGGTCAT TCTCAATCAG GAACTCGGCG AGCCCTTCCG GCGCGACGCG ATGAAGAAAC CGGTGGCGGT GGCGGGCATC GATCGGGCCG ATATGCAGCC CCAGAGCCGC AGGTAG
|
Protein sequence | MKMTTEEAFV KVLQMHGIEH AFGIIGSAMM PVSDLFPKAG IKFWDCAHET NAGMMADGFS RATGTMSMAI GQNGPGVTGF ITAMKTAYWN HTPLLMVTPQ AANKTIGQGG FQEVDQMAMF EEMVCYQEEV RDPSRIPEVL NRVIEKAWRG CAPAQINIPR DFWTHVIDVD LPRIVRFERP AGGPAAIAQA ARLLSEAKFP VILNGAGVVI GNAIQESMAL AERLNAPVCC GYQHNDAFPG SHRLSVGPLG YNGSKAAMEL ISKADVVLAL GTRLNPFSTL PGYGIDYWPK NASIIQVDIN PDRIGLTKKV TVGICGDAKQ VAQQILQQLA PVAGEASREE RKALVHQTRS AWLQQLSSMD HEDDDPGTEW NVGARQREPD RMSPRQVWRA IQAVLPKEAI ISTDIGNNCA IGNAYPSFEQ GRKYLAPGMF GPCGYGFPSI VGAKIGCPDV PVVGFAGDGA FGISMNEMTS IGREGWPAIT MVIFRNYQWG AEKRNTTLWY DNNFVGTELN PNLSYAKVAD GSGLKGITVD TPAALKEALS KAIEDQARGI TTFVEVILNQ ELGEPFRRDA MKKPVAVAGI DRADMQPQSR R
|
| |