Gene Smed_4867 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4867 
Symbol 
ID5318852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp1368718 
End bp1370493 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content63% 
IMG OID640776652 
Productsulfoacetaldehyde acetyltransferase 
Protein accessionYP_001313584 
Protein GI150376988 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR03457] sulfoacetaldehyde acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.562185 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATGA CCACCGAAGA AGCCTTCGTC AAAGTCCTCC AGATGCACGG CATCGAGCAT 
GCTTTCGGCA TCATCGGCTC GGCGATGATG CCCGTATCGG ATCTCTTTCC GAAGGCGGGC
ATCAAATTCT GGGACTGCGC CCATGAGACC AATGCCGGCA TGATGGCGGA TGGCTTCAGC
CGGGCCACCG GGACCATGTC GATGGCGATC GGCCAGAACG GTCCCGGCGT CACCGGCTTC
ATCACCGCCA TGAAGACGGC CTACTGGAAT CACACGCCGC TGCTGATGGT TACGCCGCAG
GCGGCCAACA AGACGATCGG GCAGGGCGGC TTCCAGGAAG TCGACCAGAT GGCCATGTTC
GAGGAGATGG TCTGCTATCA GGAAGAGGTG CGGGACCCGA GCCGCATTCC TGAAGTCCTG
AACCGAGTCA TCGAGAAGGC CTGGCGGGGT TGTGCGCCGG CGCAGATCAA CATCCCCAGG
GATTTCTGGA CGCATGTGAT CGACGTGGAT CTGCCGCGTA TCGTCCGCTT CGAGCGGCCG
GCCGGCGGAC CGGCGGCGAT CGCGCAGGCG GCCCGGCTGC TTTCGGAGGC GAAATTCCCG
GTCATTCTCA ATGGCGCCGG TGTCGTCATC GGCAATGCAA TTCAGGAATC GATGGCGCTC
GCCGAACGGC TGAATGCACC GGTGTGCTGC GGATATCAGC ATAACGACGC CTTTCCGGGC
AGCCATCGCC TGTCGGTCGG GCCGCTTGGC TATAACGGCT CGAAGGCGGC GATGGAACTG
ATCAGCAAGG CCGATGTCGT GCTGGCATTG GGCACGCGGC TCAACCCTTT CTCGACGCTG
CCGGGCTATG GCATCGACTA CTGGCCGAAG AATGCGTCCA TCATACAGGT CGACATCAAT
CCCGACCGGA TCGGCTTGAC GAAGAAGGTG ACGGTCGGGA TTTGCGGCGA CGCGAAACAG
GTGGCGCAGC AGATCCTTCA GCAGCTTGCC CCCGTCGCCG GCGAAGCAAG CCGCGAGGAA
CGCAAGGCGC TGGTCCACCA GACGCGCTCG GCGTGGCTGC AGCAGCTCTC TTCGATGGAC
CACGAGGATG ACGATCCGGG TACGGAGTGG AACGTTGGGG CGCGCCAACG GGAGCCCGAT
CGCATGTCGC CGCGCCAGGT CTGGCGCGCG ATCCAGGCCG TGCTTCCGAA AGAGGCGATC
ATCTCCACCG ACATCGGCAA CAACTGCGCC ATCGGCAACG CCTATCCGAG TTTCGAGCAG
GGCAGAAAGT ATCTGGCTCC CGGCATGTTC GGCCCCTGCG GCTATGGCTT CCCGTCGATC
GTGGGCGCCA AGATCGGCTG CCCGGATGTG CCGGTCGTCG GCTTTGCCGG CGACGGCGCC
TTCGGCATCT CGATGAACGA GATGACGTCG ATCGGCCGCG AGGGATGGCC GGCCATCACC
ATGGTAATCT TCCGGAACTA CCAGTGGGGT GCCGAGAAAC GCAATACGAC GCTCTGGTAC
GACAACAACT TCGTAGGCAC CGAGCTCAAT CCGAACCTGA GCTACGCCAA GGTTGCGGAC
GGCTCCGGGC TCAAAGGGAT CACCGTCGAC ACGCCGGCCG CACTCAAGGA GGCGCTTTCG
AAGGCGATCG AGGACCAGGC CAGGGGTATC ACCACCTTCG TCGAGGTCAT TCTCAATCAG
GAACTCGGCG AGCCCTTCCG GCGCGACGCG ATGAAGAAAC CGGTGGCGGT GGCGGGCATC
GATCGGGCCG ATATGCAGCC CCAGAGCCGC AGGTAG
 
Protein sequence
MKMTTEEAFV KVLQMHGIEH AFGIIGSAMM PVSDLFPKAG IKFWDCAHET NAGMMADGFS 
RATGTMSMAI GQNGPGVTGF ITAMKTAYWN HTPLLMVTPQ AANKTIGQGG FQEVDQMAMF
EEMVCYQEEV RDPSRIPEVL NRVIEKAWRG CAPAQINIPR DFWTHVIDVD LPRIVRFERP
AGGPAAIAQA ARLLSEAKFP VILNGAGVVI GNAIQESMAL AERLNAPVCC GYQHNDAFPG
SHRLSVGPLG YNGSKAAMEL ISKADVVLAL GTRLNPFSTL PGYGIDYWPK NASIIQVDIN
PDRIGLTKKV TVGICGDAKQ VAQQILQQLA PVAGEASREE RKALVHQTRS AWLQQLSSMD
HEDDDPGTEW NVGARQREPD RMSPRQVWRA IQAVLPKEAI ISTDIGNNCA IGNAYPSFEQ
GRKYLAPGMF GPCGYGFPSI VGAKIGCPDV PVVGFAGDGA FGISMNEMTS IGREGWPAIT
MVIFRNYQWG AEKRNTTLWY DNNFVGTELN PNLSYAKVAD GSGLKGITVD TPAALKEALS
KAIEDQARGI TTFVEVILNQ ELGEPFRRDA MKKPVAVAGI DRADMQPQSR R