Gene Smed_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_2008 
Symbol 
ID5322867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp2057495 
End bp2059273 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content61% 
IMG OID640790945 
Productacetolactate synthase 3 catalytic subunit 
Protein accessionYP_001327676 
Protein GI150397209 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.901977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAA CTGAGAACCA GATGACGGGC GCGGAGATCG TTCTCCAGGC TTTGAAAGAC 
AATGGCGTCG AACACATCTT CGGCTATCCC GGCGGCGCCG TGCTTCCAAT CTACGACGAG
ATTTTCCAGC AGGAGGACAT ACAGCACATC CTCGTCCGCC ACGAGCAGGG CGCCGGCCAC
ATGGCCGAAG GGTATGCCCG CTCCACCGGC AAGGTCGGCG TCATGCTGGT GACATCCGGC
CCGGGGGCGA CCAATGCGGT TACCCCGCTG CAGGACGCGC TGATGGACTC CATCCCTCTC
GTCTGCATCT CGGGGCAGGT ACCGACATCG CTGATCGGTT CTGACGCCTT CCAGGAGTGC
GACACGGTCG GAATTACGCG GCCTTGCACC AAGCACAACT GGCTGGTCAG GGACGTCAAC
GAACTCGCCC GCGTCATCCA CGAAGCTTTC CGCGTCGCCC AGTCGGGCCG CCCCGGACCG
GTCGTCGTCG ACATTCCGAA GGACATCCAG TTTGCGACCG GTGCCTATAC GCCGCCTTCG
GCCGTTCCAA CGCAGAAGAG CTATCAGCCG AAAACCCAGG GCGACCTGAA GAAGATCGAG
GAGGCGGTTG CGCTCATGAA GTCCGCGCGC CAGCCGGTTA TCTATTCGGG CGGCGGCGTC
ATCAATTCCG GCCCGCAAGC CGCACATTTC CTGCGCGAGC TGGTCGAACT CACCGATTTT
CCGATCACGT CGACGCTGAT GGGCCTCGGC GCCTATCCTG CCTCCGGCAA GAACTGGCTT
GGCATGCTCG GCATGCATGG AACCTATGAA GCCAATCTCG CCATGCACGA TTGTGACGTC
ATGATCTGCA TAGGAGCGCG CTTTGACGAC CGTATCACCG GCCGGCTCAA TGCCTTCTCG
CCGAATTCAA AGAAGATCCA CATCGATATC GACCCGTCCT CGATCAACAA GAACGTCCGT
GTCGACGTTC CGATCATCGG CGACGTCGCC GCAGTCCTCG AGGACATGGT TCGCCTGTGG
CGTGCTGCGG CCAAGACCGT CGACCGGACG CGGCTCGAGG ATTGGTGGAA ATCGATCACC
AAGTGGCGCG CGCGCAATTC GCTCGCCTAT ACGCCGAGTG CCGATGTCAT CATGCCGCAA
TATGCGATCC AGCGGCTTTA CGAGCTCACC AAGGACCGCG ACACCTACAT CACCACCGAA
GTGGGACAGC ACCAGATGTG GGCGGCGCAG TTCTTCGGAT TCGAGGAGCC GAACCGGTGG
ATGACGTCGG GCGGCCTCGG CACCATGGGC TACGGCTTCC CGGCCGCCGT GGGCGTTCAG
GTTGCGCATC CGGATAGCCT CGTCATCGAT ATCGCGGGCG ACGCCTCGAT CCAGATGTGC
ATCCAGGAAA TGTCCTGCGC GGTTCAGTAC GGCCTGCCGG TCAAGATCTT CATCCTGAAC
AATCAATATA TGGGCATGGT CCGGCAGTGG CAGCAACTGC TCCACGGCAA CCGACTGTCG
CATTCCTATA CCGAGGCGAT GCCTGACTTC GTCAAGCTCG CCGAGGCCTA TGGCGGCGTT
GGCATCCGTT GCGAAAAGCC GGGAGAGCTC GACGAAGCAA TCAAGCAGAT GATCGATACT
CCGGCTCCGG TCATTTTCGA TTGCCGAGTC GCAAATCTCG CCAATTGCTT CCCGATGATC
CCCTCGGGCA AGGCGCATAA CGAAATGCTG CTCCCCGACG AGGCCACGGA CGAGGCGGTT
GCCAACGCCA TCGACGCCAA GGGCCGCCAG CTCGTTTGA
 
Protein sequence
MSGTENQMTG AEIVLQALKD NGVEHIFGYP GGAVLPIYDE IFQQEDIQHI LVRHEQGAGH 
MAEGYARSTG KVGVMLVTSG PGATNAVTPL QDALMDSIPL VCISGQVPTS LIGSDAFQEC
DTVGITRPCT KHNWLVRDVN ELARVIHEAF RVAQSGRPGP VVVDIPKDIQ FATGAYTPPS
AVPTQKSYQP KTQGDLKKIE EAVALMKSAR QPVIYSGGGV INSGPQAAHF LRELVELTDF
PITSTLMGLG AYPASGKNWL GMLGMHGTYE ANLAMHDCDV MICIGARFDD RITGRLNAFS
PNSKKIHIDI DPSSINKNVR VDVPIIGDVA AVLEDMVRLW RAAAKTVDRT RLEDWWKSIT
KWRARNSLAY TPSADVIMPQ YAIQRLYELT KDRDTYITTE VGQHQMWAAQ FFGFEEPNRW
MTSGGLGTMG YGFPAAVGVQ VAHPDSLVID IAGDASIQMC IQEMSCAVQY GLPVKIFILN
NQYMGMVRQW QQLLHGNRLS HSYTEAMPDF VKLAEAYGGV GIRCEKPGEL DEAIKQMIDT
PAPVIFDCRV ANLANCFPMI PSGKAHNEML LPDEATDEAV ANAIDAKGRQ LV