Gene Msil_3009 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3009 
Symbol 
ID7093504 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3322095 
End bp3323852 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content62% 
IMG OID643466320 
Productacetolactate synthase, large subunit, biosynthetic type 
Protein accessionYP_002363282 
Protein GI217979135 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.161058 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAATT TGATCACCGG AGCCGAGATG GTCGTCCGGG CGCTGCAGGA CCAGGGCGTC 
GACAGTATCT TCGGCTATCC GGGCGGCGCG GTGCTGCCGA TCTATGACGC CCTGTTCCAT
CAGAACCAGA TCGTTCACGT GCTGGTGCGC CATGAGCAGG GCGCGGCCCA TGCGGCGGAG
GGCTATGCGC GCTCGAGCGG CAAGGTCGGC GTCGTTCTCG TGACCTCGGG GCCCGGCGCG
ACCAACGCCA TCACCGGCCT CACCGACGCG TTGATGGATT CGATCCCGCT GGTTTGCATC
ACCGGACAGG TTCCGACGCA TCTCATCGGC TCCGACGCGT TCCAGGAATG CGACACGGTT
GGCATCACTC GCCATTGCAC CAAGCATAAT TACCTCGTGC GTTCGATCGA GGACCTGCCG
CGCGTTCTGC ATGAGGCGTT CTACGTCGCG CAGACGGGCC GCCCCGGTCC CGTCGTCATC
GACATCCCGA AGGACGTTCA ATTCGCGCTT GGCGATTACT TCGGCCCGCA CAAGATCGAG
CACAAGACCT ATAAGCCGAG GCTCGACGGC GACGCCGAGA AGATCGAGCA CGCCGTCACC
ATGATGCTTG CGGCGCGTCG GCCGGTTTTT TACACGGGCG GCGGCGTGAT CAATTCGGGG
CCGGAAGCCT CGCGCCTCCT GCGCGAGCTC GTCGAGCTGA CCGGCTTTCC GATCACCTCT
ACCCTGATGG GCCTCGGCTC CTATCCGGCT TCGGGCGACA AATGGCTCGG CATGCTTGGC
ATGCATGGGA CGTTCGAGGC CAATAACGCC ATGCATGATT GCGATCTCAT GATCGCCGTC
GGATCGCGTT TCGACGACCG CATCACCGGC CGGCTCGACG CTTTCTCGCC CGGCTCGAAG
AAGATCCACA TCGACATCGA TCCCTCCTCG ATCAACAAGA ACGTCAAGAT CGATCTCGGC
ATTATCGGCG ATTGCGCCCA TGTGCTGCGG CAGATGTTAG ACGCCTATCG CGCGCGGAAA
TCGGCGCCCG ACGAAGCGGC GCTGACCCGC TGGTGGCAGG AGATTAACAA ATGGCGCGCG
CGCAAGTCGC TCTCCTTCAA GCAGTCGAGC GCGGTGATCA AGCCGCAATA TGCGGTGCAG
CGCCTGTATG AGCTGACGAA GAATCGCGAC ACCTACATTA CGACGGAAGT CGGCCAGCAT
CAGATGTGGG CGGCGCAGCA TTATCATTTC GAGGAGCCGA ACCGCTGGAT GACCAGCGGC
GGGCTCGGCA CGATGGGCTA CGGCCTGCCG GCGGCGATCG GCGCGCAGAT CGCCCATCCG
GGCGCGCTCG TCGTCGACAT CGCGGGCGAA GCTTCGATTC TGATGAACAT CCAGGAGCTG
TCGACCGCCA TACAATTCCG CCTGCCGGTC AAGATCTTCA TCCTCAACAA TGAATATATG
GGGATGGTCA GGCAATGGCA GGAGCTGCTG CATGGCGGAC GCCTGTCGCA GAGCTATTCG
GAGGCGCTGC CGGATTTCGT CAAGCTCGCC GAAGCCTATG GCGCGCAAGG CATCCGCTGC
TCGGACCCCG CAAGTCTCGA TGACGCCATC ATCGAGATGA TCGATTCGCC GCGTACCGTG
GTGTTCGACT GCATTGTCGA CAAGACCGAA AACTGCCTAC CGATGATTCC CTCGGGCAAG
GCCCATAATG AAATGCTGAT GCCCGACGAA GACGATATAG AGGCCGTGAT CGACGCGGCC
GGCAAGATGC TGGTTTGA
 
Protein sequence
MSNLITGAEM VVRALQDQGV DSIFGYPGGA VLPIYDALFH QNQIVHVLVR HEQGAAHAAE 
GYARSSGKVG VVLVTSGPGA TNAITGLTDA LMDSIPLVCI TGQVPTHLIG SDAFQECDTV
GITRHCTKHN YLVRSIEDLP RVLHEAFYVA QTGRPGPVVI DIPKDVQFAL GDYFGPHKIE
HKTYKPRLDG DAEKIEHAVT MMLAARRPVF YTGGGVINSG PEASRLLREL VELTGFPITS
TLMGLGSYPA SGDKWLGMLG MHGTFEANNA MHDCDLMIAV GSRFDDRITG RLDAFSPGSK
KIHIDIDPSS INKNVKIDLG IIGDCAHVLR QMLDAYRARK SAPDEAALTR WWQEINKWRA
RKSLSFKQSS AVIKPQYAVQ RLYELTKNRD TYITTEVGQH QMWAAQHYHF EEPNRWMTSG
GLGTMGYGLP AAIGAQIAHP GALVVDIAGE ASILMNIQEL STAIQFRLPV KIFILNNEYM
GMVRQWQELL HGGRLSQSYS EALPDFVKLA EAYGAQGIRC SDPASLDDAI IEMIDSPRTV
VFDCIVDKTE NCLPMIPSGK AHNEMLMPDE DDIEAVIDAA GKMLV