Gene EcSMS35_4134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4134 
SymbolilvG 
ID6144247 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4230348 
End bp4231994 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content54% 
IMG OID641618957 
Productacetolactate synthase 2 catalytic subunit 
Protein accessionYP_001746089 
Protein GI170684034 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase] 
TIGRFAM ID[TIGR00118] acetolactate synthase, large subunit, biosynthetic type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.82212 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.957886 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGCG CACAGTGGGT GGTACATGCG TTGCGGGCAC AGGGTGTGAA TACCGTTTTC 
GGTTATCCGG GTGGCGCAAT TATGCCGGTT TACGATGCAT TGTATGACGG CGGCATGGAG
CACTTGCTGT GCCGACATGA ACAGGGTGCG GCAATGGCGG CTATCGGTTA TGCCCGTGCT
ACTGGCAAAA CTGGCGTATG TATCGCCACG TCTGGTCCGG GCGCAACCAA CCTGATAACC
GGGCTTGCGG ACGCACTGTT AGATTCCATC CCCGTTGTTG CTATCACCGG TCAGGTATCC
GCACCGTTTA TCGGCACTGA CGCATTTCAG GAGGTGGATA TTCTGGGGTT GTCGTTAGCC
TGCACCAAGC ACAGTTTCCT GGTGCAATCA CTGGAAGAGT TACCGCGTAT CATGGCCGAA
GCGTTTGACG TCGCCAGCTC TGGTCGTCCT GGTCCGGTTT TGGTCGATAT TCCAAAAGAT
ATCCAGTTAG CCAGCGGCGA TCTGGAACCG TGGTTCACCA CCGTTGAAAA CGAAGTGACT
TTCCCCCATG CCGAAGTCGA GCAAGCGCGC CAGATGCTGG CAAAAGCGCA AAAACCGATG
CTGTACGTTG GCGGTGGCGT TGGCATGGCG CAGGCAGTTC CGGCTTTGCG TGAATTTCTC
GCTACCACAA AAATGCCTGC CACCTGCACG CTGAAAGGGC TGGGCGCAGT TGAAGCAGAT
TATCCGTACT ATCTGGGCAT GCTGGGAATG CACGGCACCA AAGCGGCGAA CTTCGCGGTT
CAGGAGTGCG ATCTACTGAT AGCCGTGGGC GCACGTTTTG ATGACCGGGT GACCGGCAAA
CTGAACACCT TCGCACCACA CGCCAGCGTT ATCCATATGG ATATCGATCC GGCAGAAATG
AACAAGCTGC GTCAGGCACA TGCGGCATTA CAGGGTGATT TAAATGCTCT GTTACCAGCA
TTACAGCAGC CGTTAAATAT CGATGACTGG CAGCAACACT GCGCACAACT GCGTGATGAA
CATGCCTGGC GTTACGACCA TCCTGGTGAC GCTATCTACG CGCCGTTGTT ATTAAAACAA
CTGTCGGATC GTAAACCTGC GGATTGCATC GTGACCACAG ATGTGGGGCA GCACCAGATG
TGGTCTGCCC AGCACATCGT CCACACTCGC CCGGAAAATT TCATCACCTC CAGCGGCTTA
GGCACCATGG GTTTTGGTTT ACCGGCGGCG GTTGGCGCAC AAGTCGCGCG ACCGAACGAT
ACCGTCGTCT GTATCTCCGG TGACGGCTCT TTCATGATGA ATGTGCAAGA GCTGGGCACC
GTAAAACGCA AGCAGTTACC GTTGAAAATC GTCTTACTCG ATAACCAACG GTTAGGGATG
GTTCGACAAT GGCAGCAACT GTTTTTCCAG GAACGATATA GCGAAACCAC CCTTACCGAT
AATCCCGATT TCCTCATGTT AGCCAGCGCC TTCGGCATCC CTGGCCAACA CATCACCCGT
AAAGACCAGG TTGAAGCGGC ACTCGACACC ATGCTGAACA GTGATGGGCC ATACCTGCTT
CATGTCTCAA TCGACGAACT TGAGAACGTC TGGCCGCTGG TGCCGCCAGG TGCCAGTAAT
TCAGAAATGT TGGAGAAATT ATCATGA
 
Protein sequence
MNGAQWVVHA LRAQGVNTVF GYPGGAIMPV YDALYDGGME HLLCRHEQGA AMAAIGYARA 
TGKTGVCIAT SGPGATNLIT GLADALLDSI PVVAITGQVS APFIGTDAFQ EVDILGLSLA
CTKHSFLVQS LEELPRIMAE AFDVASSGRP GPVLVDIPKD IQLASGDLEP WFTTVENEVT
FPHAEVEQAR QMLAKAQKPM LYVGGGVGMA QAVPALREFL ATTKMPATCT LKGLGAVEAD
YPYYLGMLGM HGTKAANFAV QECDLLIAVG ARFDDRVTGK LNTFAPHASV IHMDIDPAEM
NKLRQAHAAL QGDLNALLPA LQQPLNIDDW QQHCAQLRDE HAWRYDHPGD AIYAPLLLKQ
LSDRKPADCI VTTDVGQHQM WSAQHIVHTR PENFITSSGL GTMGFGLPAA VGAQVARPND
TVVCISGDGS FMMNVQELGT VKRKQLPLKI VLLDNQRLGM VRQWQQLFFQ ERYSETTLTD
NPDFLMLASA FGIPGQHITR KDQVEAALDT MLNSDGPYLL HVSIDELENV WPLVPPGASN
SEMLEKLS