Gene EcSMS35_4140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4140 
SymbolilvC 
ID6146382 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4237702 
End bp4239177 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content54% 
IMG OID641618963 
Productketol-acid reductoisomerase 
Protein accessionYP_001746095 
Protein GI170683905 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0059] Ketol-acid reductoisomerase 
TIGRFAM ID[TIGR00465] ketol-acid reductoisomerase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0447832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAACT ACTTCAATAC ACTGAATCTG CGCCAGCAGC TGGCACAGCT GGGCAAATGT 
CGCTTTATGG GGCGCGATGA ATTCGCCGAT GGCGCGAGCT ACCTTCAGGG TAAAAAAGTA
GTCATCGTCG GCTGTGGCGC ACAGGGTCTG AACCAGGGCC TGAACATGCG TGATTCTGGT
CTCGATATCT CCTACGCTCT GCGTAAAGAA GCGATTGCCG AAAAGCGCGC ATCCTGGCGT
AAAGCAACCG AAAATGGTTT TAAAGTGGGT ACTTACGAAG AACTGATCCC ACAGGCGGAT
CTGGTGGTTA ACCTGACGCC GGACAAGCAG CACTCTGATG TAGTGCGCAC CGTACAGCCA
CTGATGAAGG ACGGCGCGGC GCTGGGTTAC TCGCATGGTT TCAACATCGT GGAAGTAGGC
GAGCAGATCC GTAAAGACAT CACCGTAGTA ATGGTTGCGC CGAAATGCCC TGGCACCGAA
GTGCGTGAAG AGTACAAACG TGGATTCGGC GTACCGACGC TGATTGCCGT TCACCCGGAA
AACGATCCGA AAGGCGAAGG CATGGCGATC GCTAAAGCGT GGGCGGCGGC AACCGGTGGT
CATCGTGCGG GCGTGCTGGA ATCCTCTTTC GTTGCGGAAG TGAAATCTGA CCTGATGGGC
GAGCAAACCA TCCTGTGCGG TATGTTGCAG GCTGGCTCTC TGCTGTGCTT CGACAAACTG
GTGGAAGAAG GCACCGATCC GGCATACGCA GAAAAACTGA TTCAGTTCGG CTGGGAAACC
ATCACCGAAG CGCTGAAACA GGGCGGCATC ACGCTGATGA TGGACCGTCT CTCTAACCCG
GCGAAACTGC GTGCTTACGC GCTGTCTGAA CAGCTGAAAG AGATCATGGC GCCGCTGTTC
CAGAAACACA TGGATGACAT CATCTCCGGC GAATTCTCTT CCGGCATGAT GGCAGACTGG
GCCAACGACG ATAAGAAACT GCTGACCTGG CGTGAAGAGA CCGGCAAAAC CGCGTTCGAA
ACCGCGCCGC AGTATGAAGG CAAAATCGGC GAGCAGGAGT ACTTCGATAA AGGCGTACTG
ATGATCGCGA TGGTAAAAGC AGGCGTTGAG TTGGCGTTTG AAACCATGGT TGATTCCGGC
ATCATCGAAG AGTCTGCTTA CTATGAATCA CTGCACGAAC TGCCGCTGAT TGCCAACACT
ATCGCTCGTA AGCGTCTGTA CGAAATGAAC GTGGTTATCT CTGATACCGC CGAGTACGGT
AACTATCTGT TCTCTTACGC TTGTGTGCCG CTGCTGAAAC CGTTTATGGC AGAGCTGCAA
CCGGGCGACT TGGGTAAAGC TATTCCGGAA GGTGCGGTAG ATAACGCGCA GTTGCGTGAT
GTGAACGAAG CGATTCGCAG CCATGCGATT GAGCAGGTAG GTAAGAAACT GCGCGGCTAT
ATGACGGATA TGAAACGTAT TGCTGTTGCA GGTTAA
 
Protein sequence
MANYFNTLNL RQQLAQLGKC RFMGRDEFAD GASYLQGKKV VIVGCGAQGL NQGLNMRDSG 
LDISYALRKE AIAEKRASWR KATENGFKVG TYEELIPQAD LVVNLTPDKQ HSDVVRTVQP
LMKDGAALGY SHGFNIVEVG EQIRKDITVV MVAPKCPGTE VREEYKRGFG VPTLIAVHPE
NDPKGEGMAI AKAWAAATGG HRAGVLESSF VAEVKSDLMG EQTILCGMLQ AGSLLCFDKL
VEEGTDPAYA EKLIQFGWET ITEALKQGGI TLMMDRLSNP AKLRAYALSE QLKEIMAPLF
QKHMDDIISG EFSSGMMADW ANDDKKLLTW REETGKTAFE TAPQYEGKIG EQEYFDKGVL
MIAMVKAGVE LAFETMVDSG IIEESAYYES LHELPLIANT IARKRLYEMN VVISDTAEYG
NYLFSYACVP LLKPFMAELQ PGDLGKAIPE GAVDNAQLRD VNEAIRSHAI EQVGKKLRGY
MTDMKRIAVA G