Gene EcSMS35_0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0079 
SymbolleuA 
ID6143537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp88592 
End bp90163 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content53% 
IMG OID641614980 
Product2-isopropylmalate synthase 
Protein accessionYP_001742196 
Protein GI170683572 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.999327 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC AAGTTATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG 
GCAAGCCTGA GTGCGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT
GACGTGATGG AAGTCGGTTT CCCCGTCTCT TCGCCGGGCG ATTTCGAATC GGTGCAGACC
ATCGCCCGCC AGGTCAAAAA CAGCCGCGTA TGTGCGTTAG CTCGCTGCGT GGAAAAGGAT
ATCGACGTGG CAGCTGAATC TCTGAAAGTT GCCGAAGCCT TCCGTATCCA TACCTTTATT
GCCACTTCAC CAATGCACAT TGCCACCAAG CTGCGTAGCA CGCTGGACGA AGTAATCGAA
CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC
GAAGATGCCG GACGCACACC CATTGCCGAT CTGGCGCGCG TGGTTGAAGC GGCGATTAAC
GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC
GCCGGAATCA TCAGCGGGCT GTATGAACGC GTGCCTAACA TCGACAAAGC CATTATCTCC
GTACATACCC ACGACGATTT AGGCCTGGCA GTTGGCAACT CACTGGCGGC GGTACATGCC
GGAGCGCGCC AGGTGGAAGG TGCAATGAAT GGGATCGGCG AGCGAGCCGG TAACTGTTCG
CTGGAAGAAG TGATCATGGC GATCAAAGTT CGTAAGGATA TTCTCAACGT TCATACCGCC
ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG
ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CACACTCCTC CGGTATCCAC
CAGGATGGTG TACTGAAAAA CCGTGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT
CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC
ATGGATGAGA TGGGATATAA AGAAAGTGAA TATAATTTAG ACAACCTGTA CGACGCTTTC
CTGAAGCTGG CGGACAAAAA AGGCCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC
ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCA
GGTTCTAACG ATATTGCCAC TGCCGCCGTC AAACTGGCCT GTGGCGAAGA AGTCAAAGCA
GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTACC AGGCGATAAA CCGCATCACT
GACTATAACG TCGAACTGGT GAAATACAGC CTGACCGCCA AAGGTCACGG TAAAGATGCG
CTGGGTCAGG TGGATATTGT CGCCAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG
GCCACCGATA TTGTCGAGTC CTCCGCCAAA GCCATGGTGC ACGTACTGAA CAATATCTGG
CGTGCCGCAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG
GAAACCGTGT GA
 
Protein sequence
MSQQVIIFDT TLRDGEQALQ ASLSAKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT 
IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE
RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF
AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS
LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH
QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF
LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA
EAANGNGPVD AVYQAINRIT DYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL
ATDIVESSAK AMVHVLNNIW RAAEVEKELQ RKAQHNENNK ETV