Gene ECH74115_0081 details


Gene Information

Locus tag: ECH74115_0081
Symbol: leuA
ID: 6970749
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 86546
End bp: 88117
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 53%
IMG OID: 643384159
Product: 2-isopropylmalate synthase
Protein accession: YP_002268682
Protein GI: 209398496
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00973] 2-isopropylmalate synthase, bacterial type
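
The start, end, gene length, and protein length listed above are mutually consistent, which a few lines of Python can confirm. This is a minimal sketch: the function names are hypothetical, and it assumes inclusive coordinates and a CDS whose final codon is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(86546, 88117)
print(length_bp)                  # 1572
print(protein_length(length_bp))  # 523
```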


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 60
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAGC AAGTCATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG 
GCAAGCTTGA GTGTGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT
GACGTGATGG AAGTCGGGTT CCCCGTCTCT TCGCCGGGTG ATTTTGAATC AGTGCAAACC
ATCGCTCGCC AGGTTAAAAA CAGCCGTGTA TGCGCGTTAG CTCGCTGCGT GGAGAAAGAT
ATCGACGTGG CGGCCGAATC CCTGAAAGTC GCCGAAGCCT TCCGTATTCA TACCTTTATT
GCCACTTCGC CAATGCACAT CGCCACCAAG CTGCGCAGCA CGCTGGACGA GGTGATCGAA
CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC
GAAGATGCCG GGCGTACACC CATTGCCGAT CTGGCGCGAG TGGTCGAAGC GGCGATTAAT
GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC
GCCGGAATCA TTAGCGGCCT GTATGAACGC GTGCCTAACA TCGACAAAGC CATTATTTCC
GTACATACCC ACGACGATTT GGGCCTGGCG GTCGGCAACT CACTGGCGGC GGTACATGCC
GGTGCACGCC AGGTGGAAGG TGCAATGAAC GGGATCGGCG AGCGTGCCGG TAACTGTTCC
CTGGAAGAAG TCATCATGGC GATTAAAGTT CGTAAGGATA TTCTCAACGT CCACACCGCC
ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG
ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CACACTCCTC CGGTATCCAC
CAGGATGGCG TGCTGAAAAA CCGCGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT
CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC
ATGGATGAGA TGGGGTATAA AGAAAGTGAA TATAATTTAG ACAATTTGTA CGACGCCTTC
CTCAAGCTGG CGGACAAAAA AGGTCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC
ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCT
GGCTCTAACG ATATCGCCAC CGCCGCCGTC AAACTGGCCT GCGGCGAAGA AGTCAAAGCA
GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTATC AGGCGATTAA CCGCATCACT
GACTATAACG TCGAACTGGT GAAATACAGC CTGACCGCCA AAGGTCACGG TAAAGATGCG
CTGGGTCAGG TGGATATTGT CGCCAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG
GCCACCGATA TTGTCGAGTC CTCCGCCAAA GCCATGGTGC ACGTACTTAA CAATATCTGG
CGTGCCACAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG
GAAACCGTGT GA
 
Protein sequence
MSQQVIIFDT TLRDGEQALQ ASLSVKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT 
IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE
RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF
AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS
LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH
QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF
LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA
EAANGNGPVD AVYQAINRIT DYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL
ATDIVESSAK AMVHVLNNIW RATEVEKELQ RKAQHNENNK ETV
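
The relationship between the gene sequence and the protein sequence can be sketched in Python. The snippet is illustrative only: `clean`, `translate`, and `gc_content` are hypothetical helper names, the codon table holds the standard codon assignments (which translation table 11 shares), and only the first three and the last two blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard codon assignments, ordered TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


print(translate("ATGAGCCAGC AAGTCATTAT TTTCGATACC"))  # MSQQVIIFDT
print(translate("GAAACCGTGT GA"))                     # ETV
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared against the protein sequence and GC content listed in this record.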