Gene EcHS_A0079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A0079 
SymbolleuA 
ID5592813 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp83713 
End bp85284 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content53% 
IMG OID640919267 
Product2-isopropylmalate synthase 
Protein accessionYP_001456862 
Protein GI157159544 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00973] 2-isopropylmalate synthase, bacterial type 


Plasmid Coverage information

Num covering plasmid clones49 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCCAGC AAGTCATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG 
GCAAGCTTGA GTGTGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT
GACGTGATGG AAGTCGGTTT CCCCGTCTCT TCGCCGGGCG ATTTTGAATC GGTGCAAACC
ATCGCCCGCC AGGTTAAAAA CAGCCGCGTA TGTGCGTTAG CTCGCTGCGT GGAGAAAGAT
ATCGACGTGG CGGCCGAATC CCTGAAAGTC GCCGAAGCCT TCCGTATTCA TACCTTTATT
GCCACTTCGC CAATGCACAT CGCCACCAAG CTGCGCAGCA CGCTGGATGA AGTGATCGAA
CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC
GAAGATGCCG GGCGTACACC CATTGCCGAT CTGGCGCGAG TGGTCGAAGC GGCGATTAAT
GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC
GCCGGAATCA TCAGCGGCCT GTATGAACGC GTACCTAACA TCGACAAAGC CATTATTTCC
GTACATACCC ACGACGATTT AGGCCTGGCG GTCGGCAACT CACTGGCGGC GGTACATGCC
GGTGCACGCC AGGTGGAAGG TGCAATGAAC GGGATCGGCG AGCGTGCCGG TAACTGTTCG
CTGGAAGAAG TCATCATGGC GATTAAAGTT CGTAAGGATA TTCTCAACGT CCACACCGCC
ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG
ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CTCACTCCTC CGGTATCCAC
CAGGATGGCG TGCTGAAAAA CCGCGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT
CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC
ATGGATGAGA TGGGGTATAA AGAAAGTGAA TATAATTTAG ACAATTTGTA CGATGCTTTC
CTGAAGCTGG CGGACAAAAA AGGTCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC
ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCT
GGCTCTAACG ATATCGCCAC CGCCGCCGTC AAACTGGCCT GTGGCGAAGA AGTCAAAGCA
GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTATC AGGCAATTAA CCGCATCACT
GACTATAACG TCGAACTGGT GAAATACAGC CTGACTGCTA AAGGTCACGG TAAAGATGCT
CTGGGTCAGG TGGATATTGT CGCTAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG
GCCACCGATA TTGTCGAGTC CTCCGCCAAA GCCATGGTGC ACGTACTGAA CAATATCTGG
CGCGCCGCAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG
GAAACCGTGT GA
 
Protein sequence
MSQQVIIFDT TLRDGEQALQ ASLSVKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT 
IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE
RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF
AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS
LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH
QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF
LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA
EAANGNGPVD AVYQAINRIT DYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL
ATDIVESSAK AMVHVLNNIW RAAEVEKELQ RKAQHNENNK ETV