Gene EcolC_3583 details

Gene Information

Locus tag: EcolC_3583
Symbol:
ID: 6065434
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3916052
End bp: 3917623
Gene Length: 1572 bp
Protein Length: 523 aa
Translation table: 11
GC content: 53%
IMG OID: 641603000
Product: 2-isopropylmalate synthase
Protein accession: YP_001726524
Protein GI: 170021570
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0119] Isopropylmalate/homocitrate/citramalate synthases
TIGRFAM ID: [TIGR00973] 2-isopropylmalate synthase, bacterial type
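
The coordinates and lengths listed above are consistent with one another, which a few lines of arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3916052
end_bp = 3917623
gene_length_bp = 1572
protein_length_aa = 523

# Assumption: coordinates are 1-based and inclusive,
# so the span covered is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon
# that is not counted in the protein length.
codons = gene_length_bp // 3
residues = codons - 1

print(span == gene_length_bp)         # coordinate span matches gene length
print(gene_length_bp % 3 == 0)        # CDS is a whole number of codons
print(residues == protein_length_aa)  # codons minus stop matches protein length
```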


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000384081
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCAGC AAGTCATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG 
GCAAGCTTGA GTGTGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT
GACGTGATGG AAGTCGGGTT CCCCGTTTCT TCGCCGGGTG ATTTTGAATC AGTGCAAACC
ATCGCTCGCC AGGTTAAAAA CAGCCGCGTA TGCGCGTTAG CTCGCTGCGT GGAAAAAGAT
ATCGACGTGG CGGCCGAATC CCTGAAAGTC GCCGAAGCCT TCCGTATTCA TACCTTTATT
GCCACTTCGC CAATGCACAT CGCCACCAAG CTGCGCAGCA CGCTGGACGA GGTGATCGAA
CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC
GAAGATGCCG GGCGTACACC CATTGCCGAT CTGGCGCGAG TGGTCGAAGC GGCGATTAAT
GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC
GCCGGAATCA TCAGCGGCCT GTATGAACGC GTGCCTAACA TCGACAAAGC CATTATCTCC
GTACATACCC ACGACGATTT GGGCCTGGCG GTCGGAAACT CACTGGCGGC GGTACATGCC
GGTGCACGCC AGGTGGAAGG CGCAATGAAC GGGATCGGCG AGCGTGCCGG AAACTGTTCC
CTGGAAGAAG TCATCATGGC GATCAAAGTT CGTAAGGATA TTCTCAACGT CCACACCGCC
ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG
ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CACACTCCTC CGGTATCCAC
CAGGATGGCG TGCTGAAAAA CCGCGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT
CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC
ATGGATGAGA TGGGGTATAA AGAAAGTGAA TATAATTTAG ACAATTTGTA CGACGCTTTC
CTGAAGCTGG CGGACAAAAA AGGTCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC
ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCT
GGCTCTAACG ATATCGCCAC CGCCGCCGTC AAACTGGCCT GCGGCGAAGA AGTCAAAGCA
GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTATC AGGCGATAAA CCGCATCACT
GACTATAACG TCGAACTGGT GAAATACAGC CTGACTGCTA AAGGTCACGG TAAAGATGCT
CTGGGTCAGG TGGATATTGT CGCTAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG
GCCACCGATA TTGTCGAGTC CTCCGCCAAA GCCATGGTGC ACGTACTGAA CAATATCTGG
CGCGCCGCAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG
GAAACCGTGT GA
 
Protein sequence
MSQQVIIFDT TLRDGEQALQ ASLSVKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT 
IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE
RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF
AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS
LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH
QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF
LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA
EAANGNGPVD AVYQAINRIT DYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL
ATDIVESSAK AMVHVLNNIW RAAEVEKELQ RKAQHNENNK ETV
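
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be cleaned, translated, and checked for GC fraction. It assumes that translation table 11 assigns codons to amino acids exactly as the standard code does (differing only in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the opening blocks of the first sequence line and the final blocks of the gene sequence are used as input here.

```python
from itertools import product

# Codon assignments in the conventional NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_blocks = "ATGAGCCAGC AAGTCATTAT TTTCGATACC"  # start of the gene sequence
last_blocks = "GAAACCGTGT GA"                      # end of the gene sequence

print(translate(first_blocks))  # MSQQVIIFDT
print(translate(last_blocks))   # ETV
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the listed protein sequence and the reported GC content.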