Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_3583 |
Symbol | |
ID | 6065434 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | + |
Start bp | 3916052 |
End bp | 3917623 |
Gene Length | 1572 bp |
Protein Length | 523 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641603000 |
Product | 2-isopropylmalate synthase |
Protein accession | YP_001726524 |
Protein GI | 170021570 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0119] Isopropylmalate/homocitrate/citramalate synthases |
TIGRFAM ID | [TIGR00973] 2-isopropylmalate synthase, bacterial type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.000384081 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCCAGC AAGTCATTAT TTTCGATACC ACATTGCGCG ACGGTGAACA GGCGTTACAG GCAAGCTTGA GTGTGAAAGA AAAACTGCAA ATTGCGCTGG CCCTTGAGCG TATGGGTGTT GACGTGATGG AAGTCGGGTT CCCCGTTTCT TCGCCGGGTG ATTTTGAATC AGTGCAAACC ATCGCTCGCC AGGTTAAAAA CAGCCGCGTA TGCGCGTTAG CTCGCTGCGT GGAAAAAGAT ATCGACGTGG CGGCCGAATC CCTGAAAGTC GCCGAAGCCT TCCGTATTCA TACCTTTATT GCCACTTCGC CAATGCACAT CGCCACCAAG CTGCGCAGCA CGCTGGACGA GGTGATCGAA CGCGCTATCT ATATGGTGAA ACGCGCCCGT AATTACACCG ATGATGTTGA ATTTTCTTGC GAAGATGCCG GGCGTACACC CATTGCCGAT CTGGCGCGAG TGGTCGAAGC GGCGATTAAT GCCGGTGCCA CCACCATCAA CATTCCGGAC ACCGTGGGCT ACACCATGCC GTTTGAGTTC GCCGGAATCA TCAGCGGCCT GTATGAACGC GTGCCTAACA TCGACAAAGC CATTATCTCC GTACATACCC ACGACGATTT GGGCCTGGCG GTCGGAAACT CACTGGCGGC GGTACATGCC GGTGCACGCC AGGTGGAAGG CGCAATGAAC GGGATCGGCG AGCGTGCCGG AAACTGTTCC CTGGAAGAAG TCATCATGGC GATCAAAGTT CGTAAGGATA TTCTCAACGT CCACACCGCC ATTAATCACC AGGAGATATG GCGCACCAGC CAGTTAGTTA GCCAGATTTG TAATATGCCG ATCCCGGCAA ACAAAGCCAT TGTTGGCAGC GGCGCATTCG CACACTCCTC CGGTATCCAC CAGGATGGCG TGCTGAAAAA CCGCGAAAAC TACGAAATCA TGACACCAGA ATCTATTGGT CTGAACCAAA TCCAGCTGAA TCTGACCTCT CGTTCGGGGC GTGCGGCGGT GAAACATCGC ATGGATGAGA TGGGGTATAA AGAAAGTGAA TATAATTTAG ACAATTTGTA CGACGCTTTC CTGAAGCTGG CGGACAAAAA AGGTCAGGTG TTTGATTACG ATCTGGAGGC GCTGGCCTTC ATCGGTAAGC AGCAAGAAGA GCCGGAGCAT TTCCGTCTGG ATTACTTCAG CGTGCAGTCT GGCTCTAACG ATATCGCCAC CGCCGCCGTC AAACTGGCCT GCGGCGAAGA AGTCAAAGCA GAAGCCGCCA ACGGTAACGG TCCGGTCGAT GCCGTCTATC AGGCGATAAA CCGCATCACT GACTATAACG TCGAACTGGT GAAATACAGC CTGACTGCTA AAGGTCACGG TAAAGATGCT CTGGGTCAGG TGGATATTGT CGCTAACTAC AACGGTCGCC GCTTCCACGG CGTCGGCCTG GCCACCGATA TTGTCGAGTC CTCCGCCAAA GCCATGGTGC ACGTACTGAA CAATATCTGG CGCGCCGCAG AAGTCGAAAA AGAGTTGCAA CGCAAAGCTC AACACAACGA AAACAACAAG GAAACCGTGT GA
|
Protein sequence | MSQQVIIFDT TLRDGEQALQ ASLSVKEKLQ IALALERMGV DVMEVGFPVS SPGDFESVQT IARQVKNSRV CALARCVEKD IDVAAESLKV AEAFRIHTFI ATSPMHIATK LRSTLDEVIE RAIYMVKRAR NYTDDVEFSC EDAGRTPIAD LARVVEAAIN AGATTINIPD TVGYTMPFEF AGIISGLYER VPNIDKAIIS VHTHDDLGLA VGNSLAAVHA GARQVEGAMN GIGERAGNCS LEEVIMAIKV RKDILNVHTA INHQEIWRTS QLVSQICNMP IPANKAIVGS GAFAHSSGIH QDGVLKNREN YEIMTPESIG LNQIQLNLTS RSGRAAVKHR MDEMGYKESE YNLDNLYDAF LKLADKKGQV FDYDLEALAF IGKQQEEPEH FRLDYFSVQS GSNDIATAAV KLACGEEVKA EAANGNGPVD AVYQAINRIT DYNVELVKYS LTAKGHGKDA LGQVDIVANY NGRRFHGVGL ATDIVESSAK AMVHVLNNIW RAAEVEKELQ RKAQHNENNK ETV
|
| |