Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0613 |
Symbol | entC |
ID | 6146295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 625809 |
End bp | 626984 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615505 |
Product | isochorismate synthase, entC |
Protein accession | YP_001742711 |
Protein GI | 170684126 |
COG category | [H] Coenzyme transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1169] Isochorismate synthase |
TIGRFAM ID | [TIGR00543] isochorismate synthases |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.900668 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATACGT CACTGGCTGA GGAAGTACAG CAGACCATGG CAACACTTGC GCCCAATCGC TTTTTCTTTA TGTCGCCGTA CCGCAGTTTT ACGACGTCAG GATGTTTCGC CCGCTTCGAT GAACCGGCTG TGAACGGGGA TTCGCCCGAC AGTCCCTTCC AGCAAAAACT CGCCGCGTTG TTTGCCGATG CCAAAGCGCA GGGCATCAAA AATCCGGTGA TGGTCGGGGC GATTCCCTTC GATCCACGTC AGCCTTCGTC GCTGTATATT CCCGAATCCT GGCTGTCGTT CTCCCGTCAG GAAAAACAAG CTTCCGCACG CCGTTTCACC CGCAGCCAGT CGCTGAACGT GGTGGAACGT CAGGCAATTC CGCAACAAGC CACGTTTGAA CAGATGGTTG CCCGCGCCGC CGCACTTACC GCCACGCCGC AGGTCGACAA AGTGGTGTTG TCACGGTTGA TTGATATCAC CACTGACGCC GCCATTGATA GTGGCGTATT GCTGGAACGG TTGATTGCAC AAAACCCGGT TAGTTACAAC TTCCATGTCC CGCTGGCTGA TGGTGGCGTT CTGCTGGGGG CCAGCCCGGA GCTGCTGCTA CGTAAAGACG GCGAGCGTTT TAGCTCCATT CCGTTAGCCG GTTCCGCGCG TCGTCAGCCG GATGAAGTGC TCGATCGCGA AGCGGGTAAT CGTCTGTTGG CCTCAGAAAA AGATCGTCAT GAACATGAAT TGGTGACTCA GGCGATGAAA GAGGTACTGC GAGAACGCAG CAGTGAGTTA AACGTTCCTT CCTCTCCACA ATTGATTACC ACGCCGACGC TGTGGCATCT CGCAACTCCC TTTGAAGGTA AAGCGAATTC GCAAGAAAAC GCACTGACTC TGGCCTGTCT GCTGCATCCG ACCCCCGCGC TGAGCGGTTT CCCGCATCAG GCCGCGACCC AGGTTATTGC TGAACTGGAG CCGTTCGACC GCGAACTGTT TGGCGGCATT GTGGGTTGGT GTGACAGCGA AGGTAACGGC GAATGGGTTG TGACCATCCG CTGCGCGAAG CTGCGGGAAA ATCAGGTGCG TCTGTTTGCC GGAGCGGGGA TTGTGCCTGC GTCGTCACCG TTGGGTGAGT GGCGCGAAAC GGGCGTCAAA CTTTCTACCA TGTTGAACGT TTTTGGATTG CATTAA
|
Protein sequence | MDTSLAEEVQ QTMATLAPNR FFFMSPYRSF TTSGCFARFD EPAVNGDSPD SPFQQKLAAL FADAKAQGIK NPVMVGAIPF DPRQPSSLYI PESWLSFSRQ EKQASARRFT RSQSLNVVER QAIPQQATFE QMVARAAALT ATPQVDKVVL SRLIDITTDA AIDSGVLLER LIAQNPVSYN FHVPLADGGV LLGASPELLL RKDGERFSSI PLAGSARRQP DEVLDREAGN RLLASEKDRH EHELVTQAMK EVLRERSSEL NVPSSPQLIT TPTLWHLATP FEGKANSQEN ALTLACLLHP TPALSGFPHQ AATQVIAELE PFDRELFGGI VGWCDSEGNG EWVVTIRCAK LRENQVRLFA GAGIVPASSP LGEWRETGVK LSTMLNVFGL H
|
| |