Gene Information |
Locus tag | ECH74115_0679 |
Symbol | entE |
ID | 6968996 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 710815 |
End bp | 712425 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384715 |
Product | enterobactin synthase subunit E |
Protein accession | YP_002269228 |
Protein GI | 209396204 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
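The coordinate and length fields in the table above are mutually consistent, and the relationship can be sketched in a few lines. This is a minimal check, assuming the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table.
START_BP = 710815
END_BP = 712425


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # compare with "Gene Length"
print(protein_length(length_bp))  # compare with "Protein Length"
```

Under these assumptions the two printed values agree with the Gene Length (1611 bp) and Protein Length (536 aa) rows.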
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.137603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 84 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC TACTGGCAGG ATTTGCCACT GACTGACATT CTGACTCGCC ACGCTGCGAG TGACAGCATC GCGGTTATCG ACGGCGAGCG ACAGTTGAGT TACCGGGAGC TGAATCAGGC GGCGGATAAC CTCGCGTGTA GTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG GGTAACGTCG CTGAACTTTA CATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG GTGCTGGCGT TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGCTGTTTA GCGGGGATGA TTTCCTCAAC ACATTTGTTG CAGAGCATTC TTCCATTCGC GTGGTGCAGC TACTCAACGA CAGCGGTGAG CATAACTTGC AGGATGCGAT TAACCATCCG GCAGACGGTT TTACTGCTAC GCCGTCACCT GCTGATGAAG TGGTCTATTT CCAGCTTTCC GGCGGCACCA CCGGTACACC GAAGCTGATC CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC ACACAACAGA CACGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG CCGGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC GCCACGCTTT GCTTCCCATT GATTGAAAAA CATCAGATTA ACGTTACCGC GCTGGTGCCG CCGGCAGTCA GCCTGTGGTT GCAGGCGCTG ACCGAAGGCG AAAGCCGGGC GCAGCTTGCC TCGCTGAAAC TGTTACAGGT CGGTGGCGCA CGTCTTTCTG CCACGCTTGC GGCGCGTATT CCCGCTGAGA TTGGTTGCCA GTTGCAGCAG GTGTTTGGTA TGGCGGAAGG GCTGGTGAAC TATACCCGTC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT CCGGATGACG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTC GGACGACTGA TGACGCGCGG GCCGTATACC TTCCGTGGCT ATTACAAAAG CCCGCAGCAC AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG AAGATCGCTG CCGAAGAGAT CGAAAACCTG TTACTGCGCC ATCCGGCGGT GATCTACGCC GCACTGGTCA GTATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA AAAGAGCCGC TTCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
|
Protein sequence | MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE PALLIADRQH ALFSGDDFLN TFVAEHSSIR VVQLLNDSGE HNLQDAINHP ADGFTATPSP ADEVVYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQINVTALVP PAVSLWLQAL TEGESRAQLA SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRASA
|
| |
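The sequences above are printed in space-separated blocks of ten characters, so they need light cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: it strips the spacing, computes a GC fraction, and translates codons. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, whose amino acid assignments are the same as those of translation table 11; table 11 differs only in which codons may act as start codons, which this sketch does not model. For brevity only the first 30 bases of the gene sequence are used.

```python
from itertools import product

# First three 10-base blocks of the gene sequence, as printed in the record.
GENE_PREFIX = "ATGAGCATTC CATTCACCCG CTGGCCGGAA"

# Standard codon table built in TCAG order; "*" marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean_sequence(raw: str) -> str:
    """Remove whitespace used for display grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


dna = clean_sequence(GENE_PREFIX)
print(translate(dna))               # compare with the start of the protein sequence
print(round(gc_fraction(dna), 3))   # GC fraction of this 30-base prefix only
```

The translated prefix can be compared with the first block of the protein sequence (MSIPFTRWPE). The GC fraction of a 30-base prefix is not expected to match the 56% reported for the whole gene; running `gc_fraction` on the full cleaned gene sequence would be the comparable figure.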