Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0614 |
Symbol | entE |
ID | 6143426 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 626994 |
End bp | 628604 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641615506 |
Product | enterobactin synthase subunit E |
Protein accession | YP_001742712 |
Protein GI | 170681611 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACCCGTC ATGCCGCAAG CGACAGCATC GCGGTTATCG ACGGCGAGCG GCAGTTGAGT TATCGGGAGC TGAATCAGGC GGCGGATAAC CTCGCGTGTA GTTTGCGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAGCTG GGTAACGTCG CTGAACTTTA CATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG GTGCTGGCAC TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTACGCCAG CCAGATTGAA CCGGCGTTAC TGATTGCCGA TCGCCAGCAT GCGCTGTTTA GCGGAGATGA TTTCCTCAAC ACATTTGTTG CAGAGCATTC TTCCATTCGC GTGGTGCAGC TGCTCAACGA CAGCGGTGAG CATAACTTGC AGGATGCGAT TAACCATCCG GCTGAGGATT TTACTGCCAC GCCATCTCCT GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGCGGCACCA CCGGCACTCC GAAACTGATC CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC ACACAACAGA CGCGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCA CCGGGATCGC TGGGCGTCTT TCTCGCTGGC GGCACTGTCG TTCTGGCTGC CGACCCCAGC GCCACGCTTT GCTTCCCATT GATTGAAAAA CATCAGGTGA ACGTCACCGC GCTGGTGCCG CCAGCAGTCA GCCTGTGGTT GCAGGCACTG GCTGAAGGCG AAAGCCGGGC GCAGCTTGCC TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCTG CCACGCTTGC GGCGCGTATT CCCGCTGAGA TTGGCTGCCA GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC TACACCCGTC TTGATGATAG TGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGC CCGGACGATG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTT GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGTGGCT ATTACAAAAG CCCGCAGCAC AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ACCCGGCGGT GATCTACGCC GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCA CATCAGCCTG A
|
Protein sequence | MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE PALLIADRQH ALFSGDDFLN TFVAEHSSIR VVQLLNDSGE HNLQDAINHP AEDFTATPSP ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL AEGESRAQLA SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRTSA
|
| |