Gene Information |
Locus tag | SeD_A0692 |
Symbol | entE |
ID | 6875128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 694860 |
End bp | 696470 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642783904 |
Product | enterobactin synthase subunit E |
Protein accession | YP_002214590 |
Protein GI | 198245029 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
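The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon; the function names are illustrative and not part of the source database.

```python
# Values copied from the Gene Information table above.
START_BP = 694860
END_BP = 696470
GENE_LENGTH_BP = 1611
PROTEIN_LENGTH_AA = 536


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print("gene length (bp):", length)
    print("matches record:", length == GENE_LENGTH_BP)
    print("expected protein length (aa):", expected_protein_length(length))
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.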
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 0.0785305 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATAC CTTTCACCCG TTGGCCGGAT GAATTTGCCC GCCGTTATCG TGAAAAAGGC TACTGGCAAG ACGTGCCGTT GACCGATATT CTGACCCGCC ACGCTGATAG CGACAAGACG GCGGTCATTG AAGGCGAGCG CGCATTCAGC TATCGCCAGC TCAACCAGGC CGCAGATAAT TTGGCCTGCT ATTTACGCCA TCAGGGCATC AAACCTGGCG AAACCGCTCT GGTGCAACTG GGGAATGTAC CGGAACTGTA TATCACATTT TTCGCCCTGT TGAAGCTTGG CGTAGCGCCC GTTCTGGCGC TGTTTAGCCA TCAACGTACC GAACTTAACG CCTATGCGAT GCAGATCGCG CCGACGCTGG TGATTGCCGA TCGCCAACAT ACGCTGTTCG CCGGGGAGGA CTTTCTCAAC ACGTTTGTGG CTGAACACCG CTCTGTGCGG GTGGTGTTAT TGCGCAACGA CGACGGCGAT CACAGCCTGG ACGCGGCGAT GCGGCAGGCG GCGGAAGATT TCACCGCCAC GCCATCACCT GCTGACGAAG TGGCCTACTT TCAGCTTTCC GGCGGTACTA CCGGCACGCC AAAGCTTATT CCCCGTACCC ATAACGACTA TTACTACAGC GTGCGCCGCA GCAATGAGAT TTGCGGTTTC AACGAGGAGA CGCGTTTTCT GTGCGCGATT CCCGCCGCGC ATAACTACGC CATGAGTTCG CCGGGCGCGC TGGGCGTTTT TCTCGCCAAA GGAACGGTAG TGCTGACGAC CGATCCTGGC GCCACGCTCT GTTTCCCGCT GATCGAAAAA CACCAGATTA ATGCCACGGC GCTGGTGCCG CCCGCGGTCA GTCTATGGCT ACAGGCTATC CAGGAGTGGG GCGGCAATGC GCCGCTGGCG TCATTAAGGT TATTGCAGGT TGGCGGCGCG CGGCTTTCTG CGACGCTGGC CGCCCGTATT CCGCCTGAAA TTGGCTGTCA GTTGCAGCAG GTCTTCGGCA TGGCGGAAGG GTTAGTGAAC TATACCCGGC TGGACGATAG TCCGGAACGG ATTATCAATA CCCAGGGAAG ACCCATGTGT CCGGACGACG AAGTGTGGGT GGCGGATGCC GACGGGAATC CACTGCCGCC GGGCGAGATT GGTCGTCTGA TGACGCGCGG CCCCTATACT TTTCGCGGCT ATTTCAACAG TCCGCTACAC AATGCCAGCG CCTTTGACGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCCATTGAT CAAGACGGTT ACATCACCGT TCACGGGCGT GAAAAAGATC AGATCAATCG GGGCGGCGAG AAGATAGCCG CCGAAGAGAT AGAAAACCTG TTACTACGCC ACCCGGCGGT GATCCATGCG GCGCTGGTCA GCATGGAAGA TGAACTGCTG GGGGAAAAAA GTTGCGCATA TCTGGTGGTA AAAGAGCCGC TGCGAGCGGT ACAGGTTCGC CGTTTCCTGC GAGAACAGGG CGTGGCGGAA TTTAAATTAC CGGATCGCGT GGAGTGCGTT GCGTCACTGC CGCTGACGCC GGTTGGTAAA GTCGATAAAA AACAATTACG CCAGCGGTTG GCGTCACGTT CACCGCTCTG A
|
Protein sequence | MRIPFTRWPD EFARRYREKG YWQDVPLTDI LTRHADSDKT AVIEGERAFS YRQLNQAADN LACYLRHQGI KPGETALVQL GNVPELYITF FALLKLGVAP VLALFSHQRT ELNAYAMQIA PTLVIADRQH TLFAGEDFLN TFVAEHRSVR VVLLRNDDGD HSLDAAMRQA AEDFTATPSP ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSNEICGF NEETRFLCAI PAAHNYAMSS PGALGVFLAK GTVVLTTDPG ATLCFPLIEK HQINATALVP PAVSLWLQAI QEWGGNAPLA SLRLLQVGGA RLSATLAARI PPEIGCQLQQ VFGMAEGLVN YTRLDDSPER IINTQGRPMC PDDEVWVADA DGNPLPPGEI GRLMTRGPYT FRGYFNSPLH NASAFDANGF YCSGDLISID QDGYITVHGR EKDQINRGGE KIAAEEIENL LLRHPAVIHA ALVSMEDELL GEKSCAYLVV KEPLRAVQVR RFLREQGVAE FKLPDRVECV ASLPLTPVGK VDKKQLRQRL ASRSPL
|
| |
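The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one way to do that, to compute GC content, and to check that a CDS ends with a stop codon. It is an illustration only: the helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA).

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def ends_with_stop(text: str) -> bool:
    """True if the sequence length is a multiple of 3 and the last codon is a stop codon."""
    seq = clean_sequence(text)
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # Paste the full "Gene sequence" field here; only the first and last
    # blocks of the record are used in this short demonstration.
    fragment = "ATGCGTATAC CACCGCTCTG A"
    print(clean_sequence(fragment))
    print(ends_with_stop(fragment))
```

Running `gc_percent` on the full gene sequence should give a value close to the GC content field of the record, although the exact rounding used by the source database is not stated.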