Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A0649 |
Symbol | entE |
ID | 6483468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 661848 |
End bp | 663458 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 642736064 |
Product | enterobactin synthase subunit E |
Protein accession | YP_002039837 |
Protein GI | 194444600 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1021] Peptide arylation enzymes |
TIGRFAM ID | [TIGR02275] 2,3-dihydroxybenzoate-AMP ligase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.199158 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 82 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTATAC CTTTCACCCG TTGGCCGGAT GAATTTGCCC GCCGTTATCG TGAAAAAGGC TACTGGCAAG ACGTGCCGTT GACCGATATT CTGACCCGCC ACGCTGACAG CGACAAGACG GCGGTCATTG AAGGCGAGCG CGCATTCAGC TATCGCCAGC TCAACCAGGC CGCAGATAAT CTGGCCTGCA GTTTACGCCG TCAGGGCATC AAACCTGGCG AAACCGCTCT GGTACAACTG GGGAATGTAC CGGAACTGTA TATCACCTTT TTCGCCCTGT TGAAGCTTGG CGTAGCGCCC GTTCTGGCGC TGTTTAGCCA TCAACGTACC GAACTTAACG CCTATGCGAT GCAGATCGCG CCGACGCTGG TGATTGCCGA TCGCCAACAT ACGCTGTTCG CCGGGGAGGA CTTTCTCAAC ACGTTTGTGG CTGAACACCG CTCTGTGCGG GTGGTGTTAT TGCGCAACGA CGACGGCGAT CACAGCCTGG ACGCGGCGAT GCGGCAGGCG GCGGAAGATT TCACCGCCAC GCCATCACCT GCTGACGAAG TGGCCTACTT TCAGCTTTCC GGCGGTACTA CCGGCACGCC AAAGCTTATT CCCCGTACCC ATAACGACTA TTACTACAGC GTGCGCCGCA GCAATGAGAT TTGCGGTTTC AACGAGGAGA CGCGTTTTCT GTGCGCGATT CCCGCCGCGC ATAACTACGC CATGAGTTCG CCGGGCGCGC TGGGCGTCTT TCTTGCCAAA GGAACGGTAG TGCTGGCGAC CGATCCCAGC GCCACGCTCT GTTTCCCGCT GATCGAAAAA CACCAGATTA ATGCCACGGC GCTGGTGCCT CCCGCGGTCA GTCTATGGCT ACAGGCTATC CAGGAGTGGG GCGGTAATGC GCCGCTGGCG TCATTAAGGT TATTGCAGGT TGGCGGCGCG CGGCTTTCTG CGACGCTGGC CGCCCGTATT CCGGCTGAAA TTGGCTGTCA GTTGCAGCAG GTCTTCGGCA TGGCGGAAGG GTTAGTGAAC TATACCCGGC TGGACGATAG TCCGGAACGG ATTATCAATA CCCAGGGAAG ACCCATGTGT CCGGACGACG AAGTGTGGGT GGCGGATGCT GACGGGAATC CACTGCCGCC GGGCGAGATT GGTCGTCTGA TGACGCGCGG CCCCTATACT TTTCGCGGCT ATTTCAACAG TCCGCAACAC AATGCCAGCG CCTTTGACGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCCATTGAT CAAGACGGCT ACATCACCGT TCACGGGCGT GAAAAAGATC AGATCAATCG GGGCGGCGAG AAGATAGCCG CCGAAGAGAT AGAAAACCTG TTACTGCGCC ACCCGGCGGT GATCCATGCG GCGCTGGTCA GCATGGAAGA TGAACTGCTG GGGGAAAAAA GTTGCGCATA TCTGGTGGTA AAAGAGCCGC TGCGAGCGGT ACAGGTACGC CGTTTCCTGC GAGAGCAGGG CGTGGCGGAA TTTAAATTAC CGGATCGCGT GGAGTGCGTT GCGTCACTGC CGCTGACGCC GGTTGGTAAA GTCGATAAAA AACAATTACG CCAGCGGTTG GCGTCACGTT CACCGCTCTG A
|
Protein sequence | MRIPFTRWPD EFARRYREKG YWQDVPLTDI LTRHADSDKT AVIEGERAFS YRQLNQAADN LACSLRRQGI KPGETALVQL GNVPELYITF FALLKLGVAP VLALFSHQRT ELNAYAMQIA PTLVIADRQH TLFAGEDFLN TFVAEHRSVR VVLLRNDDGD HSLDAAMRQA AEDFTATPSP ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSNEICGF NEETRFLCAI PAAHNYAMSS PGALGVFLAK GTVVLATDPS ATLCFPLIEK HQINATALVP PAVSLWLQAI QEWGGNAPLA SLRLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSPER IINTQGRPMC PDDEVWVADA DGNPLPPGEI GRLMTRGPYT FRGYFNSPQH NASAFDANGF YCSGDLISID QDGYITVHGR EKDQINRGGE KIAAEEIENL LLRHPAVIHA ALVSMEDELL GEKSCAYLVV KEPLRAVQVR RFLREQGVAE FKLPDRVECV ASLPLTPVGK VDKKQLRQRL ASRSPL
|
| |