Gene ECH74115_0679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0679 
SymbolentE 
ID6968996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp710815 
End bp712425 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content56% 
IMG OID643384715 
Productenterobactin synthase subunit E 
Protein accessionYP_002269228 
Protein GI209396204 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1021] Peptide arylation enzymes 
TIGRFAM ID[TIGR02275] 2,3-dihydroxybenzoate-AMP ligase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.137603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones84 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC 
TACTGGCAGG ATTTGCCACT GACTGACATT CTGACTCGCC ACGCTGCGAG TGACAGCATC
GCGGTTATCG ACGGCGAGCG ACAGTTGAGT TACCGGGAGC TGAATCAGGC GGCGGATAAC
CTCGCGTGTA GTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG
GGTAACGTCG CTGAACTTTA CATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG
GTGCTGGCGT TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA
CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGCTGTTTA GCGGGGATGA TTTCCTCAAC
ACATTTGTTG CAGAGCATTC TTCCATTCGC GTGGTGCAGC TACTCAACGA CAGCGGTGAG
CATAACTTGC AGGATGCGAT TAACCATCCG GCAGACGGTT TTACTGCTAC GCCGTCACCT
GCTGATGAAG TGGTCTATTT CCAGCTTTCC GGCGGCACCA CCGGTACACC GAAGCTGATC
CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC
ACACAACAGA CACGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG
CCGGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC
GCCACGCTTT GCTTCCCATT GATTGAAAAA CATCAGATTA ACGTTACCGC GCTGGTGCCG
CCGGCAGTCA GCCTGTGGTT GCAGGCGCTG ACCGAAGGCG AAAGCCGGGC GCAGCTTGCC
TCGCTGAAAC TGTTACAGGT CGGTGGCGCA CGTCTTTCTG CCACGCTTGC GGCGCGTATT
CCCGCTGAGA TTGGTTGCCA GTTGCAGCAG GTGTTTGGTA TGGCGGAAGG GCTGGTGAAC
TATACCCGTC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT
CCGGATGACG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTC
GGACGACTGA TGACGCGCGG GCCGTATACC TTCCGTGGCT ATTACAAAAG CCCGCAGCAC
AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT
CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATCAACCG TGGCGGCGAG
AAGATCGCTG CCGAAGAGAT CGAAAACCTG TTACTGCGCC ATCCGGCGGT GATCTACGCC
GCACTGGTCA GTATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGTGCTTA TCTGGTGGTA
AAAGAGCCGC TTCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA
TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA
GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
 
Protein sequence
MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN 
LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE
PALLIADRQH ALFSGDDFLN TFVAEHSSIR VVQLLNDSGE HNLQDAINHP ADGFTATPSP
ADEVVYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS
PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQINVTALVP PAVSLWLQAL TEGESRAQLA
SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC
PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID
PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV
KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRASA