Gene ECH74115_3406 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3406 
SymbolmenD 
ID6971835 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3149464 
End bp3151134 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content58% 
IMG OID643387214 
Product2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene-1-carboxylate synthase 
Protein accessionYP_002271677 
Protein GI209399910 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1165] 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase 
TIGRFAM ID[TIGR00173] 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylic-acid synthase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0925414 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones72 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAA GCGCATTTAA CCGACGCTGG GCGGCGGTCA TTCTGGAAGC ATTAACGCGT 
CACGGCGTAA GACACATCTG CATCGCCCCT GGCTCGCGTT CTACACCGTT AACGTTAGCG
GCGGCGGAGA ATTCCGCATT CATTCACCAC ACCCATTTCG ATGAGCGTGG ACTGGGGCAT
CTGGCGCTGG GGCTGGCGAA AGTCAGCAAG CAGCCGGTGG CGGTGATTGT GACCTCCGGC
ACGGCGGTAG CAAATCTCTA TCCGGCACTG ATTGAAGCTG GGTTAACCGG AGAAAAACTG
ATCCTGTTAA CCGCCGATCG CCCGCCGGAG CTAATTGACT GCGGCGCGAA TCAGGCGATT
CGTCAGCCGG GAATGTTCGC CTCTCACCCC ACGCACAGTA TTTCACTGCC GCGCCCGACC
CAGGATATCC CCGCACGTTG GCTGGTTTCT ACCATCGACC ATGCTCTCGG TACGCTTCAT
GCTGGTGGGG TCCATATCAA CTGCCCGTTT GCTGAACCGC TGTATGGCGA GATGGACGAC
ACTGGGCTTA GCTGGCAACA GCGTCTGGGT GACTGGTGGC AGGATGATAA GCCGTGGCTG
CGTGAAGCGC CTCGTCTTGA GAGCGAAAAA CAGCGCGACT GGTTCTTCTG GCGGCAAAAG
CGCGGCGTGG TGGTTGCCGG GCGCATGAGT GCGGAAGAGG GCAAAAAAGT TGCCCTGTGG
GCGCAAACTC TTGGCTGGCC GCTGATTGGC GACGTGCTGT CGCAAACCGG ACAGCCGCTG
CCGTGTGCCG ATCTCTGGTT AGGCAATGCC AAAGCGACCA GCGAACTGCA ACAGGCGCAA
ATTGTGGTGC AACTGGGAAG CAGCCTGACG GGCAAACGGC TCCTGCAATG GCAGGCAAGC
TGTGAACCCG AAGAGTACTG GATTGTGGAT GACATCGAAG GGCGGCTTGA TCCGGCACAC
CATCGTGGAC GCCGCTTAAT TGCCAATATT GCCGACTGGC TGGAACTGCA TCCGGCAGAA
AAACGCCAGC CCTGGTGCGT TGAAATCCCG CGCCTGGCGG AACAGGCAAT GCAGGCGGTT
ATTGCCTGTC GCGATGCGTT TGGCGAAGCG CAACTGGCGC ATCGCATCAG CGACTATCTG
CCTGAACAGG GGCAATTGTT TGTCGGTAAC AGCCTGGTGG TACGTCTGAT TGATGCGCTT
TCGCAACTTC CGGCAGGTTA CCCGGTGTAC AGCAACCGTG GGGCCAGCGG TATCGACGGG
CTGCTCTCGA CTGCCGCCGG CGTTCAGCGG GCAAGCGGCA AACCGACGCT GGCGATTGTG
GGCGATCTCT CCGCACTTTA CGACCTGAAT GCGCTGGCGT TATTGCGCCA GGTTTCCGCG
CCGCTGGTAT TAATTGTGGT GAACAACAAC GGCGGGCAAA TTTTCTCGCT GCTGCCAACG
CCGAAAAGCG AGCGCGAGCG TTTCTATCTG ATGCCGCAAA ACGTCCATTT TGAGCACGCT
GCCGCGATGT TTGAGCTGAA ATATCATCGT CCGCAAAACT GGCAGGAACT TGAAACGGTA
CTTGCCGACG CCTGGCGTAC TCCGACCACC ACGGTAATTG AAATGGTGGT TAACGACACC
GACGGCGCGC AAACGCTCCA GCAGCTGCTG GCGCAGGTAA GCCATTTATG A
 
Protein sequence
MSVSAFNRRW AAVILEALTR HGVRHICIAP GSRSTPLTLA AAENSAFIHH THFDERGLGH 
LALGLAKVSK QPVAVIVTSG TAVANLYPAL IEAGLTGEKL ILLTADRPPE LIDCGANQAI
RQPGMFASHP THSISLPRPT QDIPARWLVS TIDHALGTLH AGGVHINCPF AEPLYGEMDD
TGLSWQQRLG DWWQDDKPWL REAPRLESEK QRDWFFWRQK RGVVVAGRMS AEEGKKVALW
AQTLGWPLIG DVLSQTGQPL PCADLWLGNA KATSELQQAQ IVVQLGSSLT GKRLLQWQAS
CEPEEYWIVD DIEGRLDPAH HRGRRLIANI ADWLELHPAE KRQPWCVEIP RLAEQAMQAV
IACRDAFGEA QLAHRISDYL PEQGQLFVGN SLVVRLIDAL SQLPAGYPVY SNRGASGIDG
LLSTAAGVQR ASGKPTLAIV GDLSALYDLN ALALLRQVSA PLVLIVVNNN GGQIFSLLPT
PKSERERFYL MPQNVHFEHA AAMFELKYHR PQNWQELETV LADAWRTPTT TVIEMVVNDT
DGAQTLQQLL AQVSHL