Gene EcSMS35_2419 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2419 
SymbolmenD 
ID6144159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2466968 
End bp2468638 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content58% 
IMG OID641617292 
Product2-succinyl-5-enolpyruvyl-6-hydroxy-3- cyclohexene-1-carboxylate synthase 
Protein accessionYP_001744464 
Protein GI170683335 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG1165] 2-succinyl-6-hydroxy-2,4-cyclohexadiene-1-carboxylate synthase 
TIGRFAM ID[TIGR00173] 2-succinyl-5-enolpyruvyl-6-hydroxy-3-cyclohexene-1-carboxylic-acid synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0945731 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones64 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGTAA GCGCATTTAA CCGACGCTGG GCGGCGGTCA TTCTGGAAGC ATTAACGCGT 
CAAGGCGTCA GACACATCTG TATCGCCCCT GGCTCGCGTT CTACACCGTT AACGTTAGCG
GCGGCGGAGA ATTCCGCATT CATTCACCAC ACCCATTTCG ATGAGCGTGG ACTGGGGCAT
CTGGCGCTGG GGCTGGCGAA AGTCAGCAAG CAGCCGGTGG CGGTGATTGT GACCTCCGGC
ACGGCGGTAG CAAATCTCTA TCCGGCACTG ATTGAAGCCG GGTTAACCGG AGAAAAACTG
ATCCTGTTAA CCGCCGATCG CCCGCCGGAG CTCATTGACT GCGGCGCGAA TCAGGCAATT
CGTCAGCCGG GAATGTTCGC CTCTCACCCC ACGCACAGTA TTTCACTGCC GCGCCCGACC
CAGGATATCC CCGCACGTTG GCTGGTTTCT ACCATCGACC ATGCTCTCGG TACGCTTCAT
GCGGGCGGAG TCCATATCAA CTGCCCGTTT GCTGAACCGC TGTATGGCGA AATGGACGAC
ACCGGGCTTA GCTGGCAACA GCGTCTGGGC GACTGGTGGC AGGACGATAA GCCGTGGCTG
CGTGAAGCGC CTCGTCTGGA AAGTGAAAAA CAGCGCGACT GGTTCTTCTG GCGACAAAAG
CGCGGCGTGG TGGTTGCCGG GCGCATGAGT GCGGAAGAGG GCAAAAAAGT CGCACTGTGG
GCGCAAACTC TTGGCTGGCC GCTGATTGGC GATGTGCTGT CGCAAACCGG ACAGCCGCTG
CCGTGTGCCG ATCTCTGGTT AGGCAATGCC AAAGCGACCA GCGAACTGCA ACAGGCGCAA
ATTGTGGTGC AACTGGGAAG CAGCCTGACG GGCAAACGGC TCCTGCAATG GCAGGCAAGC
TGTGAACCCG AAGAGTACTG GATTGTTGAT GACATTGAAG GGCGACTTGA TCCGGCACAC
CATCGCGGAC GTCGCTTAAT TGCCAATATT GCCGACTGGC TGGAGCAACA TCCGGCAGAA
AAACGCCAGC CCTGGTGCGT TGAAATCCCG CGCCTGGCGG AACAGGCAAT GCAGGCGGTT
ATTGCCCGTC GCGATGCGTT TGGCGAAGCC CAACTGGCGC ATCGCATCAG CGACTATCTG
CCTGAACAGG GGCAATTGTT TGTCGGTAAC AGCCTGGTAG TACGTCTGAT TGATGCGCTT
TCGCAACTTC CGGCAGGTTA CCCGGTGTAC AGCAACCGAG GTGCCAGCGG TATCGACGGG
CTGCTCTCGA CCGCCGCTGG CGTTCAGCGG GCCAGTGGCA AACCGACGCT GGCGATTGTG
GGCGATCTCT CAGCACTTTA CGATCTCAAC GCGCTGGCGT TATTACGTCA GGTTTCCGCG
CCGCTGGTAT TAATTGTGGT GAACAACAAT GGCGGGCAAA TTTTCTCGCT GTTGCCAACG
CCGAAAAACG AGCGCGAGCG TTTCTATCTG ATGCCGCAAA ACGTCCATTT TGAGCACGCC
GCCGCGATGT TCGAGCTGAA ATATCATCGT CCGCAAAACT GGCAGGAACT TGAAACGGCA
CTTGTCGACG CCTGGCGCAC GCCGACCACC ACGGTAATTG AAATGGTGGT TAACGACACC
GATGGTGCGC AAACCCTCCA GCAACTGCTG GCGCAGGTAA GCCATTTATG A
 
Protein sequence
MSVSAFNRRW AAVILEALTR QGVRHICIAP GSRSTPLTLA AAENSAFIHH THFDERGLGH 
LALGLAKVSK QPVAVIVTSG TAVANLYPAL IEAGLTGEKL ILLTADRPPE LIDCGANQAI
RQPGMFASHP THSISLPRPT QDIPARWLVS TIDHALGTLH AGGVHINCPF AEPLYGEMDD
TGLSWQQRLG DWWQDDKPWL REAPRLESEK QRDWFFWRQK RGVVVAGRMS AEEGKKVALW
AQTLGWPLIG DVLSQTGQPL PCADLWLGNA KATSELQQAQ IVVQLGSSLT GKRLLQWQAS
CEPEEYWIVD DIEGRLDPAH HRGRRLIANI ADWLEQHPAE KRQPWCVEIP RLAEQAMQAV
IARRDAFGEA QLAHRISDYL PEQGQLFVGN SLVVRLIDAL SQLPAGYPVY SNRGASGIDG
LLSTAAGVQR ASGKPTLAIV GDLSALYDLN ALALLRQVSA PLVLIVVNNN GGQIFSLLPT
PKNERERFYL MPQNVHFEHA AAMFELKYHR PQNWQELETA LVDAWRTPTT TVIEMVVNDT
DGAQTLQQLL AQVSHL