Gene ECH74115_3403 details

Gene Information

Locus tag: ECH74115_3403
Symbol: menC
ID: 6968030
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3146875
End bp: 3147837
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 59%
IMG OID: 643387211
Product: O-succinylbenzoate synthase
Protein accession: YP_002271674
Protein GI: 209397997
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1441] O-succinylbenzoate synthase
TIGRFAM ID: [TIGR01927] o-succinylbenzoic acid (OSB) synthetase
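
The coordinates and lengths in the record above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_cds_lengths` is hypothetical.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return (span, codons, remainder) and whether the record is self-consistent.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    # Inclusive span covered by the gene on the replicon.
    span = end_bp - start_bp + 1
    # A CDS should consist of a whole number of codons.
    codons, remainder = divmod(gene_length_bp, 3)
    consistent = (
        span == gene_length_bp
        and remainder == 0
        and codons - 1 == protein_length_aa  # subtract the stop codon
    )
    return span, codons, remainder, consistent


# Values copied from the Gene Information record.
span, codons, remainder, consistent = check_cds_lengths(
    start_bp=3146875,
    end_bp=3147837,
    gene_length_bp=963,
    protein_length_aa=320,
)
print(span, codons, remainder, consistent)
```

Under these assumptions the span equals the listed gene length, and the codon count minus the stop codon equals the listed protein length.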


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.079647
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 72
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGCGTAGCG CGCAGGTATA CCGCTGGCAG ATCCCCATGG ACGCGGGGGT GGTTCTGCGC 
GACAGGCGGT TAAAAACCCG CGACGGGCTG TACGTTTGCC TGCGTGAGGG CGAGCGCGAA
GGGTGGGGGG AGATCTCCCC ACTGCCGGGC TTCAGTCAGG AAACCTGGGA AGAGGCGCAA
AGTGTGCTGC TTGCCTGGGT AAATAACTGG CTGGCAGGCG ATTGCGAATT ACCGCAGATG
CCTTCCGTTG CCTTTGGCGT AAGTTGTGCA TTGGCGGAGC TGGCAGATAC GCTCCCGCAG
GCGGCCAATT ACCGTACGGC ACCACTGTGT AATGGCGATC CGGACGATCT GATCCTCAAA
CTTGCAGATA TGCCAGGCGA GAAAGTGGCG AAGGTCAAAG TGGGATTGTA CGAAGCGGTG
CGCGACGGCA TGGTGGTGAA TCTGTTGCTG GAGGCAATTC CGGATCTGCA TTTGCGTCTT
GACGCAAATC GCGCCTGGAC ACCGCTGAAA GGTCAGCAGT TTGCCAAATA CGTTAACCCG
GATTATCGCC ACCGCATCGC GTTTCTCGAA GAGCCGTGCA AAACCCGCGA TGATTCGCGA
GCGTTTGCCC GTGAAACCGG CATTGCCATT GCCTGGGATG AAAGCTTGCG CGAGCCGGAT
TTTGCCTTTG TGGCTGAAGA GGGCGTGCGC GCGGTAGTTA TCAAACCCAC GCTCACGGGC
AGTCTGGAGA AAGTTCGCGA GCAGGTACAG GCGGCGCACG CGCTGGGGCT GACGGCAGTG
ATCAGTTCTT CCATTGAATC GAGCTTAGGC TTAACGCAAC TGGCGCGGAT TGCTGCCTGG
TTAACGCCGG ACACCATTCC AGGGCTGGAC ACGCTGGATC TAATGCAGGC GCAGCAGGTA
CGTCGCTGGC CGGGTAGCAC GCTGCCTGTC GTGGAAGTTG ATGCACTGGA GCGGTTGTTA
TGA
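
The gene sequence is displayed in blocks of 10 bases separated by spaces. The following is a minimal sketch, assuming the blocks are copied as plain text, of how the whitespace can be stripped and the length and GC percentage computed. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first displayed line is used here to keep the example short, so the printed GC value applies to that line alone and not to the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence (six blocks of 10 bases).
first_line = "ATGCGTAGCG CGCAGGTATA CCGCTGGCAG ATCCCCATGG ACGCGGGGGT GGTTCTGCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```

Passing the full block of sequence lines to `clean_sequence` in the same way would give a string whose length can be compared with the listed gene length.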
 
Protein sequence
MRSAQVYRWQ IPMDAGVVLR DRRLKTRDGL YVCLREGERE GWGEISPLPG FSQETWEEAQ 
SVLLAWVNNW LAGDCELPQM PSVAFGVSCA LAELADTLPQ AANYRTAPLC NGDPDDLILK
LADMPGEKVA KVKVGLYEAV RDGMVVNLLL EAIPDLHLRL DANRAWTPLK GQQFAKYVNP
DYRHRIAFLE EPCKTRDDSR AFARETGIAI AWDESLREPD FAFVAEEGVR AVVIKPTLTG
SLEKVREQVQ AAHALGLTAV ISSSIESSLG LTQLARIAAW LTPDTIPGLD TLDLMQAQQV
RRWPGSTLPV VEVDALERLL