Gene ECH74115_0876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0876 
Symbol 
ID6970494 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp893139 
End bp894422 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content54% 
IMG OID643384901 
Productputative pectinesterase 
Protein accessionYP_002269401 
Protein GI209397907 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4677] Pectin methylesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC 
GCCTGTAGCT CAACACCACC CGATCAACGT CCTTCTGATC AAACTGCGCC TGGTACCTCT
TCTCGCCCGA TTCTGTCGGC AAAAGAAGCG CAGAATTTCG ATGCTCAACA CTATTTTGCA
TCCCTGACAC CAGGTGCTGC AGCGTGGAAT CCGTCCCCGA TAACCCTGCC TGCGCAACCT
GACTTTGTTG TCGGCCCGGC GGGTACTCAA GGTGTAACGC ATACCACGAT TCAGGCGGCG
GTAGATGCGG CAATTATCAA GCGCACCAAC AAGCGCCAGT ATATTGCCGT GATGCCTGGT
GAGTATCAGG GAACGGTGTA TGTCCCTGCT GCTCCAGGTG GAATTACTCT GTACGGTACA
GGTGAAAAAC CGATTGATGT GAAGATTGGG CTTTCCCTTG ATGGGGGCAT GAGCCCTGCC
GACTGGCGTC ACGACGTCAA CCCGCGCGGC AAATATATGC CAGGTAAACC AGCGTGGTAT
ATGTACGATA GCTGCCAGAG CAAACGCAGC GACAGTATCG GTGTTCTGTG TTCTGCGGTC
TTCTGGTCAC AAAACAATGG CCTGCAACTG CAAAACCTGA CCATCGAAAA CACGCTGGGT
GATAGCGTAG ATGCGGGTAA CCATCCGGCG GTGGCACTGC GTACTGACGG TGACCAGGTA
CAGATTAACA ACGTTAACAT TCTCGGTCGT CAGAACACCT TCTTTGTCAC CAACAGCGGT
GTGCAGAACC GTCTGGAAAC GAATCGTCAG CCGCGTACGC TGGTGACCAA CAGCTACATT
GAAGGGGATG TGGATATCGT TTCTGGTCGC GGCGCAGTGG TGTTCGATAA CACCGAATTC
CGCGTGGTGA ACTCACGTAC TCAGCAAGAA GCGTATGTGT TTGCACCGGC TACGCTGTCC
AACATTTACT ACGGTTTCCT CGCCGTAAAC AGCCGTTTCA ATGCTTCCGG TGATGGCGTG
GCACAACTGG GCCGCTCGCT GGATGTTGAT GCCAATACCA ACGGTCAGGT GGTGATCCGT
GATAGCGCCA TCAACGAAGG TTTTAACACG GCTAAACCGT GGGGCGATGC GGTGATCTCT
AATCGTCCGT TTGCGGGTAA CACCGGCAGC GTTGACGATA GCGACGAAAT ACAGCGCAAT
CTGAATGACA CTAACTACAA CCGCATGTGG GAATACAATA ACCGCGGCGT GGGTAGCAAA
GTGGTTGCAG AGGCGAAGAA GTAA
 
Protein sequence
MNTFSVSRLA LALAFGVTLT ACSSTPPDQR PSDQTAPGTS SRPILSAKEA QNFDAQHYFA 
SLTPGAAAWN PSPITLPAQP DFVVGPAGTQ GVTHTTIQAA VDAAIIKRTN KRQYIAVMPG
EYQGTVYVPA APGGITLYGT GEKPIDVKIG LSLDGGMSPA DWRHDVNPRG KYMPGKPAWY
MYDSCQSKRS DSIGVLCSAV FWSQNNGLQL QNLTIENTLG DSVDAGNHPA VALRTDGDQV
QINNVNILGR QNTFFVTNSG VQNRLETNRQ PRTLVTNSYI EGDVDIVSGR GAVVFDNTEF
RVVNSRTQQE AYVFAPATLS NIYYGFLAVN SRFNASGDGV AQLGRSLDVD ANTNGQVVIR
DSAINEGFNT AKPWGDAVIS NRPFAGNTGS VDDSDEIQRN LNDTNYNRMW EYNNRGVGSK
VVAEAKK