Gene EcSMS35_0795 details

Gene Information

Locus tag: EcSMS35_0795
Symbol:
ID: 6143264
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 797846
End bp: 799129
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 54%
IMG OID: 641615683
Product: putative pectinesterase
Protein accession: YP_001742875
Protein GI: 170683447
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4677] Pectin methylesterase
TIGRFAM ID:
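
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 797846
end_bp = 799129

# Assumption: coordinates are 1-based and inclusive, hence the +1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1284
print(protein_length)  # 427
```

Under these assumptions the results agree with the Gene Length (1284 bp) and Protein Length (427 aa) fields.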


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.253968
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 50
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC 
GCCTGTAGCT CAACACCGCC CGATCAACGT CCTTCTGATC AAACCGCGCC AGGTACCTCT
TCTCGCCCGA TTCTGTCGGC AAAAGAAGCG CAGAATTTCG ATGCTCAACA CTATTTTGCA
TCCCTGACAC CAGGTGCTGC AGCGTGGAAT CCTTCCCCGA TTACCCTGCC TGCACAACCT
GACTTTGTTG TCGGCCCGGC GGGTACTCAA GGTGTAACGC ATACCACGAT TCAGGCGGCG
GTAGATGCGG CAATTATCAA GCGTACCAAC AAGCGCCAGT ATATTGCCGT GATGCCTGGT
GAGTATCAGG GAACGGTGTA TGTCCCTGCC GCTCCGGGTG GAATTACTCT GTACGGTACA
GGTGAAAAAC CGATTGATGT GAAGATTGGG CTTTCCCTTG ATGGTGGCAT GAGCCCTGCC
GACTGGCGTC ACGACGTCAA CCCGCGCGGC AAATATATGC CAGGTAAACC GGCGTGGTAT
ATGTACGATA GCTGCCAGAG TAAACGCAGC GACAGTATCG GTGTTCTCTG CTCTGCGGTC
TTCTGGTCAC AAAACAATGG CCTGCAACTG CAAAACCTGA CCATCGAAAA CACGCTGGGC
GATAGCGTAG ATGCGGGTAA CCATCCGGCG GTGGCACTGC GTACTGATGG CGACAAAGTG
CAGATCAATA ACGTCAACAT TCTCGGTCGT CAGAACACCT TCTTTGTCAC CAACAGCGGT
GTGCAGAACC GTCTGGAAAC CAACCGTCAG CCGCGTACGC TGGTGACCAA CAGCTATATT
GAAGGGGATG TGGATATCGT TTCTGGTCGC GGCGCAGTGG TGTTCGATAA CACCGAATTC
CGCGTGGTGA ACTCCCGTAC CCAGCAAGAA GCGTATGTGT TTGCACCGGC TACGCTGTCC
AACATTTACT ACGGTTTCCT CGCCGTAAAC AGCCGTTTCA ATGCTTCCGG TGATGGCGTG
GCGCAACTGG GCCGCTCGCT GGATGTTGAT GCCAATACCA ACGGTCAGGT AGTGATCCGT
GATAGCGCCA TCAACGAAGG TTTTAACACG GCGAAACCGT GGGCCGATGC GGTGATTTCC
AATCGTCCGT TTGCGGGTAA TACTGGCAAC GTTGATGATA ACGACGAAGT ACAGCGCAAT
CTGAATGACA CTAACTACAA CCGCATGTGG GAATACAATA ACCGCGGCGT GGGTAGCAAA
GTGGTTGCAG AGGCGAAGAA GTAA
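
The GC content field in this record is the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as plain text in the 10-base blocks shown above; the function name `gc_content` is hypothetical and not part of any database tool.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First 10-base block of the gene sequence, used as a small example.
first_block = "GTGAACACAT"
print(f"{gc_content(first_block):.0%}")  # 40%
```

Passing the full gene sequence to the same function gives a value that can be compared with the GC content listed in the record.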
 
Protein sequence
MNTFSVSRLA LALAFGVTLT ACSSTPPDQR PSDQTAPGTS SRPILSAKEA QNFDAQHYFA 
SLTPGAAAWN PSPITLPAQP DFVVGPAGTQ GVTHTTIQAA VDAAIIKRTN KRQYIAVMPG
EYQGTVYVPA APGGITLYGT GEKPIDVKIG LSLDGGMSPA DWRHDVNPRG KYMPGKPAWY
MYDSCQSKRS DSIGVLCSAV FWSQNNGLQL QNLTIENTLG DSVDAGNHPA VALRTDGDKV
QINNVNILGR QNTFFVTNSG VQNRLETNRQ PRTLVTNSYI EGDVDIVSGR GAVVFDNTEF
RVVNSRTQQE AYVFAPATLS NIYYGFLAVN SRFNASGDGV AQLGRSLDVD ANTNGQVVIR
DSAINEGFNT AKPWADAVIS NRPFAGNTGN VDDNDEVQRN LNDTNYNRMW EYNNRGVGSK
VVAEAKK
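
The record lists translation table 11, which explains why the gene sequence begins with GTG while the protein begins with M: in that table GTG is one of the permitted start codons, and a start codon is read as methionine. The sketch below illustrates this with a hand-built codon table rather than a bioinformatics library; `translate_cds` is a hypothetical helper, and it assumes the input is a complete coding sequence that starts at its first base.

```python
# Standard codon-to-amino-acid assignments, in the usual T, C, A, G order.
# Table 11 uses the same assignments and differs in its set of start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks that separate the 10-base blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        # A start codon in the first position is read as methionine.
        if pos == 0 and codon in START_CODONS:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGAACACAT TTTCAGTTTC CCGTCTGGCG"))  # MNTFSVSRLA
```

The output matches the first ten residues of the protein sequence above. The helper raises a `KeyError` on ambiguous bases such as N, which is acceptable for a sketch but would need handling in real use.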