Gene EcHS_A0826 details


Gene Information

Locus tag: EcHS_A0826
Symbol:
ID: 5594744
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 832601
End bp: 833884
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 53%
IMG OID: 640919998
Product: putative pectinesterase
Protein accession: YP_001457565
Protein GI: 157160247
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4677] Pectin methylesterase
TIGRFAM ID:
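
The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon; the function names `span_length` and `expected_protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 832601
END_BP = 833884
GENE_LENGTH_BP = 1284
PROTEIN_LENGTH_AA = 427


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, assuming an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the assumptions above are right.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these values, 833884 - 832601 + 1 gives 1284 bp, and 1284 / 3 - 1 gives 427 aa, matching the listed gene and protein lengths.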


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value: 0.0907325
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC 
GCCTGTAGCT CAACCCCGCC CGATCAACGT CCTTCTGATC AAACCGCGCC TGGTACCTCT
TCTCGCCCGA TTCTGTCGGC AAAAGAAGCG CAGAATTTCG ATGCTCAACA CTATTTTGCA
TCCCTGACAC CAGGTGCTGC AGCGTGGAAT CCTTCCCCGA TTACCCTGCC TGCGCAACCT
GACTTTGTTG TCGGCCCGGC GGGCACTCAA GGTGTAACGC ATACCACGAT TCAGGCGGCG
GTAGATGCGG CAATTATCAA GCGTACCAAC AAGCGCCAGT ATATTGCCGT GATGCCTGGT
GAGTATCAGG GAACGGTATA TGTCCCTGCC GCTCCGGGTG GAATTACTCT GTACGGTACA
GGTGAAAAAC CGATTGATGT GAAGATTGGG CTTTCCCTTG ATGGTGGCAT GAGCCCTGCC
GACTGGCGTC ACGACGTCAA CCCGCGCGGC AAATATATGC CAGGTAAACC AGCGTGGTAT
ATGTACGATA GCTGCCAGAG CAAACGCAGC GACAGTATCG GTGTTCTCTG CTCAGCGGTC
TTCTGGTCAC AAAACAATGG CCTGCAACTG CAAAATCTGA CCATCGAAAA CACGCTGGGC
GATAGCGTAG ATGCAGGTAA CCATCCGGCG GTGGCACTGC GTACTGATGG TGACCAGGTA
CAGATTAACA ACGTTAACAT TCTCGGTCGT CAGAACACCT TCTTTGTCAC CAACAGCGGT
GTGCAGAACC GTCTGGAAAC GAATCGTCAG CCGCGTACGC TGGTGACCAA CAGCTACATT
GAAGGGGATG TGGATATCGT TTCTGGTCGC GGCGCAGTGG TGTTCGATAA CACCGAATTC
CGCGTGGTGA ACTCACGTAC TCAGCAAGAA GCGTATGTGT TTGCACCGGC TACGCTGTCC
AACATTTACT ACGGTTTCCT CGCCGTAAAC AGCCGTTTCA ATGCTTTCGG TGATGGTGTG
GCGCAACTGG GCCGCTCGCT GGATGTTGAT GCCAATACCA ACGGTCAGGT GGTGATCCGT
GATAGCGCCA TCAACGAAGG TTTTAACACG GCTAAACCGT GGGCCGATGC GGTGATCTCT
AATCGTCCGT TTGCGGGTAA TACCGGCAGC GTAGATGATA ACGACGAAAT ACAGCGCAAT
CTGAATGACA CTAACTACAA CCGCATGTGG GAATACAATA ACCGCGGCGT GGGTAGTAAA
GTGGTTGCAG AGGCGAAGAA GTAA
 
Protein sequence
MNTFSVSRLA LALAFGVTLT ACSSTPPDQR PSDQTAPGTS SRPILSAKEA QNFDAQHYFA 
SLTPGAAAWN PSPITLPAQP DFVVGPAGTQ GVTHTTIQAA VDAAIIKRTN KRQYIAVMPG
EYQGTVYVPA APGGITLYGT GEKPIDVKIG LSLDGGMSPA DWRHDVNPRG KYMPGKPAWY
MYDSCQSKRS DSIGVLCSAV FWSQNNGLQL QNLTIENTLG DSVDAGNHPA VALRTDGDQV
QINNVNILGR QNTFFVTNSG VQNRLETNRQ PRTLVTNSYI EGDVDIVSGR GAVVFDNTEF
RVVNSRTQQE AYVFAPATLS NIYYGFLAVN SRFNAFGDGV AQLGRSLDVD ANTNGQVVIR
DSAINEGFNT AKPWADAVIS NRPFAGNTGS VDDNDEIQRN LNDTNYNRMW EYNNRGVGSK
VVAEAKK
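
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11 (the bacterial, archaeal and plant plastid code), where several alternative initiation codons, GTG among them, are read as methionine at the start of a coding sequence. The following is a minimal sketch of that translation, not the database's own pipeline; `translate_cds`, `CODON_TABLE` and `START_CODONS` are names chosen here for illustration. It relies on table 11 sharing its amino acid assignments with the standard code.

```python
from itertools import product

# Codons enumerated in the conventional T, C, A, G order; the amino acid
# string is the standard assignment, which table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and
    stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-mers
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the gene sequence listed above.
FIRST_60_NT = (
    "GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC"
)
print(translate_cds(FIRST_60_NT))  # prints MNTFSVSRLALALAFGVTLT
```

The printed residues match the first 20 residues of the protein sequence above. Passing the full gene sequence to the same function should reproduce the whole protein, ending before the terminal TAA stop codon.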