Gene EcolC_2890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2890 
Symbol 
ID6065080 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp3148286 
End bp3149569 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content53% 
IMG OID641602295 
Productputative pectinesterase 
Protein accessionYP_001725844 
Protein GI170020890 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4677] Pectin methylesterase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0446001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.0752938 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC 
GCCTGTAGCT CAACCCCGCC CGATCAACGT CCTTCTGATC AAACCGCGCC TGGTACCTCT
TCTCGCCCGA TTCTGTCGGC AAAAGAAGCG CAGAATTTCG ATGCTCAACA CTATTTTGCA
TCCCTGACAC CAGGTGCTGC AGCGTGGAAT CCTTCCCCGA TTACCCTGCC TGCGCAACCT
GAATTTGTTG TCGGCCCGGC GGGCACTCAA GGTGTAACGC ATACCACGAT TCAGGCGGCG
GTAGATGCGG CAATTATCAA GCGTACCAAC AAGCGCCAGT ATATTGCCGT GATGCCTGGT
GAGTATCAGG GAACGGTATA TGTCCCTGCC GCTCCGGGTG GAATTACTCT GTACGGTACA
GGTGAAAAAC CGATTGATGT GAAGATTGGG CTTTCCCTTG ATGGTGGCAT GAGCCCTGCC
GACTGGCGTC ACGACGTCAA CCCGCGCGGC AAATATATGC CAGGTAAACC AGCGTGGTAT
ATGTACGATA GCTGCCAGAG CAAACGCAGC GACAGTATCG GTGTTCTCTG CTCTGCGGTC
TTCTGGTCAC AAAACAATGG CCTGCAACTG CAAAATCTGA CCATCGAAAA CACGCTGGGC
GATAGCGTAG ATGCAGGTAA CCATCCGGCG GTGGCACTGC GTACTGATGG TGACCAGGTA
CAGATTAACA ACGTTAACAT TCTCGGTCGT CAGAACACCT TCTTTGTCAC CAACAGCGGT
GTGCAGAACC GTCTGGAAAC GAATCGTCAG CCGCGTACGC TGGTGACCAA CAGCTACATT
GAAGGGGATG TGGATATCGT TTCTGGTCGC GGCGCAGTGG TGTTCGATAA CACCGAATTC
CGCGTGGTGA ACTCACGTAC TCAGCAAGAA GCGTATGTGT TTGCACCGGC TACGCTGTCC
AACATTTACT ACGGTTTCCT CGCCGTAAAC AGCCGTTTCA ATGCTTTCGG TGATGGTGTG
GCGCAACTGG GCCGCTCGCT GGATGTTGAT GCCAATACCA ACGGTCAGGT GGTGATCCGT
GATAGCGCCA TCAACGAAGG TTTTAACACG GCTAAACCGT GGGCCGATGC GGTGATCTCT
AATCGTCCGT TTGCGGGTAA TACCGGCAGC GTAGATGATA ACGACGAAAT ACAGCGCAAT
CTGAATGACA CTAACTACAA CCGCATGTGG GAATACAATA ACCGCGGCGT GGGTAGTAAA
GTGGTTGCAG AGGCGAAGAA GTAA
 
Protein sequence
MNTFSVSRLA LALAFGVTLT ACSSTPPDQR PSDQTAPGTS SRPILSAKEA QNFDAQHYFA 
SLTPGAAAWN PSPITLPAQP EFVVGPAGTQ GVTHTTIQAA VDAAIIKRTN KRQYIAVMPG
EYQGTVYVPA APGGITLYGT GEKPIDVKIG LSLDGGMSPA DWRHDVNPRG KYMPGKPAWY
MYDSCQSKRS DSIGVLCSAV FWSQNNGLQL QNLTIENTLG DSVDAGNHPA VALRTDGDQV
QINNVNILGR QNTFFVTNSG VQNRLETNRQ PRTLVTNSYI EGDVDIVSGR GAVVFDNTEF
RVVNSRTQQE AYVFAPATLS NIYYGFLAVN SRFNAFGDGV AQLGRSLDVD ANTNGQVVIR
DSAINEGFNT AKPWADAVIS NRPFAGNTGS VDDNDEIQRN LNDTNYNRMW EYNNRGVGSK
VVAEAKK