Gene Information |
Locus tag | EcSMS35_0795 |
Symbol | |
ID | 6143264 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 797846 |
End bp | 799129 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641615683 |
Product | putative pectinesterase |
Protein accession | YP_001742875 |
Protein GI | 170683447 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4677] Pectin methylesterase |
TIGRFAM ID | |
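The coordinate and length fields above are related by simple arithmetic, sketched below. This assumes that Start bp and End bp are 1-based and inclusive and that the gene length includes the stop codon; neither convention is stated in the record, although the listed values (1284 bp, 427 aa) are consistent with both. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, includes_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop else codons


length = gene_length(797846, 799129)
print(length)                  # 1284
print(protein_length(length))  # 427
```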
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.253968 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGAACACAT TTTCAGTTTC CCGTCTGGCG CTGGCATTGG CTTTTGGCGT GACGCTGACC GCCTGTAGCT CAACACCGCC CGATCAACGT CCTTCTGATC AAACCGCGCC AGGTACCTCT TCTCGCCCGA TTCTGTCGGC AAAAGAAGCG CAGAATTTCG ATGCTCAACA CTATTTTGCA TCCCTGACAC CAGGTGCTGC AGCGTGGAAT CCTTCCCCGA TTACCCTGCC TGCACAACCT GACTTTGTTG TCGGCCCGGC GGGTACTCAA GGTGTAACGC ATACCACGAT TCAGGCGGCG GTAGATGCGG CAATTATCAA GCGTACCAAC AAGCGCCAGT ATATTGCCGT GATGCCTGGT GAGTATCAGG GAACGGTGTA TGTCCCTGCC GCTCCGGGTG GAATTACTCT GTACGGTACA GGTGAAAAAC CGATTGATGT GAAGATTGGG CTTTCCCTTG ATGGTGGCAT GAGCCCTGCC GACTGGCGTC ACGACGTCAA CCCGCGCGGC AAATATATGC CAGGTAAACC GGCGTGGTAT ATGTACGATA GCTGCCAGAG TAAACGCAGC GACAGTATCG GTGTTCTCTG CTCTGCGGTC TTCTGGTCAC AAAACAATGG CCTGCAACTG CAAAACCTGA CCATCGAAAA CACGCTGGGC GATAGCGTAG ATGCGGGTAA CCATCCGGCG GTGGCACTGC GTACTGATGG CGACAAAGTG CAGATCAATA ACGTCAACAT TCTCGGTCGT CAGAACACCT TCTTTGTCAC CAACAGCGGT GTGCAGAACC GTCTGGAAAC CAACCGTCAG CCGCGTACGC TGGTGACCAA CAGCTATATT GAAGGGGATG TGGATATCGT TTCTGGTCGC GGCGCAGTGG TGTTCGATAA CACCGAATTC CGCGTGGTGA ACTCCCGTAC CCAGCAAGAA GCGTATGTGT TTGCACCGGC TACGCTGTCC AACATTTACT ACGGTTTCCT CGCCGTAAAC AGCCGTTTCA ATGCTTCCGG TGATGGCGTG GCGCAACTGG GCCGCTCGCT GGATGTTGAT GCCAATACCA ACGGTCAGGT AGTGATCCGT GATAGCGCCA TCAACGAAGG TTTTAACACG GCGAAACCGT GGGCCGATGC GGTGATTTCC AATCGTCCGT TTGCGGGTAA TACTGGCAAC GTTGATGATA ACGACGAAGT ACAGCGCAAT CTGAATGACA CTAACTACAA CCGCATGTGG GAATACAATA ACCGCGGCGT GGGTAGCAAA GTGGTTGCAG AGGCGAAGAA GTAA
Protein sequence | MNTFSVSRLA LALAFGVTLT ACSSTPPDQR PSDQTAPGTS SRPILSAKEA QNFDAQHYFA SLTPGAAAWN PSPITLPAQP DFVVGPAGTQ GVTHTTIQAA VDAAIIKRTN KRQYIAVMPG EYQGTVYVPA APGGITLYGT GEKPIDVKIG LSLDGGMSPA DWRHDVNPRG KYMPGKPAWY MYDSCQSKRS DSIGVLCSAV FWSQNNGLQL QNLTIENTLG DSVDAGNHPA VALRTDGDKV QINNVNILGR QNTFFVTNSG VQNRLETNRQ PRTLVTNSYI EGDVDIVSGR GAVVFDNTEF RVVNSRTQQE AYVFAPATLS NIYYGFLAVN SRFNASGDGV AQLGRSLDVD ANTNGQVVIR DSAINEGFNT AKPWADAVIS NRPFAGNTGN VDDNDEVQRN LNDTNYNRMW EYNNRGVGSK VVAEAKK
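The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative start codons and is read as methionine when it initiates translation. The following is a minimal sketch of that translation, not the database's own method. The codon table is built from the standard NCBI amino acid ordering, the set of start codons is the one listed for table 11, and the function name `translate_cds` is hypothetical. The sequence shown in the record starts with GTG and ends with TAA, so it appears to be given in coding orientation already and no reverse complement is applied here.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Codons accepted as initiators under translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}

CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(gene_sequence):
    """Translate a coding-strand CDS; spaces between 10-base blocks are ignored."""
    dna = "".join(gene_sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(dna), 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is always read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGAACACAT TTTCAGTTTC CCGTCTGGCG"))  # MNTFSVSRLA
```

Passing the full Gene sequence field to `translate_cds` should give a string that can be compared against the Protein sequence field once the spaces are removed from the latter.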