Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4477 |
Symbol | |
ID | 6146843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4575253 |
End bp | 4576077 |
Gene Length | 825 bp |
Protein Length | 274 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619293 |
Product | PTS system mannose/fructose/sorbose family IID subunit |
Protein accession | YP_001746405 |
Protein GI | 170683591 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3716] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IID |
TIGRFAM ID | [TIGR00828] PTS system, mannose/fructose/sorbose family, IID component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.419116 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAACAGA GAAAAATTAC ACGCAGCGAT CTGGTGAGCA TGTTTCTGCG CTCCAACCTG CAACAGGCAT CCTTTAACTT TGAACGTATT CACGGGCTGG GTTTTTGCTA CGACATGATC CCCGCCATCA AACGACTTTA CCCATTAAAA GAGGATCAGG TTGCGGCGCT CAGGCGACAC CTAGTGTTCT TCAATACCAC GCCAGCCGTA TGTGGCCCGG TCATCGGCGT CACTGCCGCC ATGGAAGAGG CGCGAGCCAA CGGCGCGGAA ATTGATGACG GTACCATTAA CGGCATCAAA GTCGGCCTGA TGGGACCGTT GGCAGGTGTT GGCGATCCAC TGGTCTGGGG AACGCTGCGC CCGATTACCG CCGCGCTCGG CGCATCTCTG GCACTTTCGG GCAACATTCT CGGCCCCCTG CTGTTCTTCT TTATTTTCAA TGCAGTACGT CTGGCAATGA AATGGTATGG GCTGCAACTG GGCTTTCGTA AAGGGGTGAA TATCGTCAGC GATATGGGCG GTAATTTGCT GCAAAAACTG ACCGAAGGCG CGTCGATTCT CGGGCTGTTT GTAATGGGCG TACTGGTGAC CAAATGGACA TCGATCAACG TGCCGTTAGT GGTTTCACAA ACGCCTGCCG CCGATGGTGC CACCGTCACT ATGACCGTAC AGAACATTCT CGACCAGCTT TGCCCTGGAT TGCTGGCGCT CGGTTTGACG CTGCTGATGG TACGTCTGCT CAACAAAAAA ATTAACCCGG TATGGCTGAT TTTCGCCCTG TTTGGTTTAG GGATTATCGG TAATGCGCTG GGCTTCCTGT CCTGA
|
Protein sequence | MEQRKITRSD LVSMFLRSNL QQASFNFERI HGLGFCYDMI PAIKRLYPLK EDQVAALRRH LVFFNTTPAV CGPVIGVTAA MEEARANGAE IDDGTINGIK VGLMGPLAGV GDPLVWGTLR PITAALGASL ALSGNILGPL LFFFIFNAVR LAMKWYGLQL GFRKGVNIVS DMGGNLLQKL TEGASILGLF VMGVLVTKWT SINVPLVVSQ TPAADGATVT MTVQNILDQL CPGLLALGLT LLMVRLLNKK INPVWLIFAL FGLGIIGNAL GFLS
|
| |