Gene Information |
Locus tag | ECH74115_5409 |
Symbol | |
ID | 6971033 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5052525 |
End bp | 5053604 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643389062 |
Product | putative fructose-like permease EIIC subunit 2 |
Protein accession | YP_002273471 |
Protein GI | 209398507 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component |
TIGRFAM ID | [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
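The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function names `gene_length` and `expected_protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS length is a multiple of three and, by default, that the
    final codon is a stop codon that does not appear in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_len_bp // 3 - (1 if has_stop_codon else 0)


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(5052525, 5053604)
print(length_bp)                           # 1080
print(expected_protein_length(length_bp))  # 359
```

Under these assumptions the result agrees with the listed Gene Length (1080 bp) and Protein Length (359 aa).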
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 59 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAGT TGGTGCAGAT CCTGAAAAAT ACCCGTCAGC ATTTAATGAC GGGCGTTTCA CACATGATTC CCTTCGTGGT ATCGGGCGGT ATTTTGCTGG CGGTTTCCGT CATGTTGTAT GGCAAAGGCG CAGTGCCGGA TGCCGTAGCC GATCCGAATC TGAAAAAACT GTTTGATATC GGCGTTGCAG GTTTGACGCT GATGGTGCCT TTCCTCGCCG CTTACATCGG TTACTCCATT GCAGAGCGTT CTGCGCTGGC TCCGTGCGCT ATCGGTGCCT GGGTTGGTAA CAGCTTTGGT GCGGGCTTCT TTGGTGCACT GATCGCCGGG ATTATCGGCG GCATCGTGGT GCATTACCTG AAGAAAATTC CGGTGCATAA AGTTCTGCGC TCGGTGATGC CCATCTTCAT CATTCCGATC GTCGGCACAC TGATTACCGC AGGCATCATG ATGTGGGGTC TGGGCGAGCC TGTAGGGGCG TTGACCAACA GCCTGACTCA GTGGCTTCAG GGGATGCAGC AGGGCAGCAT TGTTATGCTG GCGGTGATCA TGGGTCTGAT GCTGGCGTTC GATATGGGCG GTCCGGTTAA CAAAGTGGCC TATGCCTTCA TGCTGATTTG CGTTGCTCAG GGTGTTTATA CCGTGGTGGC CATTGCTGCC GTGGGTATTT GTATCCCGCC GCTGGGGATG GGGCTGGCGA CGCTGATTGG TCGTAAAAAT TTCTCCGCAG AAGAGCGCGA AACCGGTAAA GCGGCGCTGG TGATGGGGTG CGTTGGGGTT ACTGAAGGGG CGATTCCTTT CGCCGCAGCC GATCCGTTAC GTGTAATTCC TTCCATCATG GTTGGTTCAG TTTGTGGTGC AGTAACTGCG GCGCTGGTCG GTGCGCAGTG CTATGCAGGC TGGGGTGGTC TGATTGTGCT GCCGGTGGTT GAAGGTAAGT TGGGTTATAT CGCAGCAGTG GCTGTCGGGG CGGTTGTGAC GGCAGTCTGC GTGAACGTGC TGAAAAGTCT GGCGCGTAAA AATGGATCTT CTACTGATGA AAAAGAAGAC GACCTGGATT TGGATTTTGA AATTAATTAA
|
Protein sequence | MNELVQILKN TRQHLMTGVS HMIPFVVSGG ILLAVSVMLY GKGAVPDAVA DPNLKKLFDI GVAGLTLMVP FLAAYIGYSI AERSALAPCA IGAWVGNSFG AGFFGALIAG IIGGIVVHYL KKIPVHKVLR SVMPIFIIPI VGTLITAGIM MWGLGEPVGA LTNSLTQWLQ GMQQGSIVML AVIMGLMLAF DMGGPVNKVA YAFMLICVAQ GVYTVVAIAA VGICIPPLGM GLATLIGRKN FSAEERETGK AALVMGCVGV TEGAIPFAAA DPLRVIPSIM VGSVCGAVTA ALVGAQCYAG WGGLIVLPVV EGKLGYIAAV AVGAVVTAVC VNVLKSLARK NGSSTDEKED DLDLDFEIN
|
| |
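The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a record could be checked against its GC content and protein fields is given below. It assumes only unambiguous A/C/G/T bases; the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. The codon table is built from the standard NCBI ordering, which gives the same codon-to-amino-acid assignments as translation table 11 (the two tables differ in the permitted start codons, which this sketch does not model).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G; then
# second base; then third base). '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 until the first stop codon.

    Trailing bases that do not form a full codon are ignored. Codons with
    characters other than A/C/G/T raise KeyError.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
prefix = "ATGAATGAGT TGGTGCAGAT"
print(translate(prefix))    # MNELVQ
print(gc_fraction(prefix))  # 0.4
```

The translated prefix matches the first six residues of the listed protein sequence. Running the same functions on the full gene sequence would allow the GC content and protein fields to be compared with the record.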