Gene Information |
Locus tag | EcHS_A4183 |
Symbol | |
ID | 5594452 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | + |
Start bp | 4172442 |
End bp | 4173521 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640923285 |
Product | putative fructose-like permease EIIC subunit 2 |
Protein accession | YP_001460744 |
Protein GI | 157163426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component |
TIGRFAM ID | [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
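The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that Gene Length counts the stop codon while Protein Length does not; the record does not state either convention, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 4172442
END_BP = 4173521
GENE_LENGTH_BP = 1080
PROTEIN_LENGTH_AA = 359


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# 4173521 - 4172442 + 1 = 1080 bp; 1080 / 3 - 1 = 359 aa
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```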
Plasmid Coverage information |
Num covering plasmid clones | 72 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATGAGT TGGTGCAGAT CCTGAAAAAT ACCCGTCAGC ATTTAATGAC GGGCGTTTCA CACATGATTC CCTTCGTGGT ATCGGGCGGT ATTTTGCTGG CGGTTTCCGT CATGTTGTAT GGCAAAGGCG CAGTGCCGGA TGCCGTAGCC GATCCGAATC TGAAAAAACT GTTTGATATC GGCGTTGCGG GTTTGACGCT GATGGTGCCT TTCCTCGCCG CTTACATTGG TTACTCCATT GCAGAGCGTT CTGCGCTGGC TCCGTGCGCT ATCGGGGCCT GGGTTGGTAA CAGCTTTGGT GCGGGCTTCT TTGGTGCGCT GATCGCCGGG ATTATCGGCG GCATCGTGGT GCATTACCTG AAGAAAATTC CGGTGCATAA AGTTCTGCGC TCGGTGATGC CTATCTTCAT CATTCCTATC GTCGGCACAC TGATTACCGC AGGCATCATG ATGTGGGGCT TGGGCGAGCC TGTAGGGGCG TTGACCAACA GCCTGACTCA GTGGCTTCAG GGGATGCAGC AGGGCAGCAT TGTTATGCTG GCGGTGATCA TGGGTCTGAT GCTGGCGTTC GATATGGGCG GTCCGGTTAA CAAAGTGGCC TATGCCTTCA TGCTGATTTG CGTTGCTCAG GGTGTTTATA CCGTGGTGGC TATTGCCGCT GTTGGGATTT GTGTTCCACC GCTGGGGATG GGGCTGGCGA CGCTGATTGG TCGTAAAAAT TTCTCCGCAG AAGAGCGCGA AACTGGTAAA GCGGCGCTGG TGATGGGGTG CGTTGGGGTT ACTGAAGGGG CGATTCCTTT CGCCGCTGCC GATCCGCTGC GTGTCATTCC TTCCATCATG GTTGGTTCTG TTTGTGGTGC GGTAACTGCG GCGCTGGTCG GTGCGCAGTG CTATGCAGGC TGGGGTGGTC TGATTGTACT GCCGGTGGTT GAAGGCAAGC TGGGTTATAT CGCCGCAGTG GCTGTCGGAG CAGTGGTGAC GGCTGTTTGT GTGAACGTGC TGAAAAGTCT GGCGCGTAAA AATGGGTCTT CGACTGATGA AAAAGAAGAC GACCTGGATT TGGATTTTGA AATTAATTAA
|
Protein sequence | MNELVQILKN TRQHLMTGVS HMIPFVVSGG ILLAVSVMLY GKGAVPDAVA DPNLKKLFDI GVAGLTLMVP FLAAYIGYSI AERSALAPCA IGAWVGNSFG AGFFGALIAG IIGGIVVHYL KKIPVHKVLR SVMPIFIIPI VGTLITAGIM MWGLGEPVGA LTNSLTQWLQ GMQQGSIVML AVIMGLMLAF DMGGPVNKVA YAFMLICVAQ GVYTVVAIAA VGICVPPLGM GLATLIGRKN FSAEERETGK AALVMGCVGV TEGAIPFAAA DPLRVIPSIM VGSVCGAVTA ALVGAQCYAG WGGLIVLPVV EGKLGYIAAV AVGAVVTAVC VNVLKSLARK NGSSTDEKED DLDLDFEIN
|
| |
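The sequences above are shown in space-separated blocks of 10, so they need cleaning before use. The following is a minimal sketch of how the Gene sequence field could be cleaned, checked for GC content, and translated for comparison with the Protein sequence field. The helper names are hypothetical. The codon table is built from the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code; alternative start codons are not handled here. The short inputs in the example are the first two and the last few bases of the gene sequence listed above.

```python
from itertools import product

BASES = "TCAG"
# Amino-acid assignments for codons in TCAG order (NCBI table 11, same as table 1).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks of the gene sequence: the trailing partial codon is ignored.
print(translate("ATGAATGAGT TGGTGCAGAT"))  # MNELVQ
# Final four codons of the gene sequence, ending in the TAA stop codon.
print(translate("GAAATTAATTAA"))  # EIN
```

Passing the full Gene sequence field to `translate` gives a string that can be compared with the Protein sequence field after `clean` is applied to it, and `gc_fraction` on the same input can be compared with the reported GC content.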