Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4396 |
Symbol | |
ID | 6144054 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4488714 |
End bp | 4489793 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641619217 |
Product | putative fructose-like permease EIIC subunit 2 |
Protein accession | YP_001746341 |
Protein GI | 170681906 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component |
TIGRFAM ID | [TIGR01427] PTS system, fructose subfamily, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.448765 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGAGT TGGTGCAGAT CCTGAAAAAT ACTCGTCAGC ATTTAATGAC GGGCGTTTCA CACATGATTC CCTTCGTGGT ATCGGGCGGT ATTTTGCTGG CGGTTTCCGT CATGTTGTAT GGCAAAGGCG CAGTGCCGGA TGCCGTCGCC GATCCGAATC TGAAAAAACT GTTTGATATC GGCGTTGCAG GTTTGACGCT GATGGTGCCT TTCCTCGCCG CTTACATCGG TTATTCCATT GCAGAACGCT CTGCGCTGGC TCCGTGCGCT ATCGGTGCCT GGGTTGGTAA CAGCTTTGGT GCGGGCTTCT TTGGTGCACT GATCGCCGGG ATTATCGGCG GCATCGTGGT GCATTACCTG AAGAAAATTC CGGTGCATAA AGTTCTGCGC TCGGTGATGC CAATCTTCAT CATTCCTATC GTCGGCACAC TGATTACCGC TGGCGTCATG ATGTGGGGGC TGGGCGAGCC TGTAGGGGCG TTGACCAACA GCCTGACTCA GTGGCTTCAG GGGATGCAGC AGGGCAGCAT TGTTATGCTG GCGGTGATCA TGGGTCTGAT GCTGGCGTTC GATATGGGCG GCCCGGTTAA CAAAGTGGCC TATGCCTTCA TGCTGATTTG CGTTGCTCAG GGTGTTTATA CCGTGGTGGC TATTGCCGCT GTTGGGATTT GTGTTCCACC GCTGGGGATG GGGCTGGCGA CGCTGATTGG TCGTAAAAAT TTCTCCGCAG AAGAGCGCGA AACCGGCAAG GCGGCACTGG TGATGGGCTG TGTTGGGGTT ACTGAAGGGG CGATTCCTTT CGCCGCTGCC GATCCGCTGC GTGTTATTCC TTCCATCATG ATCGGTTCAG TTTGTGGTGC AGTAACTGCG GCGCTGGTCG GTGCGCAGTG CTATGCAGGC TGGGGTGGTC TGATTGTGCT GCCGGTGGTT GAAGGCAAGC TGGGTTATAT CGCAGCAGTG GCTGTCGGAG CAGTGGTGAC GGCTGTTTGT GTGAACGTGC TGAAAAGTCT GGCGCGTAAA AATGGGTCTT CGACTGATGA AAAAGAAGAC GACCTGGATT TGGATTTTGA AATTAATTAA
|
Protein sequence | MNELVQILKN TRQHLMTGVS HMIPFVVSGG ILLAVSVMLY GKGAVPDAVA DPNLKKLFDI GVAGLTLMVP FLAAYIGYSI AERSALAPCA IGAWVGNSFG AGFFGALIAG IIGGIVVHYL KKIPVHKVLR SVMPIFIIPI VGTLITAGVM MWGLGEPVGA LTNSLTQWLQ GMQQGSIVML AVIMGLMLAF DMGGPVNKVA YAFMLICVAQ GVYTVVAIAA VGICVPPLGM GLATLIGRKN FSAEERETGK AALVMGCVGV TEGAIPFAAA DPLRVIPSIM IGSVCGAVTA ALVGAQCYAG WGGLIVLPVV EGKLGYIAAV AVGAVVTAVC VNVLKSLARK NGSSTDEKED DLDLDFEIN
|
| |