Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4478 |
Symbol | |
ID | 6142882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4576088 |
End bp | 4576885 |
Gene Length | 798 bp |
Protein Length | 265 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641619294 |
Product | PTS system mannose/fructose/sorbose family IIC subunit |
Protein accession | YP_001746406 |
Protein GI | 170684109 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3715] Phosphotransferase system, mannose/fructose/N-acetylgalactosamine-specific component IIC |
TIGRFAM ID | [TIGR00822] PTS system, mannose/fructose/sorbose family, IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.274968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAATAA GTACCCTACA AATTATTGCT ATATTTCTTT TTTCCTGTAT TGCCGGAATG GGCAGCGTGC TGGATGAATT TCAGACTCAT CGTCCGTTAA TTGCCTGTAC GGTGATTGGT TTAATTCTCG GTGATTTAAA AACCGGAATT ATGCTCGGTG GTACGCTGGA ATTGATAGCT CTCGGCTGGA TGAACGTCGG CGCGGCGCAA TCTCCGGATT CTGCACTCGC CAGCATAATC TCCGCCATTC TGGTTATCGT TGGTCAGCAA AGCATCGCCA CCGGAATCGC CATCGCGTTG CCTGTGGCTG CGGCAGGCCA GGTGCTGACG GTGTTTGCCC GTACCATCAC AGTAGTGTTC CAGCACGCGG CGGATAAAGC AGCAGAAGAG GCGCGGTTTC GCACTCTCGA TATTCTGCAT GTCTCCGCGC TTGGCGTGCA GGCGCTGCGC GTTGCTATTC CGGCACTGAT TGTCTCGCTG TTCGTCAGCG CCGATATGGT GAGCAATATG CTGAGCGCCA TTCCCGAATT TGTGACCCGT GGACTGCAGA TTGCTGGCGG TTTTATCGTG GTGGTCGGTT ACGCCATGGT GCTTCGCATG ATGGGCGTGA AATATTTGAT GCCTTTCTTT TTCCTCGGTT TCCTCGCAGG TGGCTACCTC GATCTCAGTC TGCTGGCGTT CGGTGGCGTC GGCGTGATCA TGGCCCTGCT CTACATCCAG TTAAATCCAC AGTGGCGTAA AGCTGAACCA CATCCCCAGA CCACCACTAT CACCGCCCTT GACCAACTTG ATGATTAA
|
Protein sequence | MEISTLQIIA IFLFSCIAGM GSVLDEFQTH RPLIACTVIG LILGDLKTGI MLGGTLELIA LGWMNVGAAQ SPDSALASII SAILVIVGQQ SIATGIAIAL PVAAAGQVLT VFARTITVVF QHAADKAAEE ARFRTLDILH VSALGVQALR VAIPALIVSL FVSADMVSNM LSAIPEFVTR GLQIAGGFIV VVGYAMVLRM MGVKYLMPFF FLGFLAGGYL DLSLLAFGGV GVIMALLYIQ LNPQWRKAEP HPQTTTITAL DQLDD
|
| |