Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3769 |
Symbol | |
ID | 6147163 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3832532 |
End bp | 3833890 |
Gene Length | 1359 bp |
Protein Length | 452 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641618595 |
Product | PEP-dependent sugar transporting PTS family, IIC component |
Protein accession | YP_001745735 |
Protein GI | 170682820 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3775] Phosphotransferase system, galactitol-specific IIC component |
TIGRFAM ID | [TIGR00827] PTS system, galactitol-specific IIC component |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGATA TCGCGCATAC CCTCTATAAT ATTGTGCAAT ATATATTGGG ATTTGGCCCC ACGGTAATGT TGCCGTTGGT GTTATTTATT CTTGCTCTCT GTTTTAAAGT AAAACCCGCT AAAGCCTTAC GCTCGTCATT AACGGTCGGC ATTGGTTTTG TCGGTATTTA TGCCATTTTC GATATTCTTA CCAGCAATGT CGGGCCAGCG GCCCAGGCGA TGGTTGAACG CACCGGAATT AATTTACCGG TGGTGGATTT AGGCTGGCCG CCGCTTTCTG CCATTACCTG GGGGTCGCCA ATTGCCCCGT TTGTTATTCC CCTGACCATT CTGATTAATG TCGCAATGCT GGCGTTGAAT AAAACCCGCA CCGTTGATGT AGATATGTGG AACTACTGGC ATTTTGCCCT TGCCGGTACG CTGGTTTATT ACAGCACCGG CAGCCTGTTC TTTGGTTTGC TGGCGGCCGC GATTGCCGCA GTGGTGGTGC TAAAACTCGC CGACTGGTCT GCGCCACTGG TACAAAAATA CTTTGGCCTG GAAGGGATCT CATTGCCGAC GCTCTCTTCG GTGGTGTTCT TCCCGGTCGG TCTGCTGGTC GACAAAATCA TCGACCATAT CCCTGGCCTC AATCGTATTC ATATCGACCC GGAAACCGTA CAGAAAAAGT TTGGCATCTT CGGCGAACCG ATGATGGTTG GCACTATTCT GGGCATTCTG CTCGGCGTAA TTGCCGGATA CGATTTCAAA AAAGTCTTGC TGCTTGGCAT CAGCATTGGC GGTGTGATGT TCATCCTGCC ACGCATGGTA CGCATCCTGA TGGAAGGTTT ATTACCGCTG TCTGAAGCCA TTAAAAAGTA TCTCAATGCC AAATACCCTG ACCGTGACGA TCTCTATATC GGCCTGGATA TCGCCGTTGC CGTAGGTAAC CCGGCGATTA TCTCCACCGC CCTGCTGCTG ACGCCAATCT CGGTCTTTAT CGCGTTTGTC CTCCCAGGTA ATGAAGTCCT GCCGCTTGGT GACCTTGCCA ACCTGGCGGT AATGGCGTCG ATGATTGCTT TAGCCAGCCG TGGCAATATT TTCCGCACCG TTCTGGCGGC GATCCCGGTG ATTATTGCCG ACCTGTGGAT TGCCACTAAA ATCGCGCCGT TTATTACCGG AATGGCGAAA GACGTTAACT TCAAATTTGC CGAAGGCTCC AGCGGCCAGG TTTCCAGTTT CCTTGATGGC GGTAACCCGT TCCGCTTCTG GCTGCTGGAA ATCTTCAACG GCAATCTCAT CGCCATTGGT CTGGTGCCGG TTATCGCCCT GGTACTGTAT GGCATTTTCC GAATGACGCG GAGCACGGTT TATGCCTGA
|
Protein sequence | MNDIAHTLYN IVQYILGFGP TVMLPLVLFI LALCFKVKPA KALRSSLTVG IGFVGIYAIF DILTSNVGPA AQAMVERTGI NLPVVDLGWP PLSAITWGSP IAPFVIPLTI LINVAMLALN KTRTVDVDMW NYWHFALAGT LVYYSTGSLF FGLLAAAIAA VVVLKLADWS APLVQKYFGL EGISLPTLSS VVFFPVGLLV DKIIDHIPGL NRIHIDPETV QKKFGIFGEP MMVGTILGIL LGVIAGYDFK KVLLLGISIG GVMFILPRMV RILMEGLLPL SEAIKKYLNA KYPDRDDLYI GLDIAVAVGN PAIISTALLL TPISVFIAFV LPGNEVLPLG DLANLAVMAS MIALASRGNI FRTVLAAIPV IIADLWIATK IAPFITGMAK DVNFKFAEGS SGQVSSFLDG GNPFRFWLLE IFNGNLIAIG LVPVIALVLY GIFRMTRSTV YA
|
| |