Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1804 |
Symbol | |
ID | 6147420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1823946 |
End bp | 1825028 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641616680 |
Product | sugar ABC transporter, ATP-binding protein |
Protein accession | YP_001743858 |
Protein GI | 170680817 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3839] ABC-type sugar transport systems, ATPase components |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.438286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTCAGC TTTCGTTACA ACATATTCAA AAAATCTACG ATAACCAGGT GCATGTGGTG AAGGACTTCA ACCTGGAAAT TGCCGATAAA GAGTTCATTG TCTTTGTCGG TCCGTCAGGC TGCGGTAAAT CGACTACGCT GCGCATGATT GCCGGGCTTG AGGAGATCAG CGGTGGCGAT CTGTTGATCG ACGGCAAACG AATGAATGAC GTTCCAGCCA AAGCGCGCAA TATCGCGATG GTGTTCCAGA ACTACGCGCT GTATCCGCAT ATGACGGTCT ACGACAACAT GGCGTTTGGC CTGAAGATGC AAAAAATCGC CAAAGAGGTG ATTGATGAGC GGGTTAACTG GGCGGCGCAA ATCCTCGGCC TGCGTGAGTA CCTGAAACGC AAGCCGGGCG CGCTTTCTGG CGGGCAACGC CAGCGTGTGG CGCTCGGACG GGCGATTGTG CGCGAAGCGG GCGTGTTTTT AATGGATGAA CCGCTCTCTA ACCTTGATGC CAAGCTGCGC GTACAAATGC GCGCAGAGAT CAGCAAGCTG CATCAGAAAC TGAACACCAC CATGATCTAC GTGACCCACG ATCAGACCGA AGCGATGACC ATGGCGACGC GGATTGTGAT TATGAAGGAT GGGATTGTTC AGCAGGTCGG TGCGCCGAAA ACGGTTTATA ACCAACCTGC GAATATGTTT GTTGCCGGAT TTATTGGATC GCCTGCGATG AATTTTATTC GCGGCACGAT CGATGGCGAT AAATTCGTTA CGGAAACGCT TAAATTAACC ATACCCGAAG AGAAATTAGC GGTTCTGAAA ACACAGGAAA GTTTGCATAA GCCCATCGTG ATGGGAATAC GCCCGGAAGA TATTCATCCG GACGCGCAAG AGGAAAATAA CATTTCCGCC AAAATTAGCG TGGCAGAATT AACCGGTGCG GAATTTATGC TCTACACCAC GGTTGGGGGG CACGAGTTAG TGGTCCGTGC TGGTGCGTTA AATGATTATC ATGCAGGAGA AAATATCACT ATTCATTTTG ATATGACTAA ATGTCATTTC TTTGATGCAG AAACGGAAAT AGCAATTCGC TAA
|
Protein sequence | MAQLSLQHIQ KIYDNQVHVV KDFNLEIADK EFIVFVGPSG CGKSTTLRMI AGLEEISGGD LLIDGKRMND VPAKARNIAM VFQNYALYPH MTVYDNMAFG LKMQKIAKEV IDERVNWAAQ ILGLREYLKR KPGALSGGQR QRVALGRAIV REAGVFLMDE PLSNLDAKLR VQMRAEISKL HQKLNTTMIY VTHDQTEAMT MATRIVIMKD GIVQQVGAPK TVYNQPANMF VAGFIGSPAM NFIRGTIDGD KFVTETLKLT IPEEKLAVLK TQESLHKPIV MGIRPEDIHP DAQEENNISA KISVAELTGA EFMLYTTVGG HELVVRAGAL NDYHAGENIT IHFDMTKCHF FDAETEIAIR
|
| |