Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4552 |
Symbol | alsC |
ID | 6143888 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4654097 |
End bp | 4655077 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619368 |
Product | D-allose transporter subunit |
Protein accession | YP_001746480 |
Protein GI | 170684183 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTTA CCACAAGAGT AAAAAGCGAA GCGAGCGAGA AGAAACCGTT CAACTTTGCG CTGTTCTGGG ATAAATACGG CACCTTTTTT ATCCTGGCGA TCATCGTCGC CATCTTTGGT TCGCTGTCAC CAGAATATTT TCTGACCACC AATAATATTA CCCAGATTTT TGTGCAAAGC TCCGTGACGG TATTGATCGG CATGGGCGAG TTTTTCGCTA TCCTGGTCGC TGGTATCGAC CTCTCGGTTG GCGCGATTCT GGCGCTTTCC GGTATGGTGA CCGCCAAACT GATGTTGGCA GGTGTTGACC CGTTTCTCGC GGCGCTGATT GGCGGTGTAC TGGTTGGCGG CGCACTGGGG GCGATCAACG GCTGCCTGGT CAACTGGACG GGGCTACACC CATTTATTAT CACCCTTGGT ACCAACGCCA TTTTCCGTGG GATCACGCTG GTGATCTCCG ATGCCAACTC GGTATACGGC TTCTCATTTG ACTTCGTGAA CTTCTTTGCC GCCAGCGTAA TTGGGATACC TGTTCCCGTT ATCTTCTCGC TAATTGTCGC GCTCATCCTT TGGTTTCTGA CAACGCGTAT GCGGCTCGGA CGCAACATCT ACGCACTGGG CGGCAACAAA AACTCGGCGT TCTATTCCGG GATTGACGTG AAATTCCACA TCCTGGTGGT GTTTATCATC TCCGGTGTTT GTGCAGGTCT GGCAGGCGTC GTCTCAACTG CACGACTCGG TGCCGCAGAA CCGCTTGCCG GTATGGGTTT TGAAACCTAT GCCATTGCCA GCGCCATCAT TGGCGGCACC AGTTTCTTCG GCGGCAAGGG GCGCATTTTC TCTGTGGTGA TTGGCGGGTT GATCATCGGC ACCATCAACA ACGGTCTGAA TATTTTGCAG GTACAAACCT ATTACCAACT GGTGGTGATG GGCGGATTAA TTATCGCGGC TGTCGCCCTT GACCGTCTTA TCAGTAAGTA A
|
Protein sequence | MGFTTRVKSE ASEKKPFNFA LFWDKYGTFF ILAIIVAIFG SLSPEYFLTT NNITQIFVQS SVTVLIGMGE FFAILVAGID LSVGAILALS GMVTAKLMLA GVDPFLAALI GGVLVGGALG AINGCLVNWT GLHPFIITLG TNAIFRGITL VISDANSVYG FSFDFVNFFA ASVIGIPVPV IFSLIVALIL WFLTTRMRLG RNIYALGGNK NSAFYSGIDV KFHILVVFII SGVCAGLAGV VSTARLGAAE PLAGMGFETY AIASAIIGGT SFFGGKGRIF SVVIGGLIIG TINNGLNILQ VQTYYQLVVM GGLIIAAVAL DRLISK
|
| |