Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4554 |
Symbol | alsB |
ID | 6142967 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4656716 |
End bp | 4657657 |
Gene Length | 942 bp |
Protein Length | 313 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641619370 |
Product | D-allose transporter subunit |
Protein accession | YP_001746482 |
Protein GI | 170682838 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1879] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTATGA ATAAATATCT GAAATATTTC AGTGGCACAC TCGTGGGCTT AATGTTGTCA ACCAGCGCTT TTGCTGCCGC CGAATATGCT GTCGTATTGA AAACACTCTC CAACCCATTT TGGGTAGATA TGAAAAAAGG CATTGAAGAT GAAGCAAAAA CGCTGGGCGT CAGCGTTGAT ATTTTTGCCT CTCCTTCAGA AGGCGATTTT CAATCTCAAT TGCAGTTATT TGAAGATCTC AGTAATAAAA ATTACAAAGG TATCGCCTTC GCGCCATTAT CCTCAGTAAA TCTGGTCATG CCTGTCGCCC GCGCATGGAA AAAAGGCATT TATCTGGTCA ATCTCGATGA AAAAATCGAC ATGGATAATC TGAAAAAAGC TGGCGGTAAT GTGGAAGGTT TTGTCACCAC CGATAATGTT GCCGTCGGGG CGAAAGGCGC GTCATTCATT ATTGACAAGC TGGGTGCCGA AGGGGGTGAA GTCGCAATCA TTGAGGGTAA AGCCGGTAAC GCCTCCGGTG AAGCGCGTCG TAATGGTGCC ACCGAAGCCT TCAAAAAAGC AAGCCAGATC AAGCTTGTCG CCAGCCAGCC TGCCGACTGG GACCGTATTA AAGCACTGGA TGTCGCCACT AACGTGTTGC AACGTAATCC GAATATTAAA GCGATCTATT GCGCGAATGA CACGATGGCA ATGGGTGTTG CTCAGGCAGT CGCAAACGCC GGAAAAACGG GCAAAGTGTT GGTCGTCGGT ACTGACGGCA TTCCGGAAGC CCGCAAAATG GTGGAAACCG GACAAATGAC CGCGACGGTT GCCCAGAACC CGGCGGATAT CGGTGCAACG GGTCTGAAGC TGATGGTTGA CGCTGAGAAA TCCGGCAAGG TTATCCCGCT GGATAAAGCA CCGGAATTTA AACTGGTCGA TTCAATCCTG GTCACTCAAT AA
|
Protein sequence | MIMNKYLKYF SGTLVGLMLS TSAFAAAEYA VVLKTLSNPF WVDMKKGIED EAKTLGVSVD IFASPSEGDF QSQLQLFEDL SNKNYKGIAF APLSSVNLVM PVARAWKKGI YLVNLDEKID MDNLKKAGGN VEGFVTTDNV AVGAKGASFI IDKLGAEGGE VAIIEGKAGN ASGEARRNGA TEAFKKASQI KLVASQPADW DRIKALDVAT NVLQRNPNIK AIYCANDTMA MGVAQAVANA GKTGKVLVVG TDGIPEARKM VETGQMTATV AQNPADIGAT GLKLMVDAEK SGKVIPLDKA PEFKLVDSIL VTQ
|
| |