Gene Information |
Locus tag | EcSMS35_1541 |
Symbol | |
ID | 6146502 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1525569 |
End bp | 1526738 |
Gene Length | 1170 bp |
Protein Length | 389 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616418 |
Product | major facilitator transporter |
Protein accession | YP_001743596 |
Protein GI | 170684306 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
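The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the numbers in this record but are not stated by it. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 1525569
end_bp = 1526738
gene_length_bp = 1170
protein_length_aa = 389


def span_length(start: int, end: int) -> int:
    """Length of a coordinate interval, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(start_bp, end_bp) == gene_length_bp
assert coding_length_aa(gene_length_bp) == protein_length_aa
```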
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.188499 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.114044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATTA ACTATCCGTT GCTGGCGCTG GCGATTGGCG CGTTTGGTAT CGGGACAACG GAGTTCTCGC CAATGGGCTT ATTGCCCGTC ATTGCGCGCG GTGTGGATGT CTCGATTCCC GCTGCCGGAA TGTTAATCAG TGCCTATGCA GTTGGCGTAA TGGTTGGCGC GCCGCTGATG ACGCTTCTAC TTTCTCATCG TGCTCGCCGC AGTGCGTTGA TTTTCCTGAT GGCGATTTTC ACGCTCGGCA ACGTACTTTC CGCCATCGCG CCGGATTATA TGACTCTGAT GCTTTCACGC ATTTTGACCA GCCTGAATCA CGGAGCATTT TTTGGTTTGG GTTCAGTTGT GGCCGCAAGC GTGGTGCCAA AACATAAACA GGCCAGCGCA GTTGCCACTA TGTTTATGGG GTTAACCCTG GCAAATATCG GTGGCGTGCC GGCGGCGACC TGGTTGGGTG AAACCATCGG CTGGCGGATG TCATTTCTGG CAACGGCGGG GCTGGGAGTG ATTTCAATGG TAAGTCTGTT CTTCTCATTA CCTAAAGGTG GCGCAGGGGC GCGACCTGAA GTGAAAAAAG AGCTGGCGGT ATTAATGCGT CCGCAGGTGC TGTCTGCATT GCTGACGACG GTACTGGGAG CTGGTGCAAT GTTTACCCTC TACACCTATA TCTCTCCGGT ACTGCAAAGT ATTACCCACG CAACACCGGT GTTCGTTACG GCAATGCTGG TGCTGATTGG TGTCGGATTC TCTATCGGTA ACTATCTCGG CGGCAAACTG GCAGATCGTT CAGTTAACGG CACGTTGAAA GGCTTTTTGT TGCTGCTGAT GGTGATTATG CTGGCAATCC CGTTCCTGGC CCGCAATGAG TTCGGCGCAG CTATTAGCAT GGTGGTGTGG GGCGCTGCAA CCTTTGCGGT CGTACCGCCG TTACAGATGC GCGTGATGCG TGTCGCCAGT GAAGCGCCAG GTCTGTCTTC ATCAGTCAAT ATTGGTGCCT TTAATCTTGG AAATGCGCTG GGAGCAGCTG CTGGTGGTGC GGTAATTTCC GCTGGGCTGG GATACAGTTT TGTGCCGGTG ATGGGGGCGA TTGTCGCGGG ACTGGCATTA TTACTGGTGT TTATGTCAGC CAGAAAACAA CCTGAAACAG TTTGCGTTGC CAACAGCTAA
|
Protein sequence | MKINYPLLAL AIGAFGIGTT EFSPMGLLPV IARGVDVSIP AAGMLISAYA VGVMVGAPLM TLLLSHRARR SALIFLMAIF TLGNVLSAIA PDYMTLMLSR ILTSLNHGAF FGLGSVVAAS VVPKHKQASA VATMFMGLTL ANIGGVPAAT WLGETIGWRM SFLATAGLGV ISMVSLFFSL PKGGAGARPE VKKELAVLMR PQVLSALLTT VLGAGAMFTL YTYISPVLQS ITHATPVFVT AMLVLIGVGF SIGNYLGGKL ADRSVNGTLK GFLLLLMVIM LAIPFLARNE FGAAISMVVW GAATFAVVPP LQMRVMRVAS EAPGLSSSVN IGAFNLGNAL GAAAGGAVIS AGLGYSFVPV MGAIVAGLAL LLVFMSARKQ PETVCVANS
|
| |
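The gene and protein sequences above are printed in blocks of ten characters, so the spaces have to be removed before any processing. As a minimal sketch, the code below strips the spacing and translates the first 30 bases of the gene sequence. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in which codons may act as initiators). The helper names `clean` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence, copied from the record.
first_30_bases = "ATGAAAATTA ACTATCCGTT GCTGGCGCTG"
print(translate(first_30_bases))  # MKINYPLLAL
```

The output matches the first block of the protein sequence in the record. Passing the complete gene sequence to `translate` would allow the same comparison over the full length.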