Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2900 |
Symbol | |
ID | 6145029 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2970432 |
End bp | 2971769 |
Gene Length | 1338 bp |
Protein Length | 445 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641617769 |
Product | major facilitator family transporter |
Protein accession | YP_001744924 |
Protein GI | 170681816 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACACTT CACCGGTGCG AATGGATGAT TTACCACTTA ACCGTTTTCA CTGCCGCATT GCTGCGCTCA CTTTCGGTGC GCACCTGACC GACGGTTATG TCCTCGGCGT CATTGGTTAC GCCATTATTC AGCTTACGCC CGCCATGCAA CTGACGCCGT TTATGGCGGG AATGATTGGC GGCTCAGCAC TCCTTGGTTT GTTTCTTGGC AGCCTGGTTC TTGGATGGAT CTCCGACCAT ATTGGTCGGC AAAAAATCTT CACCTTCAGC TTTTTGCTGA TTACGCTCGC TTCGTTCTTG CAATTTTTTG CCACCACGCC AGAGCATCTT ATTGGGCTGC GCATTTTGAT CGGCATTGGT CTGGGAGGCG ATTACTCAGT CGGTCACACC TTGCTGGCTG AATTTTCCCC GCGCCGCCAT CGCGGTATTT TACTGGGCGC ATTCAGCGTG GTGTGGACCG TAGGCTATGT GTTGGCAAGT ATTGCCGGAC ATCACTTTAT TTCCGAAAAC CCGGAGGCCT GGCGCTGGCT GCTGGCATCA GCGGCGTTAC CAGCCTTGCT GATTACGCTG CTACGTTGGG GCACGCCGGA ATCGCCACGC TGGCTACTGC GCCAGGGACG TTTTGCTGAA GCTCACGCTA TTGTGCATCG CTATTTGGGG CCACATGTTT TACTGGGCGA TGAAGTGGCA GCGGCGACCC ATAAACACAT CAAAACCTTG TTTTCTTCGC GCTACTGGCG ACGCACGGCG TTTAACAGCG TCTTCTTTGT CTGCCTCGTA ATCCCATGGT TTGTGATTTA TACCTGGCTG CCGACCATCG CCCAGACTAT TGGTCTGGAG GATGCGCTGA CCGCCAGCCT GATGCTTAAT GCGTTGTTAA TTGTGGGCGC GCTGCTGGGA TTAGTTCTGA CGCACCTGCT GGCACATCGC AAGTTTTTGC TGGGAAGTTT TTTGCTGCTG GCGGCAACGC TGGTAGTGAT GGCCTGTTTG CCTTCCGGCA GTTCATTAAC GCTGCTGCTT TTTGTTCTCT TCAGCACCAC CATTTCGGCA GTCAGTAATC TGGTGGGCAT TTTGCCTGCG GAAAGTTTTC CTACTGACAT TCGCTCGCTG GGCGTCGGTT TTGCCACCGC CATGAGTCGG TTGGGGGCGG CGGTAAGTAC TGGCCTGCTG CCGTGGGTGC TGGCGCAGTG GGGAATGCAA GCCACCTTAT TGCTCCTGGC GGCAGTGTTG TTGGTTGGTT TTGTTGTGAC CTGGCTATGG GCACCAGAAA CCAAAGCACT CCCGCTGGTG GCGGCGGGAA ATGTAGGAGG TGCGAATGAA CATTCTGTTA GCGTTTAA
|
Protein sequence | MNTSPVRMDD LPLNRFHCRI AALTFGAHLT DGYVLGVIGY AIIQLTPAMQ LTPFMAGMIG GSALLGLFLG SLVLGWISDH IGRQKIFTFS FLLITLASFL QFFATTPEHL IGLRILIGIG LGGDYSVGHT LLAEFSPRRH RGILLGAFSV VWTVGYVLAS IAGHHFISEN PEAWRWLLAS AALPALLITL LRWGTPESPR WLLRQGRFAE AHAIVHRYLG PHVLLGDEVA AATHKHIKTL FSSRYWRRTA FNSVFFVCLV IPWFVIYTWL PTIAQTIGLE DALTASLMLN ALLIVGALLG LVLTHLLAHR KFLLGSFLLL AATLVVMACL PSGSSLTLLL FVLFSTTISA VSNLVGILPA ESFPTDIRSL GVGFATAMSR LGAAVSTGLL PWVLAQWGMQ ATLLLLAAVL LVGFVVTWLW APETKALPLV AAGNVGGANE HSVSV
|
| |