Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2221 |
Symbol | |
ID | 6145050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2240204 |
End bp | 2241352 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617097 |
Product | putative MFS family transporter protein |
Protein accession | YP_001744271 |
Protein GI | 170682710 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.707303 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACGT ATACCCGGCC TGTCATGCTT TTGCTGTCTG GCCTGCTTTT GTTGACTCTG GCGATTGCGG TGTTAAATAC ACTCGTGCCG CTTTGGCTCG CCCAGGAACA CATGTCCACA TGGCAGGTAG GCGTTGTCAG CTCATCCTAT TTTACCGGCA ACCTGGTCGG TACATTGCTG ACAGGGTATG TCATTAAGCG CATTGGCTTT AACCGCAGCT ATTATCTGGC CTCCTTCATT TTTGCCGCTG GCTGTGCCGG CCTTGGCCTG ATGATTGGAT TCTGGAGCTG GTTGGCTTGG CGTTTTGTCG CCGGCGTCGG CTGTGCCATG ATTTGGGTGG TTGTTGAGAG CGCGCTGATG TGCAGTGGGA CGTCACGTAA CCGTGGGCGT TTGCTTGCTG CGTATATGAT GGTTTATTAC GTGGGAACGT TTTTAGGCCA GTTACTGGTC AGCAAAGTTT CAACCGAGCT GATGTCCGTA TTGCCGTGGG TTACAGGTTT GACGTTGGCA GGGATCTTAC CGCTGTTGTT TACGCGTGTG CTGAATCAGC AGGCTGAAAA CCATGATTCG ACGTCAATTA CGTCAATGCT AAAACTCCGT CAGGCGCGGC TTGGCGTGAA TGGCTGCATT ATCTCAGGAA TCGTTCTGGG ATCTCTATAT GGCCTGATGC CGCTGTACCT CAATCACAAA GGGGTGAGCA ATGCCAGCAT TGGTTTCTGG ATGGCGGTAC TGGTCAGTGC GGGTATCCTT GGACAATGGC CGATTGGACG TCTGGCGGAT AAGTTTGGTC GACTGCTGGT GTTGCGTGTT CAGGTCTTTG TCGTCATTCT CGGCAGTATC GCGATGCTTA GCCAGGCGGC GATGGCCCCA GCGTTATTCA TCCTCGGTGC CGCTGGCTTT ACGCTATATC CGGTGGCGAT GGCATGGGCT TGCGAGAAAG TTGAACATCA TCAACTGGTG GCGATGAACC AGGCCTTACT GTTGAGCTAT ACCGTGGGAA GTCTGCTTGG CCCGTCATTT ACTGCCATGC TAATGCAGAA TTTCTCCGAT AATTTATTGT TTATCATGAT CGCCAGCGTA TCGTTTATCT ATTTGCTGAT GCTGCTTCGC AACGCCGGTC ATACGCCGAA ACCCGTTGCT CACGTGTAA
|
Protein sequence | MSTYTRPVML LLSGLLLLTL AIAVLNTLVP LWLAQEHMST WQVGVVSSSY FTGNLVGTLL TGYVIKRIGF NRSYYLASFI FAAGCAGLGL MIGFWSWLAW RFVAGVGCAM IWVVVESALM CSGTSRNRGR LLAAYMMVYY VGTFLGQLLV SKVSTELMSV LPWVTGLTLA GILPLLFTRV LNQQAENHDS TSITSMLKLR QARLGVNGCI ISGIVLGSLY GLMPLYLNHK GVSNASIGFW MAVLVSAGIL GQWPIGRLAD KFGRLLVLRV QVFVVILGSI AMLSQAAMAP ALFILGAAGF TLYPVAMAWA CEKVEHHQLV AMNQALLLSY TVGSLLGPSF TAMLMQNFSD NLLFIMIASV SFIYLLMLLR NAGHTPKPVA HV
|
| |