Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1359 |
Symbol | |
ID | 6146574 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1348444 |
End bp | 1349817 |
Gene Length | 1374 bp |
Protein Length | 457 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641616237 |
Product | major facilitator transporter |
Protein accession | YP_001743417 |
Protein GI | 170680826 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000544866 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0171999 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC ATTGTGATTG GTATTTCGAT GGCTGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT CAAATCGCCA TTGTCATCTC CCTGCTCTCA TTTTCGTTTC TGGGCGATAT GTTTGGCTAT CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGTAGA GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTGGGT ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATTACT GCATTGAGTG GTTTCGCTCA GAGGCAATCG CTGACATTGA TTGGTGCGGA ACTGGTGGTA ATGGTTGTCG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTTCCCTTTT ACCTGCAAAC CGTGCTCGGG CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC AATGGTGATG GCTCCACTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG TGCTTTAGGG TTATTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCAGCGTC ACCTGCGGAT ATCAATATTA TCTGGCCGAT GATCTTATGT GGTGCTGGAT TTGGCTTGTT CCAGTCACCC AATAACCACA CCATTATTAC CTCCGCTCCT CGCGAACGTA GCGGTGGAGC CAGTGGCATG TTAGGGACGG CTCGTCTTCT GGGTCAGAGT AGCGGCGCGG CTCTGGTAGC GCTGATGCTA AATCAGTTTG GTGATAATGG TACGCACGTC TCGCTGATGG CTGCGGCTAT TCTGGCGGTG ATTGCAGCCT GTGTCAGTGG TTTACGTATC ACTCAGCCAC GATCCATGGC ATAA
|
Protein sequence | MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQRQS LTLIGAELVV MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSMA
|
| |