Gene Information |
Locus tag | PICST_31127 |
Symbol | HOL41 |
ID | 4837772 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1832831 |
End bp | 1834300 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640389087 |
Product | major facilitator superfamily multidrug-resistance protein |
Protein accession | XP_001383641 |
Protein GI | 150864702 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTTTGG TTAGCGATAT TGAAAAGCAC TCTCAGAACC ATATCGAAAC TGTTCTGGAT GAAAATTTGT CAGAACTTTC TCGTGAACAT ATTGATTATC TTATGCAAAG ACATGGGACA GTTGACTTAG ATCCTCTACC TTCTATGGAC CCAGAAGATC CACTCAACTG GCCCACTTGG AGAAAGAACT ACGAAATTTT GATTATCGCC TTCCAATGTT TTTTGGGTAA TTATTTTTCT GCGTCTCTTA CTCCCGCTTA TGAAGATATG GCTGTACAGT ATGGTGTTGA TGTTCCAACA TCTTCTTACT TAACTTCTGC TCAAATTGCT GTTGTTGGTG TTTTGCCATT TTTTTGGGTT CCTATTATGA ATACTTACGG AAGAAGGTCC ATTCTCATGT ATACTGCTTT TTGTGCTATT GGCATAAATA TTGCTGGTGC TTATGTCAAA ACCTACGGTC AACAAATGGC TACAAGATGT TTGTATGCTT TTTGTGCTGC TACAGCATCT GCTTTGGGTA GCGCAGTGGT TTCTGACTTG TCTTTTAGTC ACGAAAGAGG TATGAAAAAT GGTTGGTGGT CTCTTGGTTT TGTTCTTGGT ACTCCATCTG GTCCCTTTAT CTCCGGTTTC ATAATGCAAT ACTCTACTAA GAAATGGATC TTCTTCATGT TCGCTATATT CAACGCAATT CAAGTTGTGT TGTTTTTCTT CTCAAAGGAA ACGGTCTACA ACAGAGGCGA TAAATTAGAA GAACCAAGCG GTATAATCAA ATTGATTGGG ATTTACAGAC GTAATAACAA TAAGGTCACT TTAGGATCTT TTATTCTGCC ATTAAAGCAG GGCTTAAACT GGAGAATAGG TACTATTGTT ATTGCTCTCA GTGTTACTTT TGCGTACGCC AATATTGTTC TCATCGTTGA AATGCCACAA ACTTTTGGTA CTATATTCGA ACTCAGTCCG CAAGCACTAG GTTTACACTA CATTGCTCTT ATTCTTGGTT CCGTCATAGG TGAAGCGCTT GCTGGTTCTC TTTCCGATTG GTGGATGGCT AAAAGTATCA AAAGACGAGG AGGAAAGAGA ATCATTGCTG ACCGTTTGTA TGTCTCTTAC AATGGATTCT TGTTGGTCAT CATTGGTTTG GTTGTCTGGG GTGTTTATTT GGACAGAGCT AGGCCTGATC ACTGGAGTAT TAGTGCATTG ATTGGAGCAG CCATTATGGC AGTTGGAAAC AACATTGTTG CAACAGTCTT AATTACTTAT TCGATTGATT GTAATCCTGC ATATGCTTCT GATATTGGTT TATTTATTAC CATTGTAAGA CAAGTCTATG GTTTTGTTGC TCCATTCTAC TTTCCAAGCA TGTTTACCAA TTTGGGCTTC ATTGGTTCCG CTGGTTTGAT GATTGGCCTT GTATTCGTCT TCGGTACGAT TGTTACCTCG TTCGTCCACT TCATGAGTAG AAAGGTTTAG
|
Protein sequence | MSLVSDIEKH SQNHIETVSD ENLSELSREH IDYLMQRHGT VDLDPLPSMD PEDPLNWPTW RKNYEILIIA FQCFLGNYFS ASLTPAYEDM AVQYGVDVPT SSYLTSAQIA VVGVLPFFWV PIMNTYGRRS ILMYTAFCAI GINIAGAYVK TYGQQMATRC LYAFCAATAS ALGSAVVSDL SFSHERGMKN GWWSLGFVLG TPSGPFISGF IMQYSTKKWI FFMFAIFNAI QVVLFFFSKE TVYNRGDKLE EPSGIIKLIG IYRRNNNKVT LGSFISPLKQ GLNWRIGTIV IALSVTFAYA NIVLIVEMPQ TFGTIFELSP QALGLHYIAL ILGSVIGEAL AGSLSDWWMA KSIKRRGGKR IIADRLYVSY NGFLLVIIGL VVWGVYLDRA RPDHWSISAL IGAAIMAVGN NIVATVLITY SIDCNPAYAS DIGLFITIVR QVYGFVAPFY FPSMFTNLGF IGSAGLMIGL VFVFGTIVTS FVHFMSRKV
|
| |
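The fields above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record: it recomputes the gene length from the start and end coordinates (assuming they are 1-based and inclusive), derives the protein length by dropping the stop codon, and translates the first 60 bases of the gene sequence with NCBI translation table 12 (alternative yeast nuclear code, where CTG encodes serine instead of leucine). The names `CODON_TABLE_12` and `translate` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# NCBI translation table 12 (alternative yeast nuclear code).
# Codons are ordered with bases T, C, A, G; the only difference from the
# standard code is CTG -> S (serine) instead of L (leucine).
BASES = "TCAG"
AMINO_ACIDS_T12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS_T12)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace allowed) up to the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates from the record (assumed 1-based, inclusive).
start_bp, end_bp = 1832831, 1834300
gene_length = end_bp - start_bp + 1      # bases in the CDS
protein_length = gene_length // 3 - 1    # codons minus the stop codon

# First 60 bases of the gene sequence, copied from the record.
first_bases = "ATGTCTTTGG TTAGCGATAT TGAAAAGCAC TCTCAGAACC ATATCGAAAC TGTTCTGGAT"

print(gene_length, protein_length, translate(first_bases))
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields, and the translated fragment matches the start of the listed protein sequence, including the serine at position 19 that comes from a CTG codon.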