Gene Information |
Locus tag | PICST_35133 |
Symbol | HOL42 |
ID | 4837673 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1730476 |
End bp | 1731945 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388988 |
Product | major facilitator superfamily multidrug-resistance protein |
Protein accession | XP_001383098 |
Protein GI | 150864328 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
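The gene and protein lengths listed above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1730476
end_bp = 1731945

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.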
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.446888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.227667 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTTTAT CCAGCGATAT TGAAAAGCAC TTCGAAAGCC ACATTGAAAG CGTTCTGGCT GAAAACTTGC CAGAACTTTC TCGTGAACAT ATTGAATATC TTATGGAAAG ACATGGAACA ATTGAGTTAG ACCCACTTCC TTCCATGGAC CCAGAAGATC CACTCAACTG GCCAAATTGG AGAAAAAACT ATGAAATTTT GATTATTGCT TACCAATGCT TTATGGGTAC CTATTTTGCT GCTGGTCTTA CTCCAGCTTA TGAAGGTATG GCTGAAGAAT ACGGTATTGA TATTCCAACT GCTTCATACT TAACTTCTTC TCAAATTGCT ATAATGGGTG TTTTGCCTCT TTTTTGGGTT CCTATTATGA ATACTTTTGG TAGAAGGTCT ATTCTTTTGT ACTCTGCTAT TTGTGCCATA GGTTTAAATA TTGCTGGAGC TTACGTTACC ACATATGGAC AACAAATGGC TACTAGATGC TTGTATGCTT TCTTCAATGC CACCGCTTCC GCTTTGGGTA GTGCAGTGGT TTCTGATTTG TCTTTCAGTC ACGAAAGAGG TAAGAAGAAT GGTTGGTGGT CTCTTGGTTT TGTTGTTGGC ACACCAGCTG GTCCATTCCT TTCTGGTTTC ATTATGCAAT ACTCGACAAA AAAATGGATT TTCTTCATGT TCGCTATCAT GAACGCCATT CAGGTTGTGT TGTTCTTCTT TTCTAAAGAA ACCGTATACA ACAGAGGCGA CAAAATTGAA GAGCCATCCA AGCTTATCAA AATGATTGGT ATTTTCAGAC GTAATTCCAA GAAGATAAAT GTTGGTGCTT TTGTGAAACC ATTCAAGCAA GCTGCAAATT GGAGAATTAC TGTTGTCATC CTTGCTGCTA GTGTCACTTT CGCTTATGCT AACATCGTTC TCATTGTCGA GATGCCACAA ACCTTTGGTG CAATTTTCGA ACTCAGTCCT CAAGCATTAG GTTTGCACTA TATCGCTCTT ATTGTCGGTT CTTGTATTGG TGAGGGACTT GCCGGTCCCC TTTCCGATTG GTGGATGGCC AGAAGTATCA AAAAGAGAAA TGGCCAAAGA GTCATCGCTG ATAGATTGTT TATTTCCTAC AATGGGTATT TGCTAGTCAT TGTTGGTTTA GTTGTTTGGG GTGTTTATTT GGACAGGGCT CAACCTGGTC ATTGGAAAAT TAATGCTTTG ATTGGTTCAG CAATTATAGC AGCTGGTAAC AATATCGTTG CCACTGTTTT GATTACTTTC GCCATTGACT GCAACCCAGC ATGCGCTGCC GATATTGGTT TATATATGAC TTTTGTCAGA CAGGTCTATG GTTTTATCGC CCCATTCTAC TTCCCATACA TGTTTACCAA CTTAGGTTTC ATGGGTTCTG CAGGTTTGAT GATTGGACTT GTTTTCGTTT TTGGTTCACT TTCCACTGCA TTGGTACACA TTTTGAGTAG AAACAAATAA
Protein sequence | MSLSSDIEKH FESHIESVSA ENLPELSREH IEYLMERHGT IELDPLPSMD PEDPLNWPNW RKNYEILIIA YQCFMGTYFA AGLTPAYEGM AEEYGIDIPT ASYLTSSQIA IMGVLPLFWV PIMNTFGRRS ILLYSAICAI GLNIAGAYVT TYGQQMATRC LYAFFNATAS ALGSAVVSDL SFSHERGKKN GWWSLGFVVG TPAGPFLSGF IMQYSTKKWI FFMFAIMNAI QVVLFFFSKE TVYNRGDKIE EPSKLIKMIG IFRRNSKKIN VGAFVKPFKQ AANWRITVVI LAASVTFAYA NIVLIVEMPQ TFGAIFELSP QALGLHYIAL IVGSCIGEGL AGPLSDWWMA RSIKKRNGQR VIADRLFISY NGYLLVIVGL VVWGVYLDRA QPGHWKINAL IGSAIIAAGN NIVATVLITF AIDCNPACAA DIGLYMTFVR QVYGFIAPFY FPYMFTNLGF MGSAGLMIGL VFVFGSLSTA LVHILSRNK
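The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below translates only the first 60 bases of the gene sequence under both the standard code and table 12 to show where the two differ. The helper name `translate` and the hand-built codon tables are illustrative assumptions, not part of the record or of any library.

```python
# Build the standard genetic code (NCBI table 1) from its compact form:
# codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
STANDARD_AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
STANDARD = dict(zip(CODONS, STANDARD_AMINO))

# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD)
TABLE_12["CTG"] = "S"


def translate(dna, table):
    """Translate a DNA string codon by codon, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 bases of the gene sequence, copied from the record (spaces removed).
first_60 = (
    "ATGTCTTTAT CCAGCGATAT TGAAAAGCAC "
    "TTCGAAAGCC ACATTGAAAG CGTTCTGGCT"
).replace(" ", "")

print(translate(first_60, STANDARD))  # CTG read as Leu
print(translate(first_60, TABLE_12))  # CTG read as Ser
```

The table 12 translation matches the first 20 residues of the protein sequence given above (MSLSSDIEKH FESHIESVSA); the standard code gives a leucine at position 19 instead.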