Gene Information |
Locus tag | PICST_82843 |
Symbol | HOL5 |
ID | 4837726 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 592129 |
End bp | 594239 |
Gene Length | 2111 bp |
Protein Length | 656 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389041 |
Product | member of major facilitator superfamily multidrug-resistance protein |
Protein accession | XP_001383395 |
Protein GI | 126133741 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
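
The coordinate fields above can be cross-checked against the reported gene length. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it is consistent with the 2111 bp figure). The function name `gene_length` and the variable names are illustrative only and are not part of the record. The coding-length comparison further assumes a single stop codon after the 656 codons.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 592129
END_BP = 594239
PROTEIN_LENGTH_AA = 656

length_bp = gene_length(START_BP, END_BP)

# Nucleotides needed to encode the protein plus one stop codon (assumption).
coding_nt = PROTEIN_LENGTH_AA * 3 + 3

print(length_bp)              # 2111
print(coding_nt)              # 1971
print(length_bp - coding_nt)  # 140
```

Under these assumptions the reported span is 140 nt longer than the coding region alone, which would be expected if the gene coordinates include flanking untranslated sequence.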
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.451266 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.203181 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCTATTTGCA TAGCTTTGTC TCTTGCCCTT CAAAATGACT TCCCGTCTAG GCATCCGACA AGCGCGGGTC GAAGAAGTCA CTGGGACCAT CATAATGATG GATGATCCAG ATGACATGCC ACTTGATTCT ACTTCGGCTC AAGATGGCTC GCTTCACAGA GAGTCACTTA CCATCAAGAA AACTGCCACC GGTGTGATTC TCAATCCACA ACCACACGAC TCGCCCAACG ATCCTCTCAA CTGGTCTGTA TGGAAAAGAG ACTTGTGTCT ATTAATCATT GGATTTCAGT CATTTCTAGG AGGCGGCCAG TCTCCGTTGC TTGCTGCTGG TATGAGTTCA TTAGCAACTG AGTTTCACAA ATCTACAGCA ACAGTTGCCT ATCTTGTAGG AGGGTTCATG TTGTCTTTAG GTGTAGGTTC CGTATTTGCC AGTCCTTCTG CCGTTCTCTT CGGCAAGAGA TTGGTGTATT TGGTTGGGAT CCTTGTATTT CTTCTCGGCT CTGTCTGGGG AGCCTCGTCT AAATCGTTTG GCAGCTTAAT GGGTGCCAGA GTGATGACCG GGTTTGGTGC GTCTCCTACC GAATGTTTGC CCAGTTCTAC TATTGCCGAA ATTTACTTTG CTCATGAAAG AGCCTATAGA GTCGGCATAT ACACCATGTT GATGTTGGGA GGCAAAAATA TCATTCCTTT GCTTTCGGGT TTGGTGTTCC AGAACCTAGA CAGACACTGG CTATTCTGGA TCCAAGCAAT GTTTTTGGGT GCAAATCTCA TTCTCACCTT CTTGTTTGTT CCAGACACTT TCTGGGACAG ATCGCCTACA CCTAATAAGA GATCTCAAGA GGAAACAGAA GCTGCCATGA AGGTGAAATC GTATCACCCA CCAGAGCAAA GACCAAATGC ATATGCTTTG CAGCGTCCTT CACAAATGAA TGTGTTGGAT TTGGAAAGCA TTTCATCCAC AGTAAACCCC ACTAATTCTC AAACAGGACA GCCCATCGGC ATAACCGAAG CCGAGACTAC TCATTACAAT ATACCCAAAC AAACCTTTCG TCAGGATTTG GCCATTTATT CTGGTAGACA CACAAAAGAC AGTTGGTGGA TGGTGGCTCT AAGACCCTTC TTCTTGTACA CATATCCCTC TGTTCTCTTT GGTTCGTTTA TATACTCCTT CGCTGTAGTG TGGTTGATTG TGATTTCTGA AACTATCTCA GAAATTTTTA GAGGCGAAGG GTATGGATAT TCCCAAGGAA CAGTTGGATT ATTCTATGTT TCTCCATTTG TTGGAGGTAT CTTAGGCTCA CTCACTGCTG GTTTAGTTAG TGATAGATTG AGTCGATTCC TAGTGAGCAA GAATAAGGGA GTATATGAAC CTGAGTTCAG ATTGTTCATG ATCATACCTT CTACATTTTT CATAGCTTTT GGTCTTATGG GATTTGGATG GTCTGCGCAA GACAAGGATT TGTGGATTGG ACCCGTTATC TTCTTTGGAT GTTTGAGTTT TGGAAGTTCC ATGGCAAGTA CCACTGCGAT AACATTCACA GTGGATTCTT ATAAGGTTTT CTCTGCTGAA GCATTAGTAT CCTTCAACTT CTTAAAAAAC TTGATTGGTT TCATATTTTC GTTGTTCAAT AACAACTTCG CTAATGCCCG TGGAAATCGT ACTGCTTTTG TAACTTATGG AGCAGTGCAA ATCTTTGTGA GTTTATGGGC AATCCCATTG TACATTTACG GGAAACGTTT GAGAAGTTGG ACTGATAAGA AGGAATTCTT GAAAAAGTTG TACCATGTTG ATAATATTCC 
TAGAAATGTC TACAAGGAAA GCATGGCCAC CAGTGCTGCG AATATCTCCT CGCCCAAGGA AGCAAGCACT CCATCAGGGC CAATCTCTTC TTCACTCTCT GTTGACGACA ACACGGTTCA TAATAGTGAG GTGGAAAAAG TCAGTAGCAA TAAGGACGAT AGTGTTACGT CTGAAGATAC CGCACCACCA CAAAAGAGCA TTTGAATAAC AGTCAAATCA GCCTGGAATC TTACGAAATG ACGATTGCAT TAATTCTGTA CACTAGTCTA TTAAAATACG CTTTATAGAA AAATAACTAT ACTTGTTTCA A
|
Protein sequence | MTSRLGIRQA RVEEVTGTII MMDDPDDMPL DSTSAQDGSL HRESLTIKKT ATGVILNPQP HDSPNDPLNW SVWKRDLCLL IIGFQSFLGG GQSPLLAAGM SSLATEFHKS TATVAYLVGG FMLSLGVGSV FASPSAVLFG KRLVYLVGIL VFLLGSVWGA SSKSFGSLMG ARVMTGFGAS PTECLPSSTI AEIYFAHERA YRVGIYTMLM LGGKNIIPLL SGLVFQNLDR HWLFWIQAMF LGANLILTFL FVPDTFWDRS PTPNKRSQEE TEAAMKVKSY HPPEQRPNAY ALQRPSQMNV LDLESISSTV NPTNSQTGQP IGITEAETTH YNIPKQTFRQ DLAIYSGRHT KDSWWMVALR PFFLYTYPSV LFGSFIYSFA VVWLIVISET ISEIFRGEGY GYSQGTVGLF YVSPFVGGIL GSLTAGLVSD RLSRFLVSKN KGVYEPEFRL FMIIPSTFFI AFGLMGFGWS AQDKDLWIGP VIFFGCLSFG SSMASTTAIT FTVDSYKVFS AEALVSFNFL KNLIGFIFSL FNNNFANARG NRTAFVTYGA VQIFVSLWAI PLYIYGKRLR SWTDKKEFLK KLYHVDNIPR NVYKESMATS AANISSPKEA STPSGPISSS LSVDDNTVHN SEVEKVSSNK DDSVTSEDTA PPQKSI
|
| |
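
The sequences above are displayed in space-separated blocks of ten characters and may wrap across lines, so they need to be normalized before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-content calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the short example string is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two ten-base blocks of the gene sequence, used only as a small example.
example_raw = "CCTATTTGCA TAGCTTTGTC"
example_seq = clean_sequence(example_raw)

print(len(example_seq))         # 20
print(gc_percent(example_seq))  # 40.0
```

Applying the same two functions to the complete gene sequence would allow its length and GC fraction to be compared with the Gene Length and GC content fields of the record.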