Gene PICST_31127 details


Gene Information

Locus tag: PICST_31127
Symbol: HOL41
ID: 4837772
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1832831
End bp: 1834300
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 12
GC content: 39%
IMG OID: 640389087
Product: major facilitator superfamily multidrug-resistance protein
Protein accession: XP_001383641
Protein GI: 150864702
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
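
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1832831
end_bp = 1834300

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)     # 1470
print(protein_length)  # 489
```

Both results agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.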


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTGG TTAGCGATAT TGAAAAGCAC TCTCAGAACC ATATCGAAAC TGTTCTGGAT 
GAAAATTTGT CAGAACTTTC TCGTGAACAT ATTGATTATC TTATGCAAAG ACATGGGACA
GTTGACTTAG ATCCTCTACC TTCTATGGAC CCAGAAGATC CACTCAACTG GCCCACTTGG
AGAAAGAACT ACGAAATTTT GATTATCGCC TTCCAATGTT TTTTGGGTAA TTATTTTTCT
GCGTCTCTTA CTCCCGCTTA TGAAGATATG GCTGTACAGT ATGGTGTTGA TGTTCCAACA
TCTTCTTACT TAACTTCTGC TCAAATTGCT GTTGTTGGTG TTTTGCCATT TTTTTGGGTT
CCTATTATGA ATACTTACGG AAGAAGGTCC ATTCTCATGT ATACTGCTTT TTGTGCTATT
GGCATAAATA TTGCTGGTGC TTATGTCAAA ACCTACGGTC AACAAATGGC TACAAGATGT
TTGTATGCTT TTTGTGCTGC TACAGCATCT GCTTTGGGTA GCGCAGTGGT TTCTGACTTG
TCTTTTAGTC ACGAAAGAGG TATGAAAAAT GGTTGGTGGT CTCTTGGTTT TGTTCTTGGT
ACTCCATCTG GTCCCTTTAT CTCCGGTTTC ATAATGCAAT ACTCTACTAA GAAATGGATC
TTCTTCATGT TCGCTATATT CAACGCAATT CAAGTTGTGT TGTTTTTCTT CTCAAAGGAA
ACGGTCTACA ACAGAGGCGA TAAATTAGAA GAACCAAGCG GTATAATCAA ATTGATTGGG
ATTTACAGAC GTAATAACAA TAAGGTCACT TTAGGATCTT TTATTCTGCC ATTAAAGCAG
GGCTTAAACT GGAGAATAGG TACTATTGTT ATTGCTCTCA GTGTTACTTT TGCGTACGCC
AATATTGTTC TCATCGTTGA AATGCCACAA ACTTTTGGTA CTATATTCGA ACTCAGTCCG
CAAGCACTAG GTTTACACTA CATTGCTCTT ATTCTTGGTT CCGTCATAGG TGAAGCGCTT
GCTGGTTCTC TTTCCGATTG GTGGATGGCT AAAAGTATCA AAAGACGAGG AGGAAAGAGA
ATCATTGCTG ACCGTTTGTA TGTCTCTTAC AATGGATTCT TGTTGGTCAT CATTGGTTTG
GTTGTCTGGG GTGTTTATTT GGACAGAGCT AGGCCTGATC ACTGGAGTAT TAGTGCATTG
ATTGGAGCAG CCATTATGGC AGTTGGAAAC AACATTGTTG CAACAGTCTT AATTACTTAT
TCGATTGATT GTAATCCTGC ATATGCTTCT GATATTGGTT TATTTATTAC CATTGTAAGA
CAAGTCTATG GTTTTGTTGC TCCATTCTAC TTTCCAAGCA TGTTTACCAA TTTGGGCTTC
ATTGGTTCCG CTGGTTTGAT GATTGGCCTT GTATTCGTCT TCGGTACGAT TGTTACCTCG
TTCGTCCACT TCATGAGTAG AAAGGTTTAG
 
Protein sequence
MSLVSDIEKH SQNHIETVSD ENLSELSREH IDYLMQRHGT VDLDPLPSMD PEDPLNWPTW 
RKNYEILIIA FQCFLGNYFS ASLTPAYEDM AVQYGVDVPT SSYLTSAQIA VVGVLPFFWV
PIMNTYGRRS ILMYTAFCAI GINIAGAYVK TYGQQMATRC LYAFCAATAS ALGSAVVSDL
SFSHERGMKN GWWSLGFVLG TPSGPFISGF IMQYSTKKWI FFMFAIFNAI QVVLFFFSKE
TVYNRGDKLE EPSGIIKLIG IYRRNNNKVT LGSFISPLKQ GLNWRIGTIV IALSVTFAYA
NIVLIVEMPQ TFGTIFELSP QALGLHYIAL ILGSVIGEAL AGSLSDWWMA KSIKRRGGKR
IIADRLYVSY NGFLLVIIGL VVWGVYLDRA RPDHWSISAL IGAAIMAVGN NIVATVLITY
SIDCNPAYAS DIGLFITIVR QVYGFVAPFY FPSMFTNLGF IGSAGLMIGL VFVFGTIVTS
FVHFMSRKV
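
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. This explains why the first line of the gene sequence, which contains CTG at codon 19, corresponds to S at position 19 of the protein sequence. The following is a minimal sketch of that translation using only the Python standard library. The names `CODON_TABLE` and `translate` are illustrative, and the amino-acid string follows the compact TCAG-ordered layout used for NCBI genetic code tables.

```python
from itertools import product

# Codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"

# Amino acids for translation table 12 in that order.
# It differs from the standard code only at CTG, which encodes S instead of L.
AAS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AAS_TABLE_12)
}


def translate(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from the record.
first_line = "ATGTCTTTGG TTAGCGATAT TGAAAAGCAC TCTCAGAACC ATATCGAAAC TGTTCTGGAT"
print(translate(first_line))  # MSLVSDIEKHSQNHIETVSD
```

The output matches the first 20 residues of the protein sequence above. Under the standard code the same input would give L instead of S at position 19.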