Gene PICST_35133 details

Gene Information

Locus tag: PICST_35133
Symbol: HOL42
ID: 4837673
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1730476
End bp: 1731945
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 12
GC content: 40%
IMG OID: 640388988
Product: major facilitator superfamily multidrug-resistance protein
Protein accession: XP_001383098
Protein GI: 150864328
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
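
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1730476
end_bp = 1731945

# Assumption: 1-based, inclusive coordinates, so both endpoints are counted.
gene_length = end_bp - start_bp + 1

# Assumption: the gene is unspliced (as the record states) and its last
# codon is a stop codon, which does not contribute an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1470 bp) and Protein Length (489 aa).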


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.446888
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.227667
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTTTAT CCAGCGATAT TGAAAAGCAC TTCGAAAGCC ACATTGAAAG CGTTCTGGCT 
GAAAACTTGC CAGAACTTTC TCGTGAACAT ATTGAATATC TTATGGAAAG ACATGGAACA
ATTGAGTTAG ACCCACTTCC TTCCATGGAC CCAGAAGATC CACTCAACTG GCCAAATTGG
AGAAAAAACT ATGAAATTTT GATTATTGCT TACCAATGCT TTATGGGTAC CTATTTTGCT
GCTGGTCTTA CTCCAGCTTA TGAAGGTATG GCTGAAGAAT ACGGTATTGA TATTCCAACT
GCTTCATACT TAACTTCTTC TCAAATTGCT ATAATGGGTG TTTTGCCTCT TTTTTGGGTT
CCTATTATGA ATACTTTTGG TAGAAGGTCT ATTCTTTTGT ACTCTGCTAT TTGTGCCATA
GGTTTAAATA TTGCTGGAGC TTACGTTACC ACATATGGAC AACAAATGGC TACTAGATGC
TTGTATGCTT TCTTCAATGC CACCGCTTCC GCTTTGGGTA GTGCAGTGGT TTCTGATTTG
TCTTTCAGTC ACGAAAGAGG TAAGAAGAAT GGTTGGTGGT CTCTTGGTTT TGTTGTTGGC
ACACCAGCTG GTCCATTCCT TTCTGGTTTC ATTATGCAAT ACTCGACAAA AAAATGGATT
TTCTTCATGT TCGCTATCAT GAACGCCATT CAGGTTGTGT TGTTCTTCTT TTCTAAAGAA
ACCGTATACA ACAGAGGCGA CAAAATTGAA GAGCCATCCA AGCTTATCAA AATGATTGGT
ATTTTCAGAC GTAATTCCAA GAAGATAAAT GTTGGTGCTT TTGTGAAACC ATTCAAGCAA
GCTGCAAATT GGAGAATTAC TGTTGTCATC CTTGCTGCTA GTGTCACTTT CGCTTATGCT
AACATCGTTC TCATTGTCGA GATGCCACAA ACCTTTGGTG CAATTTTCGA ACTCAGTCCT
CAAGCATTAG GTTTGCACTA TATCGCTCTT ATTGTCGGTT CTTGTATTGG TGAGGGACTT
GCCGGTCCCC TTTCCGATTG GTGGATGGCC AGAAGTATCA AAAAGAGAAA TGGCCAAAGA
GTCATCGCTG ATAGATTGTT TATTTCCTAC AATGGGTATT TGCTAGTCAT TGTTGGTTTA
GTTGTTTGGG GTGTTTATTT GGACAGGGCT CAACCTGGTC ATTGGAAAAT TAATGCTTTG
ATTGGTTCAG CAATTATAGC AGCTGGTAAC AATATCGTTG CCACTGTTTT GATTACTTTC
GCCATTGACT GCAACCCAGC ATGCGCTGCC GATATTGGTT TATATATGAC TTTTGTCAGA
CAGGTCTATG GTTTTATCGC CCCATTCTAC TTCCCATACA TGTTTACCAA CTTAGGTTTC
ATGGGTTCTG CAGGTTTGAT GATTGGACTT GTTTTCGTTT TTGGTTCACT TTCCACTGCA
TTGGTACACA TTTTGAGTAG AAACAAATAA
 
Protein sequence
MSLSSDIEKH FESHIESVSA ENLPELSREH IEYLMERHGT IELDPLPSMD PEDPLNWPNW 
RKNYEILIIA YQCFMGTYFA AGLTPAYEGM AEEYGIDIPT ASYLTSSQIA IMGVLPLFWV
PIMNTFGRRS ILLYSAICAI GLNIAGAYVT TYGQQMATRC LYAFFNATAS ALGSAVVSDL
SFSHERGKKN GWWSLGFVVG TPAGPFLSGF IMQYSTKKWI FFMFAIMNAI QVVLFFFSKE
TVYNRGDKIE EPSKLIKMIG IFRRNSKKIN VGAFVKPFKQ AANWRITVVI LAASVTFAYA
NIVLIVEMPQ TFGAIFELSP QALGLHYIAL IVGSCIGEGL AGPLSDWWMA RSIKKRNGQR
VIADRLFISY NGYLLVIVGL VVWGVYLDRA QPGHWKINAL IGSAIIAAGN NIVATVLITF
AIDCNPACAA DIGLYMTFVR QVYGFIAPFY FPYMFTNLGF MGSAGLMIGL VFVFGSLSTA
LVHILSRNK
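
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below shows how the gene sequence maps to the protein sequence under that table, using only the first row of the gene sequence as input. The function name `translate` and the constants are illustrative, and the table string is the NCBI amino acid string for code 12 written in TCAG codon order.

```python
# NCBI genetic code 12 (alternative yeast nuclear code) in TCAG codon order.
# It differs from the standard code at CTG, which encodes Ser instead of Leu.
BASES = "TCAG"
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def translate(dna, table=TABLE_12):
    """Translate a DNA string codon by codon, ignoring spaces and newlines."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Stop at the last complete codon if the length is not a multiple of 3.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(table[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First row of the gene sequence, copied from the record (60 bases).
first_row = "ATGTCTTTAT CCAGCGATAT TGAAAAGCAC TTCGAAAGCC ACATTGAAAG CGTTCTGGCT"
print(translate(first_row))  # MSLSSDIEKHFESHIESVSA
```

The nineteenth codon of that row is CTG, which yields the S in FESHIESVSA. Under the standard code it would be translated as L.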