Gene PICST_40758 details


Gene Information

Locus tag: PICST_40758
Symbol: HST6
ID: 4836981
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2137078
End bp: 2140671
Gene Length: 3594 bp
Protein Length: 1197 aa
Translation table: 12
GC content: 38%
IMG OID: 640388296
Product: ATP-dependent permease
Protein accession: XP_001383172
Protein GI: 150864385
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
        [COG2274] ABC-type bacteriocin/lantibiotic exporters, contain an N-terminal double-glycine peptidase domain
TIGRFAM ID:
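
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon; both are assumptions, although the listed values are consistent with them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2137078
end_bp = 2140671

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```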


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00433632
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.648895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAAAGCGTAT TGATGTTCTT CCAGAAATCA GATATCCCCT ACTTCATATT CGCAACCATA 
AATGTATGCA TTGCTGCAGC TGCTACCCCT TTGCAAACAT TGGTATATGG TAAAATCTTT
GCCAGGCTTA GCAATTTCTA TTCTGGTCAC TATTCGGATT ATCAAATCTT CATTTCTGAT
GTTCGTCTAC TCTGTTTGCT CATCATGGCT ATAGGCGGTT GTAAGATGCT ATTCACATTC
TTGGGTGTTT CTTGTTGGAT GCAATTTGGA GAAAGACAAC AATTGCGAGC AAGAACAAAA
CTCTACAAAC TAATGCTCTC CAGAAAAATG GAATGGTTTG ACTCTGTCGA TGGTACATCT
GGTCAGATCT CACAGGTTAA TCGTTGTGTT GAAGAATTGA GATCTGGTTC TTCGGAAGTT
ATTGGTTTAT TGGTTCAATC CATTGCTAGT ATTATTGCCT TGCTGATAAC TTCACTCTAT
CATTCCTGGT CATTGACACT AGTTATCATG GCAACATCTC CAATTATGGC CATATTCTCG
TGGTTGTTTG GCAGATTGAC TTACAAGGCT GCAGATCAGG AAAACAGATT GAACGCTCTG
GCCGCCAAAA TATTAGATTG GTGTTTAAGC TGTGCGTCTG TTGTCAGAGT ATTCAATGGA
AAATATGTAG AGCAAGCAAA ATTCAACAGG TTGATCGACA TGTCAGCTGG GGTATACTTC
AAGTTGGCTG CAGCAATGGC TGGCAATTCA GGAATTCTAA GAACATTATC GATAATGATG
TTTGTCCAAG GGTTCTGGTT TGGAAACTAT ATGATTAGTA TTGGAGATCT TGATATAAAT
CAAGTGTTTA CCTGTTTCAC CTCTTGTTTA ATGTTGGGAG CTGCTATATC TGATCTAGCA
AATCTTCTAT CCAGTCTCAA TACCGCAAGA GCTGCAGCTT CAATGATTTC CAAATTTATG
TTGTCACAAG GTGAAGTTCT TGACGAAGAG AGCCATTATT TGCAGCCTGC CATTTGTAAA
GGAAGAATAG AGTTTAAAGA TGTCTGCTTT CATTATCCAG GAAGAGAAGG ACAAGTTCTT
AACAATCTTT CTGCTACAAT TGAGGCTCAA AAATTGACAT TTATCATTGG AAAATCTGGT
TCAGGAAAAT CAACTATTTC GCAACTATTG TTGAAAATGT ATAGCAATAA TGAAGGAACC
ATTTCGATCG ACGGACATGA TATTTCAACC TTGAGCAGAA ATTGGATAAC AGATTCTATT
ACAGTTATTG AACAATCAGT TACAATTTAC AATCAAACAC TCAGGAACAA TTTGGCGGTG
TCGGTTGTTA ATAAATATGG TTCGCTAAAT AGTGTTCCAT TATCGTTGAT ACTGGATGCG
CTTTCTTTTG CCCGTTTGAA TGAAGTCGTA GAAGGACTCG AAGATGGAAT AGACACCACT
ATTTCTTCTT CTACTCTTTC GGGTGGCCAG AAGCAGAGGG TTGCGATAGC TAGAGCAAAA
TTGAGGGACA CTCCAATCCT TATCCTTGAT GAATCATTAT GCGCATTAGA TAACAGATTG
AGAATTCCAT TATTCAACGA AATAAGAGAA TGGAGAAAGG GAAAGACAAC AATTGTAATT
GTTCACGAGC TAGACTATCT AAATGACAAT GACAATGTCA TTGTCTTGGA AAATGGTTCT
GTAAAGTACC ATGGTTTGTT CGAGGAAGTT AGGGATCAAG AGATTATTCT GAGGTTCAAT
GACGAAATTA GTTTTGAATC AAAGACAAGC CTGGTTATAG ACCCAATTCC TAGAAAATCA
GTAGAGTACA ATTACTTGAC CAATCCAGTT ATTCTCAAAG ACTTGGAGAA GAATGTTGGT
TCAAAAATTG CAGATGACAA TTCTGATGAG GTTTATTGTG TTCTTGCAAT ACTAAAGTAT
TGCTTTCATA CTATTGAAAC AAAACCATGG ATAGCAGTTG GATTACTATT CTCTGTCTTC
AGTGGAATTG CTGGTCCCGT ATTCTCCTTT AGTTTCTCCA GGTTACTCTC CAGCATGGTC
GAAGCTTCTG TGGGCATTAA CATCACTCAC AAACTCAAAT TGTGGTCTCT AATTGTTGTA
GGCATTTCTG TTGCTAGTGG CGGGTCTCAT TTTGTATCCA GTTTTATTCT TTCATTCTGT
AGTGAAAAGT GGATATTAAA ACTTCGAAAG CTATGTTTCC AAAAGATCAA TGAACAGGAT
ATGTCTTTTA TTGATTTGGA AAATACTAAG GCTTCTGAAA TTACAGCTTT ACTCATGAAT
GACTCTCGAG ATTTGCGGAA TCTTGTTTCT GAATTTCTCT CTTTGGCTCT CAACCTAGTT
GTCATGGTTC TTGTTGGAGT AACTTGGTCA ATTATTTCAG GTTGGAAGTT GGCTTTGGTT
GGGATATCTT TTGTACCTCT TGTTTTGTCA GTTACAAGAG TATACGGTAT TTTGTTGGAA
CTGTCTGAGA ATTGTTATAA GTCTACGGTT GCAAAATTGG AAAATCACAA TTACGAAACT
CTCACAGCTA TCAGATCCAT AACCATCCTT CGATTACAAA ACTACTTTAA AGAGGAGTTT
GATTGCAATA TTCAACGCAT TAGAAAAAGA GGTACTATAC GAGCACTTCA GACTGGCCTT
GGGCTTGGTC TTACCGAAGG TTGTAACGCC ATAGCAACAG GAACAATCTT GTACTACGGT
ATGGTACTCG CTGGTAAGAA TGAATATACC CAACAACAGC TTTTGCAAGT TATTACGATC
TTGACATTAA CGATGACCAA TGCTGCAGGC CTCATGAACC AACTCCCGGA AATTGCAAGA
GGACAAAGAG CTGGAACTAG AATTATCAAG TTGTTGAATA TGACGCCTTC AGAAGCCGAG
AATGATGGCG ACATCACGAT AAACAGAAAT TTTACCAATC CTTTACTACA ATTTGACAAT
CTTCAATTTG GTTATTCTGG GCAAGAGAAC GTTCTCAAGA ATGTCAGTTT CAATATAGAT
AAGGACGACA TTTGTGCTAT AGTAGGTAAA TCTGGAGGAG GAAAGTCCAC TATTGCTTCT
TTGCTTATGA GACTCTACAA TGCTGGTGAT AGAAGTATAT TTCTATCGAA CTATGACATA
ATGAGGCTAG ATATCGACCA TCTAAGAGAT ACGATTACAA TTGTTCCTCA GAATCCCAGC
TTCTTTGAAG GCACCATCTT CGATAACTTA ACTTACGGTA TCAATCCAAA GAAAAGAGTG
AATATGGATA AGATCTACAA AGTTTTGAAA TACGTCAATA TGTACTCATA TGTACTATCA
TTGCCTGAAG GTGTCCATAC GATAATCGGC GAGGGTTCTC ATTCGCTTCT CTCTGGTGGC
CAGAGCCAGA GGCTCTCAAT AGCCAGAGCA TTAATTAGGG ATCCACAGGT GTTGATCTTG
GACGAGTGTA CATCAAACTT GGATAAGGAA AATACTGACT TTATTATCAA TTTGATCAAC
ACTACTTTAC GAGGCAAAAT GACTATAATT TTGATTACTC ACGATGTTGA GGTAATGAAG
GTTGCTAATA GAATAATAAA GGTCAAGAAT GGTTACATAG TCGAGCAAAG GTAA
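
The GC content field can be recomputed from the gene sequence. The sketch below is a hedged illustration, not the method used by the database: the function name gc_fraction is hypothetical, and only the first 60-base line of the sequence is used as input, so its value is not expected to equal the whole-gene figure of 38%.

```python
def gc_fraction(sequence):
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)

# First line of the gene sequence from this record (60 bases).
first_line = (
    "AAAAGCGTAT TGATGTTCTT CCAGAAATCA "
    "GATATCCCCT ACTTCATATT CGCAACCATA"
)

print(f"GC content of first line: {gc_fraction(first_line):.1%}")
```

Passing the full 3594 bp sequence to the same function would give the whole-gene value.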
 
Protein sequence
KSVLMFFQKS DIPYFIFATI NVCIAAAATP LQTLVYGKIF ARLSNFYSGH YSDYQIFISD 
VRLLCLLIMA IGGCKMLFTF LGVSCWMQFG ERQQLRARTK LYKLMLSRKM EWFDSVDGTS
GQISQVNRCV EELRSGSSEV IGLLVQSIAS IIALSITSLY HSWSLTLVIM ATSPIMAIFS
WLFGRLTYKA ADQENRLNAS AAKILDWCLS CASVVRVFNG KYVEQAKFNR LIDMSAGVYF
KLAAAMAGNS GILRTLSIMM FVQGFWFGNY MISIGDLDIN QVFTCFTSCL MLGAAISDLA
NLLSSLNTAR AAASMISKFM LSQGEVLDEE SHYLQPAICK GRIEFKDVCF HYPGREGQVL
NNLSATIEAQ KLTFIIGKSG SGKSTISQLL LKMYSNNEGT ISIDGHDIST LSRNWITDSI
TVIEQSVTIY NQTLRNNLAV SVVNKYGSLN SVPLSLISDA LSFARLNEVV EGLEDGIDTT
ISSSTLSGGQ KQRVAIARAK LRDTPILILD ESLCALDNRL RIPLFNEIRE WRKGKTTIVI
VHELDYLNDN DNVIVLENGS VKYHGLFEEV RDQEIISRFN DEISFESKTS SVIDPIPRKS
VEYNYLTNPV ILKDLEKNVG SKIADDNSDE VYCVLAILKY CFHTIETKPW IAVGLLFSVF
SGIAGPVFSF SFSRLLSSMV EASVGINITH KLKLWSLIVV GISVASGGSH FVSSFILSFC
SEKWILKLRK LCFQKINEQD MSFIDLENTK ASEITALLMN DSRDLRNLVS EFLSLALNLV
VMVLVGVTWS IISGWKLALV GISFVPLVLS VTRVYGILLE SSENCYKSTV AKLENHNYET
LTAIRSITIL RLQNYFKEEF DCNIQRIRKR GTIRALQTGL GLGLTEGCNA IATGTILYYG
MVLAGKNEYT QQQLLQVITI LTLTMTNAAG LMNQLPEIAR GQRAGTRIIK LLNMTPSEAE
NDGDITINRN FTNPLLQFDN LQFGYSGQEN VLKNVSFNID KDDICAIVGK SGGGKSTIAS
LLMRLYNAGD RSIFLSNYDI MRLDIDHLRD TITIVPQNPS FFEGTIFDNL TYGINPKKRV
NMDKIYKVLK YVNMYSYVLS LPEGVHTIIG EGSHSLLSGG QSQRLSIARA LIRDPQVLIL
DECTSNLDKE NTDFIINLIN TTLRGKMTII LITHDVEVMK VANRIIKVKN GYIVEQR
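
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on one 60-base line of the gene sequence above, which starts in frame at base 421. The helper names build_codon_table and translate are hypothetical, and the table is built by hand from the standard code with the single CTG change instead of being taken from a library.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

def build_codon_table(ctg_as_serine):
    """Build a codon -> amino acid map; optionally apply the table 12 change."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    if ctg_as_serine:
        table["CTG"] = "S"  # the sense-codon difference in table 12
    return table

def translate(dna, table):
    """Translate in frame 0, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# Bases 421-480 of the gene sequence in this record.
fragment = (
    "ATTGGTTTAT TGGTTCAATC CATTGCTAGT "
    "ATTATTGCCT TGCTGATAAC TTCACTCTAT"
)

table_12 = translate(fragment, build_codon_table(ctg_as_serine=True))
table_1 = translate(fragment, build_codon_table(ctg_as_serine=False))
print(table_12)  # IGLLVQSIASIIALSITSLY
print(table_1)   # IGLLVQSIASIIALLITSLY
```

The table 12 result matches residues 141-160 of the protein sequence in this record (IGLLVQSIAS IIALSITSLY); the standard code would put a leucine at the CTG position instead.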