Gene PICST_87852 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_87852 
Symbol 
ID4837462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2174447 
End bp2175987 
Gene Length1541 bp 
Protein Length494 aa 
Translation table12 
GC content43% 
IMG OID640388777 
Productpredicted protein 
Protein accessionXP_001383179 
Protein GI150864390 
COG category[K] Transcription 
COG ID[COG5095] Transcription initiation factor TFIID, subunit TAF6 (also component of histone acetyltransferase SAGA) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.142846 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.141924 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCCA GTCTGAAGAT ACCCTTGTCG TACTCGTTGT GGTCTCCCCA TGACACCGTC 
AAAGATGCTG CGGAGCTGCT CAGTATCAAT TTGCCAGAGG AGGCAGCCAA GAATCTTGCC
ATGGATGTCG AGTACCGTAT CCATGAAATC TTGGAAACAG CCATTAAATT CATGAGACAC
TCGAAAAGAA AGTTGCTCAT GACTTCAGAC ATAAGTAACG CCTTAAAAGT ATTGAATATT
GAGCCCCTTT ATGGCTACGA CAATTCGCAA CCGTTGGTTT TCAAGGAAGC CTTGGTTGGT
GCTGGAGGAC AGACTTTGTA TTATATAGAT GATAATGAGA TTGAACTTGA AAAGTTGATC
AACCAGGAAT TACCAAAAGT GCCTCGTCAA ACCACCTTTA CTGCTCATTG GCTTGCTATT
GAAGGAGTCC AACCAATGGT GGCGCAGAAT CCTCTTCCTG CCGAAATCAA GAGTCTCCCT
CCGATAATAA GAGGTGCCAC TTCGTCCATT CTTGGAAACG ATATTTTATC GCTTGGCCAA
AACGGCGAAA ACAAGGACTC AGCACAAGGA TCTGCTGCTA CCAAAGACAA GAAGGCCACC
GAGAAGCAAT CAGAAGTCAA GCCGTTGGTC AAACACGTAT TGTCGAAGGA ACTCAAGTTG
TATTTTGACA AGGTTGTAGA AGTTCTCATT TCGACAGATC CAGAAAAGGA GAACTTGAAG
GTAGCTGCCT TGAACTCGTT GAAGAATGAC CCGGGTTTGC ACCAGTTGGT GCCTTATTTC
ATCCAATTCG TGGCCGAACA GATCACAAAC CAGTTGAGGA ACATTGATAT CTTGTCCACA
ATGTTGGAAG TGATTTCTGC TCTCGCTGAT AACAAGACTA TTTTCCTCGA TCCTTATGTC
CATGCTCTTA TGCCTTGTAT CTTGACGTTG TTATTGGCTA AGAGAATAGG TCCTGTGATC
AGAGAAACCT CCTCCAAAGA GTCTCAGGAC ACGTTGAAAA CACAATTGGC CGTTAGAGAA
TTCGCTGCCT TTTTGCTAGA ACACATTATC AAGGTTTACG GCTCTTCGTA CTCGACTTTG
AGACCCAGAG TGACAAGAAC GTTATTGCGT GCGTTGTTAG ATTCTACAAA ACCAGTAGGT
ACGCATTATG GTGCCCTTTT GGGGTTGAAG AATATGGGCA CAGAAGTTGT CAAGTTGGTC
TTGATAGGCA ACTTGAAAGT TTGGTGCAAG CTGGTAGTTG AGAACAGCGA TAAGACCGAC
TTTGAAAAGG ATATCTTGCT AAATGAATCG CTAGAAATTT TGGATAGCCT TAAAGTGGAA
TTACCCCAAG ATGAAGAGTC TATGGATACA GACTCCGGGT TTTCAGAAGA AATTAAGAAC
AAGTTGAAAG ACAGGATTGG TGATACTCTC GCCGACTTAG TTTTGCAACA CAGGGATGCA
AAAGACATTG TCAGAGGTCT ATTCTTTGGC GAAGTTGTCG TCTAGAACGT GTACAACAAT
TCAAAATAAT GATGTATATT AATGTATGAA TAGTAACTAT T
 
Protein sequence
MESSSKIPLS YSLWSPHDTV KDAAESLSIN LPEEAAKNLA MDVEYRIHEI LETAIKFMRH 
SKRKLLMTSD ISNALKVLNI EPLYGYDNSQ PLVFKEALVG AGGQTLYYID DNEIELEKLI
NQELPKVPRQ TTFTAHWLAI EGVQPMVAQN PLPAEIKSLP PIIRGATSSI LGNDILSLGQ
NGENKDSAQG SAATKDKKAT EKQSEVKPLV KHVLSKELKL YFDKVVEVLI STDPEKENLK
VAALNSLKND PGLHQLVPYF IQFVAEQITN QLRNIDILST MLEVISALAD NKTIFLDPYV
HALMPCILTL LLAKRIGPVI RETSSKESQD TLKTQLAVRE FAAFLLEHII KVYGSSYSTL
RPRVTRTLLR ALLDSTKPVG THYGALLGLK NMGTEVVKLV LIGNLKVWCK SVVENSDKTD
FEKDILLNES LEILDSLKVE LPQDEESMDT DSGFSEEIKN KLKDRIGDTL ADLVLQHRDA
KDIVRGLFFG EVVV