Gene PICST_16323 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_16323 
SymbolCTA8 
ID4838236 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1024255 
End bp1026132 
Gene Length1878 bp 
Protein Length599 aa 
Translation table12 
GC content42% 
IMG OID640389551 
ProductHeat shock transcription factor 
Protein accessionXP_001383484 
Protein GI150864599 
COG category[K] Transcription 
COG ID[COG5169] Heat shock transcription factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.173191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0871257 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAACT TCGAGGATAA GCACGATCCT ATCGTTGAAC TTCTCAACTC GGGTGCTGGA 
GAGCATGACG AAGATATCAA AGTGATTAGT AGAGATCCGT TCAGCGATAA CTTCGTAGCT
ACACCAACCA CAGCTCAGCT CTTGGATCCC AGCTATCACA TAAAGGACGA GGAAAAAAGC
AATACGAATA AGAATAGCAA CGCAGGCAAC AATAATAGCA GTATCAGCAA TATCAACAGC
ATCAACAGTA CTAGTAATAT CCATCACAAC AGTGTAAACC ACAGTAATAA TAGCAATAAT
AATGATAATT ATGACGATTT TCATACTTTC CATACTGGGG CTACCCCCAT AGAGCCAACC
GGACTAACAC CAATGCTTGC ACCAGCACCT GATTTTGTGC CGCATGCCAA CAACCAGATC
GCCTACATCA ACAAATCCAG CTTTTTGAAC CCTTTGCCTC CTATGGCTCC ACTTTCAGAA
CTGGTCCTTC CCAACGGAAT CAATATAAAC CCTCTAGCAT TGAACGGATC GAGTAAACTT
TCTAACGATG ACAAGAAAAC TATAACCAAT AACAGTAGTG CTGTGAGTAG CTCTAAAAGG
AAAAAGGAAC CAGCTGGACC CAAAACTAGA CCTCTTTTTG TTACTAAGAT ATGGTCTATG
GTAAACGATC CAGACAACCA GGAATATATC CGTTGGAACG AAGATGGAAA GACTTTTCAG
GTTTTCCATA GAGAAGAATT CATGAAGTAC ATATTACCCA AATACTTCAA GCACAGTAAT
TTCGCTTCCT TTGTCCGTCA ATTGAACATG TACGGATGGC ACAAAGTACA AGATATCAAC
AGTGGAACTT TCAACCTGGG AAAGGGAGAT AAAGGCATGG AAGAAGTGTG GCAATTTGAG
AATCCAAATT TCATTAGAGA TAGAGAAGAT TTATTGGACA AGATTATTAG AAACAAGAGT
GTTTCCCAAG AGAGTGAACA CGACAACAAC GCCGTCAACT TCCAGATCCT CCTCAATGAA
TTAGACAGCA TCAAGATGAA CCAATTGGCT ATTGGTGAAG ATTTGCGTCG CGTCAGAAAA
GATAACAAAA CATTATGGAA CGAGAATTAC ATGACAAGAG AAAGACACCA ACAGCAAGCC
CAAACTCTTG ACAGAATCTT GAAGTTCTTA GCTGCTGTTT ATGGTAACAA TACTGGCAAG
ATTCTAGAAG TAGATAATGG TCCAGAGTAC AATGATGGTC AAATGACTGC CTACAATCCT
GGCCAGCCCC CATCGCCTAA CCCCTACGCT CAACAAATGT ACGCTCCAAT ACAGAAGCCA
ATGCTTATGC TCACGAATCA AGCACATGGA CCTAGTCCGT CTGGTTCTAC GTATAAATCT
CCTAGACAAA CTTCAATATC TAGTTCTAAC AACAGGGATC ATAGAGATAG TTCCATTACG
GACTCGGGTT CGATCGAAGA AATTATAAGA TCCTACGGAA ACACTCCCAG AAATGGCGAG
AGAAGTGGCG ATGCTGCGAA CAATGTGAAT AGGATATATC AACAGATTAT CAACCAGGAG
CCTTCGGCTG CTTCCCCTAG ACATTACTTC CCTGAGTTGA ACAACAGTGG AATGCCACAG
AGTCCTTATG TTGGCCCCAG CACTCCATCG AACCAACAAT TTTTACGAGT GGCAACTCCT
GATAATGCCA ATGACTTAAT GAATGGTTTA GAGCAAAATA TTTACAAGCA GGGACAGCTG
ATTCAGCAAG TTCAAGACTG GATCCAGAAA CTTGCTTCAC AGCAGCAGCA ACAACAGGCC
TCAATTAACG AAATTAATGA CGAAATCAAG CACGATTTAG ACAGTTTTGA TGTTAACGAG
TTTTTGAATA ACACTAAC
 
Protein sequence
MANFEDKHDP IVELLNSGAG EHDEDIKVIS RDPFSDNFVA TPTTAQLLDP SYHIKDEEKS 
NTNKNSNAGN NNSSISNINS INSTSNIHHN KPTGLTPMLA PAPDFVPHAN NQIAYINKSS
FLNPLPPMAP LSESVLPNGI NINPLALNGS SKLSNDDKKT ITNNSSAVSS SKRKKEPAGP
KTRPLFVTKI WSMVNDPDNQ EYIRWNEDGK TFQVFHREEF MKYILPKYFK HSNFASFVRQ
LNMYGWHKVQ DINSGTFNSG KGDKGMEEVW QFENPNFIRD REDLLDKIIR NKSVSQESEH
DNNAVNFQIL LNELDSIKMN QLAIGEDLRR VRKDNKTLWN ENYMTRERHQ QQAQTLDRIL
KFLAAVYGNN TGKILEVDNG PEYNDGQMTA YNPGQPPSPN PYAQQMYAPI QKPMLMLTNQ
AHGPSPSGST YKSPRQTSIS SSNNRDHRDS SITDSGSIEE IIRSYGNTPR NGERSGDAAN
NVNRIYQQII NQEPSAASPR HYFPELNNSG MPQSPYVGPS TPSNQQFLRV ATPDNANDLM
NGLEQNIYKQ GQSIQQVQDW IQKLASQQQQ QQASINEIND EIKHDLDSFD VNEFLNNTN