Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_16323 |
Symbol | CTA8 |
ID | 4838236 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | + |
Start bp | 1024255 |
End bp | 1026132 |
Gene Length | 1878 bp |
Protein Length | 599 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640389551 |
Product | Heat shock transcription factor |
Protein accession | XP_001383484 |
Protein GI | 150864599 |
COG category | [K] Transcription |
COG ID | [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.173191 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0871257 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAACT TCGAGGATAA GCACGATCCT ATCGTTGAAC TTCTCAACTC GGGTGCTGGA GAGCATGACG AAGATATCAA AGTGATTAGT AGAGATCCGT TCAGCGATAA CTTCGTAGCT ACACCAACCA CAGCTCAGCT CTTGGATCCC AGCTATCACA TAAAGGACGA GGAAAAAAGC AATACGAATA AGAATAGCAA CGCAGGCAAC AATAATAGCA GTATCAGCAA TATCAACAGC ATCAACAGTA CTAGTAATAT CCATCACAAC AGTGTAAACC ACAGTAATAA TAGCAATAAT AATGATAATT ATGACGATTT TCATACTTTC CATACTGGGG CTACCCCCAT AGAGCCAACC GGACTAACAC CAATGCTTGC ACCAGCACCT GATTTTGTGC CGCATGCCAA CAACCAGATC GCCTACATCA ACAAATCCAG CTTTTTGAAC CCTTTGCCTC CTATGGCTCC ACTTTCAGAA CTGGTCCTTC CCAACGGAAT CAATATAAAC CCTCTAGCAT TGAACGGATC GAGTAAACTT TCTAACGATG ACAAGAAAAC TATAACCAAT AACAGTAGTG CTGTGAGTAG CTCTAAAAGG AAAAAGGAAC CAGCTGGACC CAAAACTAGA CCTCTTTTTG TTACTAAGAT ATGGTCTATG GTAAACGATC CAGACAACCA GGAATATATC CGTTGGAACG AAGATGGAAA GACTTTTCAG GTTTTCCATA GAGAAGAATT CATGAAGTAC ATATTACCCA AATACTTCAA GCACAGTAAT TTCGCTTCCT TTGTCCGTCA ATTGAACATG TACGGATGGC ACAAAGTACA AGATATCAAC AGTGGAACTT TCAACCTGGG AAAGGGAGAT AAAGGCATGG AAGAAGTGTG GCAATTTGAG AATCCAAATT TCATTAGAGA TAGAGAAGAT TTATTGGACA AGATTATTAG AAACAAGAGT GTTTCCCAAG AGAGTGAACA CGACAACAAC GCCGTCAACT TCCAGATCCT CCTCAATGAA TTAGACAGCA TCAAGATGAA CCAATTGGCT ATTGGTGAAG ATTTGCGTCG CGTCAGAAAA GATAACAAAA CATTATGGAA CGAGAATTAC ATGACAAGAG AAAGACACCA ACAGCAAGCC CAAACTCTTG ACAGAATCTT GAAGTTCTTA GCTGCTGTTT ATGGTAACAA TACTGGCAAG ATTCTAGAAG TAGATAATGG TCCAGAGTAC AATGATGGTC AAATGACTGC CTACAATCCT GGCCAGCCCC CATCGCCTAA CCCCTACGCT CAACAAATGT ACGCTCCAAT ACAGAAGCCA ATGCTTATGC TCACGAATCA AGCACATGGA CCTAGTCCGT CTGGTTCTAC GTATAAATCT CCTAGACAAA CTTCAATATC TAGTTCTAAC AACAGGGATC ATAGAGATAG TTCCATTACG GACTCGGGTT CGATCGAAGA AATTATAAGA TCCTACGGAA ACACTCCCAG AAATGGCGAG AGAAGTGGCG ATGCTGCGAA CAATGTGAAT AGGATATATC AACAGATTAT CAACCAGGAG CCTTCGGCTG CTTCCCCTAG ACATTACTTC CCTGAGTTGA ACAACAGTGG AATGCCACAG AGTCCTTATG TTGGCCCCAG CACTCCATCG AACCAACAAT TTTTACGAGT GGCAACTCCT GATAATGCCA ATGACTTAAT GAATGGTTTA GAGCAAAATA TTTACAAGCA GGGACAGCTG ATTCAGCAAG TTCAAGACTG GATCCAGAAA CTTGCTTCAC AGCAGCAGCA ACAACAGGCC TCAATTAACG AAATTAATGA CGAAATCAAG CACGATTTAG ACAGTTTTGA TGTTAACGAG TTTTTGAATA ACACTAAC
|
Protein sequence | MANFEDKHDP IVELLNSGAG EHDEDIKVIS RDPFSDNFVA TPTTAQLLDP SYHIKDEEKS NTNKNSNAGN NNSSISNINS INSTSNIHHN KPTGLTPMLA PAPDFVPHAN NQIAYINKSS FLNPLPPMAP LSESVLPNGI NINPLALNGS SKLSNDDKKT ITNNSSAVSS SKRKKEPAGP KTRPLFVTKI WSMVNDPDNQ EYIRWNEDGK TFQVFHREEF MKYILPKYFK HSNFASFVRQ LNMYGWHKVQ DINSGTFNSG KGDKGMEEVW QFENPNFIRD REDLLDKIIR NKSVSQESEH DNNAVNFQIL LNELDSIKMN QLAIGEDLRR VRKDNKTLWN ENYMTRERHQ QQAQTLDRIL KFLAAVYGNN TGKILEVDNG PEYNDGQMTA YNPGQPPSPN PYAQQMYAPI QKPMLMLTNQ AHGPSPSGST YKSPRQTSIS SSNNRDHRDS SITDSGSIEE IIRSYGNTPR NGERSGDAAN NVNRIYQQII NQEPSAASPR HYFPELNNSG MPQSPYVGPS TPSNQQFLRV ATPDNANDLM NGLEQNIYKQ GQSIQQVQDW IQKLASQQQQ QQASINEIND EIKHDLDSFD VNEFLNNTN
|
| |