Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66106 |
Symbol | SKN7 |
ID | 4840546 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | + |
Start bp | 902241 |
End bp | 904543 |
Gene Length | 2303 bp |
Protein Length | 421 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640391861 |
Product | Protein with similarity to DNA-binding region of heat shock transcription factors |
Protein accession | XP_001386184 |
Protein GI | 150866545 |
COG category | [K] Transcription [T] Signal transduction mechanisms |
COG ID | [COG0745] Response regulators consisting of a CheY-like receiver domain and a winged-helix DNA-binding domain [COG5169] Heat shock transcription factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.415841 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCTCAGTTC TTGTTGGTTG ATCGGATTCA GAATTATACG CCTCGACGCA ACTCTCTATA GTTATAATTC AATTCAAGAA GACCCCATAT CCAGACAGTA CCCACTGAAT ATCCTGCTGT CTTGGAAAGG TCGTGATACT AGTTCTTATC ATTGTCTATC TCGTTGTTGT CATTAGTATA GTATCGAGGA GTACGTAGGG ATTCGACAGG ATTTGGAATA TGAACATCTG GAACTGAATA CCTGAATATC CGAATCTGAA ATATTTCAAT ATCTGATAAA ATCTGTGAAA TTTATCTTAC AACAAATACA AGACTACCAT CAATTCCCAT CCTTATCAAT ATCATATCAC TGTCTCCATC GCTATCGCTA TCACTACCAC TATCATTATC ATTATATTAG AATCCTTCGA ATAATTGATA GATTTGACCC TTTTCTCGGC GCCCATTTGA AACATAAACT ATGTCGCATC CCATCAAGTC CGAGCCGAGT CTCTCAGCAT CTGTATCCTC GACTGCGAGC TCCAATCAGT CAGGCTCCAA CGACTTTGTC AAAAAGCTCT TTCTAATGCT TCAGGAAGAT TCCTACAAGG ATGTCGTGAG ATGGACAGCT AATGGTGACA GTTTTGTAGT GCTCAACACC AACGAGTTCA CCAAGGAGAT CTTGCCGCGT CACTTCAAGC ACTCTAACTT TGCCAGCTTT GTTCGTCAGC TCAACAAGTA CGACTTTCAC AAAGTAAAAG TATCTAATGA AGAGAAAATG GTGTACCCAT ACGGTGAAGA TGCGTGGGAA TTCAAACACC CTGATTTCAA GATCAACGAT AGGGGCTCAC TTGAAAATAT CAAAAGAAAG GGCCCTTCTT CTAAAAAAAT CCTGAGTGCC AACACCATCA CTAACGGCGG TGATTTCACG TCGTCGTCCT CAGTAGCATG TAACCATAAC TTGTCTCAAA TCACGACTGC TCAGTCCCAT CTCAAGGACC AAGTAGAACA GCTCAGAGCG GAAAACAAAC AATTGCATCA GGATGTCAAC GTTCTCCAGA CGAAGTATAA GACGCTTATC GAGAACATTG TAGCCATAAA TACTTTTGAT GAGCGCTACC ATCGCTCCAT GGGCATTTTG ATTAACTGTT TGCTCCAAGC GGGAATCAAA CTCCCTCCAT TAGATTTTCC CAATCCCGCA CTCATGAGTC TCCAGCAGCC GTCAAGACAT CCTCAACAGC AACAACCCCA ATCCTCGTCT CAAATTCAAT CTCAACAACC ACAATTACCA CCATCATTAG CACCTTCCCT CATACCGCCA ACTCAAGGTC CTGTAGTCAA CCCACCGTTG GTAACCAACG TACCTCCACA GCTGGGACAT ATTGCACAGC TTTCACCCAC AATAGCTAGT CCCAACGGCG TGAGAATACC GTTACAGCCA GGCTCTGCAG CTTCTCAAAG TGTAGCACCA GGCGTAGCAG TAGTACCACC GCCAGGTCCC AATGAGCCGG GAGGGCCCGG AGGTGCAGCT TTTATGACGG GACAAGGACC TCCTCCTGTA GTTGCTAGAC AGGCTTCGCC GAGCGATACA TTAACTCCCA AACAGCAAGG AAACTCGTCT TCTGGCTCAC CCAGGACAAC TGGTCTTGTA GCACAGACCA CCACGACAAA CATACCCAAC CCCAAGTTCC ATGTACTCTT AGTGGAGGAC GACAATGTGT GTATCCAGTT GTGTCGTAAG TTTCTTGTCA AGTATGGCTG TCAAGTTACC GTAGTTACCG ATGGACTCAA TGCAATCTCC ACTGTAGAAC ACACCAAGTA CGACTTGGTT TTGATGGATA TCGTAATGCC CAACTTAGAC GGAGCCACGG CGACCAGTGT AATCAGATCG TTCGATACAA AGACCCCCAT CATCGCCATG ACAGGAAACA TCGAAGATAA CGACTTGGTG ACATACTTGC AAAACGGGAT GTCGGACATT TTGGCCAAAC CATTCACCAA AGATGATCTC TACTCCATCT TATCCAAGCA CTTGTTGACA GACGAGTCGA AGACTGCTGC TGCTGTTGGA GTAACTTCCA GAAACATCAG CATCAGCGGA CCTACCTTGC CAGCAGAATC CGACGAAGAC CCACTTCTCA AGAAACAGCG ACTTCAATAA TATCAATGAA AATAATAAAC ATGAACAATG ATAAAAGGTG AATGTAATAT AGATTTGGCT GCTGTACTAT TATCATAATA ATTATATACT ATATCTTGCA AAAACGTGCT GGTAAAATAA CAGTGCTTTA ATTATATAGC TGTAATATAC ACTGTACTTC TAT
|
Protein sequence | MSHPIKSEPS LSASVSSTAS SNQSGSNDFV KKLFLMLQED SYKDVVRWTA NGDSFVVLNT NEFTKEILPR HFKHSNFASF VRQLNKYDFH KVKVSNEEKM VYPYGEDAWE FKHPDFKIND RGSLENIKRK GPSSKKISSA NTITNGGDFT SSSSVACNHN LSQITTAQSH LKDQVEQLRA ENKQLHQDVN VLQTKYKTLI ENIVAINTFD ERYHRSMGIL INCLLQAGIK LPPLDFPNPA LMSLQQPTTG LVAQTTTTNI PNPKFHVLLV EDDNVCIQLC RKFLVKYGCQ VTVVTDGLNA ISTVEHTKYD LVLMDIVMPN LDGATATSVI RSFDTKTPII AMTGNIEDND LVTYLQNGMS DILAKPFTKD DLYSILSKHL LTDESKTAAA VGVTSRNISI SGPTLPAESD EDPLLKKQRL Q
|
| |