Gene Information |
Locus tag | PICST_33379 |
Symbol | |
ID | 4840688 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 462546 |
End bp | 463814 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392003 |
Product | predicted protein |
Protein accession | XP_001386290 |
Protein GI | 150866626 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0478] RIO-like serine/threonine protein kinase fused to N-terminal HTH domain |
TIGRFAM ID | |
|
|
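The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon (the record lists the gene as unspliced). The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(462546, 463814)
print(length_bp, protein_length(length_bp))
```

With the values from this record, the sketch reproduces the listed gene length of 1269 bp and protein length of 422 aa.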
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00101834 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCTTG ATACTTCTCA TATGAGGTAT TTGACCTCTG ACGATTTCAA AGTTCTTCAA GCTGTGGAAA TTGGCTCGAG AAATCACGAG TTGGTTCCTA CCACCTTGAT TCATTCTATA GGAGGGTTGA AGTCTCCCTC AGGCACCAAC AGGGCCATTG GGGACCTAGC CAAATTGAAA TTAGTCAACC GGTTAAGGAA CGCCAAATAC GACGGGTTCC GATTGACTTA CTCTGGTTAC GACTACTTGG CTCTCAAATC CATGCTTAAC AGACAAACTG TGTACTCTGT AGGAACTACT ATTGGTGTAG GTAAAGAATC GGATATCTAT TCTGTCAGTG ATCCACAAGG AGTTCAGAAG GTGATGAAGA TTCACCGTTT GGGTAGAACA TCCTTCAAAA CTGTCAAAAA CAACCGTGAC TACTTGAAAA ATAAGCTGAC TTCCAACTGG ATGTACTTGT CTCGTCTTGC TGCCGAGAAG GAACACGAAT TTATGGTAGT ATTGTATAAC AACGGGTTCA ATGTTCCCGA GCCGTTTGAT TCGTCCAGAC ACTGTGTGTT AATGGAGTGG ATCAAGGGAA TTCCTATGAA ACACTTGCGA AAACATAGAG ACTACAGGAA GTTGTACTCT GAGTTGATGA ACTTCATCGT CAAGTTGGCT AACCATGGGT TGATCCACTG TGACTTCAAT GAGTTCAACA TAATCATCCG AGACGACTCT GAAGCTTCCA AGCACGAGTT CGACTTTGTA GTCATCGATT TCCCTCAGTG TGTCTCCATA GAACATCCTG ACGCTAAGCA GTACTTTGAC AGAGACGTGG AAGGTATACG ATCTTTTTTT GAAAAGAAGT TTAGATACGC TCCTAGCCAC GATGCTACCA TGTTCGACAC TGAAGGATAC GGTGATGGTT ACAAGTATGC TTATCCTAAC TTCAAACGTG ATGTTATCCG TGAAAAGAGT CTAGATGTAG AGGTGAAGGC ATCGGGATAT GCTAAGAAAA CGACTGGGGT CAAGGAGGAC AAAGACTTGG AAAAGGCAGT TTTGGGAATG AGAATAAATC GATATGAGGA CGAAGATGAC CTTTCGGAAT TCGATGATGA AGATGTAGAC GGTGAAGACG ATGAAGACGG TGATGATTAC GAAGAAGAAG AGATTGACAG TGACGATGAC AACCAAGAGG AAGAAAACGA AAGAATCGTA GAGATGTTGT CTAGTGGAGT CAAGAACCTA AAGATGGACA AGTTGGGAAA TTATATTATA GAAGAATAA
|
Protein sequence | MKLDTSHMRY LTSDDFKVLQ AVEIGSRNHE LVPTTLIHSI GGLKSPSGTN RAIGDLAKLK LVNRLRNAKY DGFRLTYSGY DYLALKSMLN RQTVYSVGTT IGVGKESDIY SVSDPQGVQK VMKIHRLGRT SFKTVKNNRD YLKNKSTSNW MYLSRLAAEK EHEFMVVLYN NGFNVPEPFD SSRHCVLMEW IKGIPMKHLR KHRDYRKLYS ELMNFIVKLA NHGLIHCDFN EFNIIIRDDS EASKHEFDFV VIDFPQCVSI EHPDAKQYFD RDVEGIRSFF EKKFRYAPSH DATMFDTEGY GDGYKYAYPN FKRDVIREKS LDVEVKASGY AKKTTGVKED KDLEKAVLGM RINRYEDEDD LSEFDDEDVD GEDDEDGDDY EEEEIDSDDD NQEEENERIV EMLSSGVKNL KMDKLGNYII EE
|
| |
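The protein sequence above follows from the gene sequence under translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of that translation, assuming the usual NCBI codon ordering (bases in the order T, C, A, G) for the standard table. The helper names `codon_table` and `translate` are hypothetical, only tables 1 and 12 are handled, and ambiguous bases are not supported.

```python
from itertools import product

BASES = "TCAG"
# Amino acids of the standard code (table 1), in NCBI codon order: TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 12) -> dict:
    """Build a codon -> amino acid map for table 1 or table 12."""
    if table_id not in (1, 12):
        raise ValueError("this sketch only covers translation tables 1 and 12")
    codons = ["".join(c) for c in product(BASES, repeat=3)]
    table = dict(zip(codons, STANDARD_AAS))
    if table_id == 12:
        table["CTG"] = "S"  # the single difference from the standard code
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    table = codon_table(table_id)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGCTTG ATACTTCTCA TATGAGGTAT"))
```

As a usage example, the stretch `AATAAGCTGACTTCCAACTGG` from the gene sequence translates to `NKSTSNW` under table 12, matching the protein sequence in this record, whereas `translate(..., table_id=1)` would give `NKLTSNW`.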