Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67669 |
Symbol | |
ID | 4838563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 1476222 |
End bp | 1479575 |
Gene Length | 3354 bp |
Protein Length | 960 aa |
Translation table | 12 |
GC content | 43% |
IMG OID | 640389878 |
Product | predicted protein |
Protein accession | XP_001384585 |
Protein GI | 150865388 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATATAGGCCA GCCTATCTAA GTGAAGTATT CCGTCCTACG ACATTCATTG TCACTGTAAA GATTCTGTTA ATCCATAGGA CTGGTTTTTC ATTCTCATTC CATAAAAGCC TACAACTTCA CTTCAATCTT CAATTCTTAC ATCTTCACAT TTCCATTCTT ATTACATAGC TTTTACCATT AGCTTCCCAT CGGCCGTATA CCCATCACAC TCATCGGAGC GAGCTATTGC TACAGTTATT GTCATCCACA TCCACATCTA CTTTGTTTCA TTATCAAAGT CTCATTTTTC AGAGAGCCAT TCCTTTAATT CTTCAATCAT AAACAATTCG TAAATGAATG GTGCTTCCCG AGTGATCAGT GATCAGAATC AGCGAGCTCA ACTAGCAGCG CAGTTCAACG ATTTCTACCT CCACATAACA CTGCCCAACG TCACCCAGAT TGGCAATTAT CGTATCATCG AAGAGATAGG CGAGGGAGCC TTTGGCAAGG TTTACTTGGC CAAGCATGTC TTGCTTAACA TAGAAGTAGT CTTGAAATGT GGACTAGTCG ATGATCCCAA TATTGTACGA GAGATCTACT ATCACAAGCA GTTGAAACAC AAGAACATCG TAAGTCTCTA CGAGGTTATC AAAACAGAAT CGCATCTCTG GCTCGTTCTC GAGTATTGTC AGGGTGGGGA ATTATTTTAC TATATATACG AGAAAAAGCG TCTAGATTAT CGAGAATGCC AGCACCTTTT CTTTCAGATA GTTCTAGCCT TGAGACATGT CCATCTGCTT AATCTAAGTC ATCGAGACTT GAAGCTAGAG AACATCTTGC TTGCGGACAA AAAGAGGTCG ATTGTCAAGA TTACAGACTT TGGCTTTGTG CGGGAGTTTG ATCCTCTGAA CAGGCGTTTT TTGTCGACTA TCTGTGGAAC CACAGTCTAT ATGGCACCAG AGTTGTTGAA GAACGAGAAA TACTCAGGTT TTGCGAGCGA TATCTGGGCT CTCGGAGTAA TTCTCTTTAC TATGATATAT GGTGAGATGC CCTTTGATGA AGATGACGAT CTCAGAACAA AGTACAAAAT CGTCAATGAA GAACCTTTCT ACAGAGACAA TATACCGCAA CATTTAGTCC AACTTATCCA GAAGATGCTA TCCAAAGATC CCAACGAAAG ACCGCATCTC AACGACATCT TGAACTCGCA GTTCTTGATA GACATCCACA ACAAATTCAC CGAGAGAGAC TCCAAAAAAT ACAACGACAC TGAGTCGATC ATTTCTATAC ATCAGTACTA CAGCAATTGC GCCAGACCGT TCCAGTCTAA AGTAGAGAAA GAAATCATAA AAAGACTTCA GAAGTTGAAC TTCGACATAG ACGAACTTCA AGCTTGTGTT TATAGCAATG ATATGAATTC GTTAACAGCT TTCTACGAGT TAATGTTGAC ACAAGAGTTC TCCAAGAAGA AAAGGAAGTA TTATAGAGAG AAGCAGAAAA AATACTACGA GGCCAAAAGA ACACTCCGCA AATCAAGAAA GCGAGTTAAG AGTGCACTTT CGCTATCTGA CCTGAGTGTC ACTGGTAATG CTCCACCTTT AGAGAGAATC ATTTCCAGTT TGAGTTTGTC GTCCAACAGA AATGGTAGCC GTGCAGCGTT AAATAAGACT GTAGAGTCGC GTATGTCTTC AGATGGTGCA GAAAGACGAA GTTCTCATAC CAGAACGGCT TCTGGCTCTG GGCCCACTTT ACGAATCGAC AACAATGGGC TATTAAACGT GGGCTCTTCT CCAGGCTCGC CTACCTCTAC CAGATCACGA GGAGCTCCTG ATACACCACG AGAGAGAGAA AATCTGCATG CCACCGTAGA AACTGGCCCA CCGTTGCAGA GAATCGTATC CTTTGTGCCC GAGGATAATC GAAGACGATC TGATATTTCT GTGGTTGCAA GCAACGAGCC ACTCAAGAAG AAGATCAAGA ACGGCAACTT CTTAAACAAG ATCCAGTTCT GGAAGAAATC GAGAAAGGAA GAAGACTTGG ACTCAAACTA TAGTGTCCAT TCCAGACAGT TCCCAAAATC TTCAACTGAC TACAGTAACG AAAAAAACGA AGACAGACCT CTAGAAATCA CCATAAACCG TAGTTCACCT TCACGTGATA TGAATGACGT TAACAGTGGC TCTCCACATG CAGAGCACAG ACATCTGGAA AGACATTCAG GACCACTCAT GGAGAATTTC ACCTTGAATC AGCCAGCTTT GACGATGCGC AAAGATGATT CGATTGAAGA TCCGGCAGTT TCCTTGACTT TAAACGAGAC GAATTCTCCT ACACCACCAG TAACCGACCA ATCGGGTTCT CGAACTGTGA GAACCAGGCC GACTTCGATG ATTTCACAGA TGAGTCAAGT GAGTCATTTG AGTCAGATGT CAACGATGAT GTCGGAATCG GAATTAGATA TCTTGGACGA AACGGATACT ATGGACGATG AATATGACGA TGACGACGAC GGAGCCTATG AGTCGAGTTT GAACATCTCG CAAGACTTCA CCAGACATCT GTCGACAGCC ATGACGCCGA CGTCGAGTTT TGGAGCGAAT GCGCAGCTGA AGTATGCTTC TGCAAAGAAG CGGCCCGGAT ATAAGCGTAA TGGATCTGAT TTCTCGCTCA AATCAGGGTC TACTTCTTAT AAGCACAACA AGAAGTTCTC ACTCAGCCAA TTGTCTTCCA ACTCGTCTGA AGAGTCTTCT ATCAAGTCCA ATAGCAACTT CGTGGCACCA ATACCTACTA AGCCGACAAT AGCATCGACA TTAAACGGTG ACTTGGACGG CACATTGACT CCAACACATA TTGCCAGAGC CAACTCGCCA GATTTGGCGA AAGGTAAGAC GAAAAAGCGT TGGAATCCGA TCTTTTCTGA CAACATTACT TCCAACTCTT ACAACCAGTC TAGTTCTCCA TTGTATGGAA ACGGTCAGAT GATGGACTAT AGAGCTCATT CGCCACCACC AAATAAGATG AACACCAAAT ACCCTGTTAA GACACTATTT GACCAAAAGA AGCAGGCGAA GACTCTGCCC TCTCCTGGAT ATGTATCCGG GCCACTGCCC AAAGATGCAG CTAGGAGTGA TACCAGATGG GAGCCTAACT TCGTGTCCAC ATCAGTGGTG GGGTCATTTA CAGCGCCAAA GTCGGGCTTT GAGCCGGTGA ATGAAGAGGA TGAGAATGAA TACTAAAATG GGATTCTTAT AGATGTACAA AGTTCAAGTA GTTTGTATGG CTGCGAAATA TAGAACAGTG AGGAAGTGAC TGTGTATGTA TTCTTTAAAG CCGTCAGCGA AACAGTCAAG CAAATAGTAA TGTTTAAGAG CGAT
|
Protein sequence | MNGASRVISD QNQRAQLAAQ FNDFYLHITS PNVTQIGNYR IIEEIGEGAF GKVYLAKHVL LNIEVVLKCG LVDDPNIVRE IYYHKQLKHK NIVSLYEVIK TESHLWLVLE YCQGGELFYY IYEKKRLDYR ECQHLFFQIV LALRHVHSLN LSHRDLKLEN ILLADKKRSI VKITDFGFVR EFDPSNRRFL STICGTTVYM APELLKNEKY SGFASDIWAL GVILFTMIYG EMPFDEDDDL RTKYKIVNEE PFYRDNIPQH LVQLIQKMLS KDPNERPHLN DILNSQFLID IHNKFTERDS KKYNDTESII SIHQYYSNCA RPFQSKVEKE IIKRLQKLNF DIDELQACVY SNDMNSLTAF YELMLTQEFS KKKRKYYREK QKKYYEAKRT LRKSRKRVKS ALSLSDSSVT GNAPPLERII SSLSLSSNRN GSRAALNKTV ESRMSSDGAE RRSSHTRTAS GSGPTLRIDN NGLLNVGSSP GSPTSTRSRG APDTPREREN SHATVETGPP LQRIVSFVPE DNRRRSDISV VASNEPLKKK IKNGNFLNKI QFWKKSRKEE DLDSNYSVHS RQFPKSSTDY SNEKNEDRPL EITINRSSPS RDMNDVNSGS PHAEHRHSER HSGPLMENFT LNQPALTMRK DDSIEDPAVS LTLNETNSPT PPVTDQSGSR TVRTRPTSMI SQMSQVSHLS QMSTMMSESE LDILDETDTM DDEYDDDDDG AYESSLNISQ DFTRHSSTAM TPTSSFGANA QSKYASAKKR PGYKRNGSDF SLKSGSTSYK HNKKFSLSQL SSNSSEESSI KSNSNFVAPI PTKPTIASTL NGDLDGTLTP THIARANSPD LAKGKTKKRW NPIFSDNITS NSYNQSSSPL YGNGQMMDYR AHSPPPNKMN TKYPVKTLFD QKKQAKTSPS PGYVSGPSPK DAARSDTRWE PNFVSTSVVG SFTAPKSGFE PVNEEDENEY
|
| |