Gene Information |
Locus tag | PICST_29193 |
Symbol | |
ID | 4851924 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 3193794 |
End bp | 3194984 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | |
GC content | 40% |
IMG OID | 640393632 |
Product | predicted protein |
Protein accession | XP_001387182 |
Protein GI | 126276035 |
COG category | [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
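The coordinate and length fields in the table above can be checked against each other. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends with a single untranslated stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3193794
END_BP = 3194984

# Assumption: one terminal stop codon (3 nt) is not counted in the protein length.
STOP_CODON_NT = 3


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1191, matches "Gene Length"
print(protein_length(length_bp))  # 396, matches "Protein Length"
```

Because the record lists the gene as unspliced, the whole coordinate span is treated as coding sequence. A spliced gene would need its exon lengths summed instead.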
| |
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.00865439 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTTCTGTTAC TTCAGAGACC AAACAAACAT TCAAGACTAC AACTACGACT ACCTTCGATG AAATATTCAA CAAGTTGCCA AATTCACTTC CGAGAAGAGT TACACAAGAC ACCAAGAGAT CTCTCCTCCC TTTGGATAAA AAGCTTTTAA CCGAACTTAG CTCATTTCAT TCCATCGTTC CTGCTCAATT TACAATGGAT GATATTCCTG TCCAAGCATT CGAAAGACAG ATTAACAGGG CTAACTACTT ATCTGAAGAC TCCATTGTTG ACAGATATGG AATACCGAGA AGAGAAATAG GAAAAGGTTG CTTTGGAACT GCCTACAAGC TTATGAGACT CTCAGACTTG CACCCTTTCA CAGTGAAATA TGTCTCGTTC ACTTCGACTC CCTTGCTCAC CAAGAGTATG GTACTTAGAG AATTCTACTA TACGAGAAGA TTGTCTAGCA AATATGTCGC CAGGTCTTTG GATTTGATGA TTTCGAGTAG TCGCCCTGAC GAAATGATGA TTGTCCAAGA CTATGCTGCA GGTATTGACT TGTTCGATGC TATCACCAAA AACTTGTCCA AGTTCAGGCT GATCAGATAC AGCGAAAGAA TTGTTAAGCA GGTTATTGAG GCAATCTGCC ACTGCCATTC CGAAGGTATT GGACATAACG ACATCAAGCT TGAAAATATC CGCTACAGCC CGATGACAGA TCAAATTAAG CTTATTGATT TTGGTCTTTC AACCAGATTA GCAGAGCTCG CTACCGAAGA AAATATCAGT TACCTCATGT CTTCAGGAAC TCCAGGTCTT TTGGATCATA ATTGCAGAAA AGCACTCCCA GATTGCACCA TGTTCTTCGG AGATACTACT GTGAAAAGGA TGTCAATGGC AAAGGATATG TTTGCATTAG GTCTCTTAGC TTTCTCTGTG ATATCCAGAG GCAACTCACT TTGGGACAAT TCAGACTTTG AAGACCGTGA TTACTGTACA TTTTTCGAAA CAAGAGAACT TACCCAAATG ATACACTATA TCAGGGACGT GGAAGGTGAT AGACAAGACA GAGCAATTAT GTTCAAGCCA GTATTGGAGA AAATGGTTGA ACCTCAAGAT GGTAGAAGAT TGACCTTTAG TAACTTAGTA AAAAGTGATT GGTATACTTC CATTTACGAA CTTGGTGATT CTAAATATTA G
|
Protein sequence | MASTSVTSET KQTFKTTTTT TFDEIFNKLP NSLPRRVTQD TKRSLLPLDK KLLTELSSFH SIVPAQFTMD DIPVQAFERQ INRANYLSED SIVDRYGIPR REIGKGCFGT AYKLMRLSDL HPFTVKYVSF TSTPLLTKSM VLREFYYTRR LSSKYVARSL DLMISSSRPD EMMIVQDYAA GIDLFDAITK NLSKFRLIRY SERIVKQVIE AICHCHSEGI GHNDIKLENI RYSPMTDQIK LIDFGLSTRL AELATEENIS YLMSSGTPGL LDHNCRKALP DCTMFFGDTT VKRMSMAKDM FALGLLAFSV ISRGNSLWDN SDFEDRDYCT FFETRELTQM IHYIRDVEGD RQDRAIMFKP VLEKMVEPQD GRRLTFSNLV KSDWYTSIYE LGDSKY
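The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any computation. The sketch below shows one possible way to do this and to compute a GC fraction. The helper names are hypothetical, and the example runs only on the first three blocks of the gene sequence, so its result is not expected to equal the 40% reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the gene sequence, copied from the record.
first_blocks = "ATGGCTTCTA CTTCTGTTAC TTCAGAGACC"
seq = clean_sequence(first_blocks)

print(len(seq))   # 30
print(seq[:3])    # ATG, the start codon
print(round(gc_fraction(seq), 2))
```

Passing the full gene sequence string to `clean_sequence` and then to `gc_fraction` would give a value to compare with the GC content field, under the assumption that the displayed sequence is complete.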