Gene Information |
Locus tag | PICST_59469 |
Symbol | |
ID | 4838552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 831762 |
End bp | 833867 |
Gene Length | 2106 bp |
Protein Length | 701 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640389867 |
Product | predicted protein |
Protein accession | XP_001384460 |
Protein GI | 150865304 |
COG category | [O] Posttranslational modification, protein turnover, chaperones; [R] General function prediction only |
COG ID | [COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain; [COG5533] Ubiquitin C-terminal hydrolase |
TIGRFAM ID | |
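
The coordinate and length fields above are mutually consistent. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 831762
end_bp = 833867

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# The gene is unspliced (Is gene spliced: No), so the whole span is coding.
codon_count = gene_length // 3

# Assumption: the reported protein length excludes the terminal stop codon.
protein_length = codon_count - 1

print(gene_length)     # 2106
print(protein_length)  # 701
```

Because the gene lies on the minus strand, the sequence listed under Sequence is presumably the reverse complement of the replicon between these coordinates; the length arithmetic is the same on either strand.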
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.185317 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTGCTA GCTTAAACAA GAGTACTAGC AGCGGGTCCA AGGAGTTTAA CCCTGTACTC GATAGATATC TTTCGCATCC GCTCACTTTT AAGCCCGGAA GACAGATGGA TGCCAACACC AGTACCGAGG GCAAGCCAGG CAATTATATA ATTCTCTCGC GCAAGAAGAA GACTTTGGCC AGCGGAGGAA GCATAGCGAT TAACGGAAGC AGTTCTCCAA CTTCTGCTAC CACTTCCCCG AAGTCTTTAG CGAAAGCTAC ACACAAGCCG AGGTCTATGG CTGAGGCTGT AGCTGCGTAT ACTGGTAAGA AGTTTCTTAC TAAGGCAGAG AAAAAACAGT TATCAAGAAA GAGGAAGCTA GAAGAGGCGG AAGAAAAACG TCAAGTGAAT CAAGATGCTA ATACTACTTC ATCTGCATCT ACATCGATCA TCAAGAATAT TTTCAGCATT TATAACCGAA CTGCCAGCTC TCAGAATGAG GATTTGGGTG AGAATGTGAC GTCCGATAAC GATTCACAGT ATGAATCGGC TTCTGAAGTG TTTGAAGAAA CAAGCAACAA TAATACCGAG AGCGAGAGTC CTTTCACAGG TTTCAGTGAG TCTGAGTCCA AGAGCGGAAC TCCTGGTGCT GAAGATAAGA ATGATGTAGA TGAAGAAGAA GAAGATGAAG AAGACGACGA TTTCAATAAT TCCAGTTCCT CTTCTAGTTC ATCCGTGGAA GATTCGTCTA GGTCGACTAC TCCTTCTGAA GAAGATGAAG ACGAGGATAA AGACCTTGAG AAGTTGAAGT ATGACTTAAA AACGGATCAG CTTGAGAATC AGAAACAGAA AGACGAAGAC TATGAAGAAG AAGATGAAGA CGACGAAGAA GAAGATTTGG AAGATGAAGA GAAGCTGAAA CAGCAGTCGA AAGAGTCATC TACACCACCG ACTTCACCTG AGGAGGACAA CGATGAAGAG AAACAGCTTC AGTTCTACGA TATGGGAGAG GACCCTTCTG ATCGAGGTTC AAACCACAGC AAGCGTATAT ACAAGAACTG GCGGGAGTTG GAAAACAAGA AGCCTGTAGG GTTATTGAAC CATGGCGTGA CTTGTTACAT GAACTCAGCC ATCCAAGCCA TGGTTCACAT TCCTGCTATT CAGCATTATC TCAACGCTAT CAACCACAAC AAGGTCTCGG AGTTGAAGCC TCGTTCAGTG AGCCATGTGT TGGCTGACTT GAGTCGTCGT ATGTGGGCTC TAGATGGCAC TAAGCATGTC AAGTACGTCA ACCCCAAGAA GATCATCCAA CGTTTAGGTG ATATCAACTG TATGATGAGC GAATGGCAGC AGGAAGATGC CCATGAGTAC TTTATGTCGT TGATGTCACG TTTGCAAGAA GACCTGACAC CCAAGGGAGT CAAGATGAAC CAGTCTATCA TATATGACAT CTTTGGAGGG TTACTCGCAC AGAGAATCAC ATGTACCAAA TGTAACAATG TTTCGGAAAC GAAGCAGGAG TTCTACGACT TGCTGTTAGG GCTCAATAAG AAGAAACTCA GAGACCACCA GCCTATAGAT GACTCAATTC CCAGCTCCAA TCGTTATCTG ATTGAGAAGT CGGTCCGCGA TTTCTTCAGT AACGAATTGA TCAAGATCGA CAAGGCAGAC AGCAAACTGG GCTACTTCTG CGAGAAGTGC CAGGACCGTA CTGTAGCACA TAAGATATCA TTTATAGACA GATCGCCAGA GTATTTGACG GTGCATCTCA AGCGTTTCAA GTTCAACGGC AACTCGTCGC TGAAGGTGAA GCAGTCCATC 
AGCTATAGCG ATGTTCTAGA CTTGACTCGG TACACTGTGG ATGCGCGTCA TGCTGCGAAG TACAAGTTGA TGGCAGTGAT AGTTCACGAG GGTAGATCCA TCTCTAGTGG ACACTATATT GCTCATTGTC TTCAGCCAGA CGGCAGTTGG GCCACGTATG ACGACGAGTA CATCAACAAG ATTGATGCAA GAATAGCTTT GGCCGATCCG TCGGCTTATG TGTTGGTGTA TTCGAAGTTG ACACCGAAGG ATTTAAAGAG AAATGGGGAT GGTATTGAAA GTGAGGCAAA GAGAAGAAAG ATATAG
Protein sequence | MAASLNKSTS SGSKEFNPVL DRYLSHPLTF KPGRQMDANT STEGKPGNYI ILSRKKKTLA SGGSIAINGS SSPTSATTSP KSLAKATHKP RSMAEAVAAY TGKKFLTKAE KKQLSRKRKL EEAEEKRQVN QDANTTSSAS TSIIKNIFSI YNRTASSQNE DLGENVTSDN DSQYESASEV FEETSNNNTE SESPFTGFSE SESKSGTPGA EDKNDVDEEE EDEEDDDFNN SSSSSSSSVE DSSRSTTPSE EDEDEDKDLE KLKYDLKTDQ LENQKQKDED YEEEDEDDEE EDLEDEEKSK QQSKESSTPP TSPEEDNDEE KQLQFYDMGE DPSDRGSNHS KRIYKNWREL ENKKPVGLLN HGVTCYMNSA IQAMVHIPAI QHYLNAINHN KVSELKPRSV SHVLADLSRR MWALDGTKHV KYVNPKKIIQ RLGDINCMMS EWQQEDAHEY FMSLMSRLQE DSTPKGVKMN QSIIYDIFGG LLAQRITCTK CNNVSETKQE FYDLSLGLNK KKLRDHQPID DSIPSSNRYS IEKSVRDFFS NELIKIDKAD SKSGYFCEKC QDRTVAHKIS FIDRSPEYLT VHLKRFKFNG NSSSKVKQSI SYSDVLDLTR YTVDARHAAK YKLMAVIVHE GRSISSGHYI AHCLQPDGSW ATYDDEYINK IDARIALADP SAYVLVYSKL TPKDLKRNGD GIESEAKRRK I
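
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a 30-base in-frame fragment taken from the gene sequence above. The helper `translate` and the table names are hypothetical, written only for this example; the codon table is built by hand from the standard code with CTG as the single override.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in product(BASES, repeat=3)]

TABLE_1 = dict(zip(CODONS, STANDARD_AAS))
# Table 12 differs from the standard code in one amino-acid assignment: CTG -> Ser.
TABLE_12 = {**TABLE_1, "CTG": "S"}


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# In-frame fragment from the gene sequence above; it contains one CTG codon.
fragment = "GATGAAGAGAAGCTGAAACAGCAGTCGAAA"

print(translate(fragment, TABLE_12))  # DEEKSKQQSK
print(translate(fragment, TABLE_1))   # DEEKLKQQSK
```

The table 12 result matches the corresponding stretch of the protein sequence (DEEKSKQQSK), whereas the standard code would place a leucine at the CTG position.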