Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_68798 |
Symbol | PWP1 |
ID | 4851317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1481425 |
End bp | 1483267 |
Gene Length | 1843 bp |
Protein Length | 566 aa |
Translation table | |
GC content | 46% |
IMG OID | 640393025 |
Product | periodic tryptophan protein 1 |
Protein accession | XP_001387518 |
Protein GI | 126274348 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.715644 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTCAGTAGCA TATAGAATAT GATATCGTCC AGCGCTTGGG TTCCTAGAGG CTTTGCCTCC GAATTCCCCG AAAAGTACGA ATTGGACGAC GAGGAAATGG AAAGAATTAC GGCCATGGCC AAATTGGAAT TGGCTGATGC AAAAGAAGAT TTGCACGAAG CCCAGGTTGA GGCTGGCGAA ACTGACAAGT TGGGGGACCA GATCGATTTG GATGACGACT TGAAGGAATA TGACTTGGAG AACTACGACA ACGATGGAGC AGACAGCGAA GGAGAAGAAG TGACGATGTT CCCTGGGTTA TCTAGCGAAG CCAAGTTCCA CAAGGAAGAC GGAGAAGGTT CTGATCCCTA CTTGACTTTA CCCACTGAAA CTGATCTTCA GGAAGAAAAG AAAGAGTCAC AGATCTATCC AACTGATAAT TTGGTATTAG CCACAAGAAC TGATGACGAT GTTTCGTATT TGGACGTATA CGTCTACGAT GACGGTGCCG GGGCTCCAGA TGGAGCCGAG GAAGAAGAAG AGGACAAACT CGATGCTGAT GTAGCCAAGG GGATGGTCAG AGACTCCAAT TTGTATGTGC ACCATGATAT CATGTTGCCA GCATTTCCCT TGTGTGTGGA ATGGATTAAC TTCAAACCTG GCTCTGAAGA TGGATCTAAT GTAGGTAACT TCGCTGCTGT AGGTACGTTC GATCCACAGA TCGAGATATG GAATTTGGAC TACATAGACA AGGCATTTCC TGACTTGATA TTGGGAGAGC CAGATGCCAA CTCATTTGCC GGAGCCGGCA AGAAGAACAA GAAGAAGAAG AAGAAGTCCC AGCATGTTAC CACCCACCAC ACCGATGCGG TGTTATCGTT GTCACATAAT AGAATACATA GATCGGTCTT GGCTTCGACT TCTGCCGACC ACACCGTCAA GTTGTGGGAT TTGAACAACG GCACTGCCGT CAGATCGTTG AACACTATCC ACAACAACAA AACTGTAGCA TCTTCTCAAT GGCATTCGCA GGAAGCTTCG ATCTTATTGA CTGGTGGTTA TGACAGCACT GTAGGCATTA CTGATGTGAG AATTTCTGAT GCCTCTAGTA TGACCAAAAG CTACAATGTA GCACCTGGTG AAGAGGTCGA AAACGTCCAC TGGGGCCATT CTTCAGTACC AGAGATTTTC TACGCTGGTA CAGACAGTGG TAATGTCTAT TGTTTCGATG TCAGACAGAT GGAAAAACCA TTGTGGACTT TGCATGCTCA CGACTCTGGT ATCTCTTCCT TGGATGTTAA CGCGCACATC CCAGGAATGT TGATTACCAG TGCCATGTCC GAAAAGACAG TCAAGCTCTG GAAGGCTCCT GTTGAATCTG GCAAGGGTCC ATCCATGGTC TTGTCTCGTG ACTTCGGTGT AGGTAATGTA TTGACAACTT CTTATGCTGG AGACATTGAA GTGGCTGGAA ACTTGACAGT TGGTGGAGTC AGTGGAGCCT TGAAGATGTG GGACTCGTTT TCCAACAGCT CTGTTCGTAA CTCTTTCCGT GATGAATTGA GACAACTCCA ACGTCTGGCA AGAGACGAAG CCAAACAGGT TGGTCGTGCT TCGAGAATTG CAAGAAAATA CCAGGGCTCT ACCGGAGAAT CTGTCATGAC TGTTGAAGCC GGTGGTTTGG AAGATGACTC AGACTCTGAT GGCGAAGATG CCAACGACGA CATGGAAGAT GAAGATTAAA ATCTTCTTCA ATTACCTCAC CATTATCATA CGATCATATG TGTATATTAG TAATAGCAGC TTATATAGAA TGCATTATCA CTTAACTAGA TAACTGATTT TTCACAATAG ATTCTTGGGC TAT
|
Protein sequence | MISSSAWVPR GFASEFPEKY ELDDEEMERI TAMAKLELAD AKEDLHEAQV EAGETDKLGD QIDLDDDLKE YDLENYDNDG ADSEGEEVTM FPGLSSEAKF HKEDGEGSDP YLTLPTETDL QEEKKESQIY PTDNLVLATR TDDDVSYLDV YVYDDGAGAP DGAEEEEEDK LDADVAKGMV RDSNLYVHHD IMLPAFPLCV EWINFKPGSE DGSNVGNFAA VGTFDPQIEI WNLDYIDKAF PDLILGEPDA NSFAGAGKKN KKKKKKSQHV TTHHTDAVLS LSHNRIHRSV LASTSADHTV KLWDLNNGTA VRSLNTIHNN KTVASSQWHS QEASILLTGG YDSTVGITDV RISDASSMTK SYNVAPGEEV ENVHWGHSSV PEIFYAGTDS GNVYCFDVRQ MEKPLWTLHA HDSGISSLDV NAHIPGMLIT SAMSEKTVKL WKAPVESGKG PSMVLSRDFG VGNVLTTSYA GDIEVAGNLT VGGVSGALKM WDSFSNSSVR NSFRDELRQL QRLARDEAKQ VGRASRIARK YQGSTGESVM TVEAGGLEDD SDSDGEDAND DMEDED
|
| |