Gene PICST_83842 details

Gene Information

Locus tag: PICST_83842
Symbol: PRW1
ID: 4839288
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1606961
End bp: 1608803
Gene Length: 1843 bp
Protein Length: 514 aa
Translation table: 12
GC content: 41%
IMG OID: 640390603
Product: conserved hypothetical protein with WD repeats
Protein accession: XP_001385309
Protein GI: 150865904
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
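
The start, end, and length fields above agree if the coordinates are read as 1-based and inclusive (1608803 - 1606961 + 1 = 1843). The short Python sketch below reproduces that arithmetic and shows one plausible way to compute a GC percentage from a sequence string. The function names `inclusive_length` and `gc_percent` are hypothetical, and the GC calculation is an assumption about how such a figure could be derived, not a description of how the database computes it.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a sequence.

    Whitespace (such as the 10-base grouping used in the sequence block)
    is removed before counting. This is an assumed definition of GC content.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp and End bp fields of this record.
print(inclusive_length(1606961, 1608803))  # 1843
```

Passing the gene sequence listed in the Sequence section to `gc_percent` gives a value that can be compared with the GC content field.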


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAAAGA TTTCGGAGTT TGAAAGGCAG AGACAGGAAA ATATCCAGCG GAACAAGGAG 
CTCTTGAAGC TGTTGAATTT GGACTCTCTT TCTCAACTGA TTAAGAGAGA GCTTCCCAGA
GCCAGCGAGA CCAAAAAAAG GAAAACAACT CCAAGAACAA AAGCTGTCAA AAAAGAGGAT
GTCGAACCGT CAAGAAGATC ACGCCGGATT GCCGGAATCA AGTCGGAATT GGAAAACCCG
GAAGAGTACA ACCGTATAAG AGAAGAAGAA GAGGAAACCG AGAGAAAGAA GCGTGAACTT
GAAAAGTTGA AGAGAACTAG ATTGTTCGGA GAGTTCAGTC TTATTGATTT GGTCACAGAC
AAAAAACTGG GGAGTTTGAA ATTTGAAGAT AAAGTCATCA AATCCGATTC GACTGAACCA
GAAGTGAAGC AAGAAGAGAA AGAAGAACTA AGCGAAGACA TCAAAAACGA TAATAAAGTA
CTCCATAGAT TGCAAGCTCT TGGAGACAAG TTTTCTGCTG GAGATTTCTT TGATATAATT
CAAAAGAATC CCATCCAGTA CGACGATAAA GTATTACAGT CGACTCGGGA CGAGTTTGAT
AAGTTGAAGA TCTACGAGAA ACACAATCCT CTCGATATCA AGATCTCACA CACCAGGATC
ACAGCTATCA ATTTCCACCC TTCAACGACT GATAGAGTTG TGGCTGCTGG AGACACTAAT
GGTAATGTGG GAATCTGGGC TGTAGACTCG GGTGAGGACG ATTCCGAGCC TACCATATCG
ATTTTAAGAC CTCATGGTAA GGCTATATCT CGTATCTTGA CTCCTGTAGC TGAACAAAAT
AAGCTATATT CGGCTTCATA TGACGGTTCT GTTAGAGTGT TGGATTTGAA CAAGTTGGCG
TCGACAGAAG TGGTGTATCT TAATGATCCA TACGAAAACG ACGATTATGC CTTGGGAGTT
TCCGATATCA ACTTCTGTGC CTCAGATGCC AATCTCTTAT ACATGACAAC ATTATCCGGA
AGTTTCCATA AGCACGACAT TAGAACGCCA TTCAAACCTC TCAAAAGCAA AGATATACTT
CGTTTGCATG ATAAGAAGAT CGGTTCTTTC TCTATCAATC CTAACAACAC CTATCAAATA
GCCACAGCTT CGTTGGATAG AACCTTACGT ATATGGGACT TGAGAAACGT CTCGAAAGCC
AATGCCGAAT GGTCAGAATT TGAAAACCAA ATCTCACCGC ATTTATATGG TTCATTCTCA
TCAAGGCTCT CGGTGTCTTG TGTGGACTGG AATAGCGAAA ATCGACTCGT CTGCAATGGT
TATGATGATT ATATCAACAT CTTTGATTTA AGTGGTTCAG AAGAATTGCC CGCAGTGACA
GAATGGGAGT CAGACTTTCA GCCTAATGTA GCAAAGAAAT CCCGAAAGAG AAAGACCGAC
GAAGACGAAG AATCGCTTAT TCCGGACAAT TTGAAAGCAT TCAACAAAAT CAAACACAAC
TGTCAAACTG GTAGATGGGT GTCGATTCTT AAGTCCAAAT GGCAGGTTGC TCCAGAAGAC
GGTGTGCAGA AGTTTGTCAT TGCCAATATG AACCGAGCTT TAGATATCTA TGATCAAAAG
GGTCAGATAA TAGCCCACTT AACAGACTCT GTAGGAGCAG TTCCAGCAGT CTGTGGATTT
CATCCTACAA AGAATTGGGT TGTAGGAGGA AGCGCTAGCG GAAAGGTGTA CCTATTTGAA
TGACCTTTGA GGTTAACTTT GTTAGATGAA GGTAACCAGA CTGACAAGTC TACACAATCT
CGCAACTACA GTAGAAACAA TTAATATACA CGTTTTTTAC TTG
 
Protein sequence
MAKISEFERQ RQENIQRNKE LLKSLNLDSL SQSIKRELPR ASETKKRKTT PRTKAVKKED 
VEPSRRSRRI AGIKSELENP EEYNHKKSGS LKFEDKVIKS DSTEPEVKQE EKEELSEDIK
NDNKVLHRLQ ALGDKFSAGD FFDIIQKNPI QYDDKVLQST RDEFDKLKIY EKHNPLDIKI
SHTRITAINF HPSTTDRVVA AGDTNGNVGI WAVDSGEDDS EPTISILRPH GKAISRILTP
VAEQNKLYSA SYDGSVRVLD LNKLASTEVV YLNDPYENDD YALGVSDINF CASDANLLYM
TTLSGSFHKH DIRTPFKPLK SKDILRLHDK KIGSFSINPN NTYQIATASL DRTLRIWDLR
NVSKANAEWS EFENQISPHL YGSFSSRLSV SCVDWNSENR LVCNGYDDYI NIFDLNEESL
IPDNLKAFNK IKHNCQTGRW VSILKSKWQV APEDGVQKFV IANMNRALDI YDQKGQIIAH
LTDSVGAVPA VCGFHPTKNW VVGGSASGKV YLFE