Gene PICST_51666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_51666 
SymbolPFS2 
ID4851082 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp887884 
End bp889548 
Gene Length1665 bp 
Protein Length541 aa 
Translation table 
GC content47% 
IMG OID640392790 
Productpolyadenylation factor I subunit 2 
Protein accessionXP_001387402 
Protein GI126274068 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGGCA ATCGGTCCAA CGGGAACGGA TACAACTCCA ACTCGTATGG AAATAACAAC 
GGCCGTCCTT CGTACAATAA CAGATACAAC AATAACAGCA ACGACCCGAA GGTTCAGGAA
AAGCTAGCAG CGTACAACCA GAACATCGAA CAACAATTAG CATCTCAAGA AAAAAAGACA
GCACATCGTA GAATCGTAGA CCATGGCAAC AATATGGGAA GGTGGTATAT CCACAAGAAC
TTGGGGCTTT CTCAGCGACA ACAGGCTATA GGTAGCATCA GACCTGAGTC CTCGTATTTG
ATAGACCTTC TTCCAACTCT CGCATATTCG TCGTCTTCCA ACTTAGGGGC GGCCAACAAC
AAGAACAACA TGGCTGTGAT GGATATCCAG ACCAAATTCG TGCATTTGTC GTCCAACAAG
GTTAAACATT CTATCAATGC TGTGAAATGG ACCCCTGAAG GTAGACGTCT ATTGGTAGCA
TCACATAGTG GAGAGTTTAC TATCTGGAAC GGGATGACCT TCAACTTCGA GACAATTATG
CAAGCGCATG ATTCGCAAAT TCTCGCATTG CAGTACTCGC ACAATGACGA GTGGCTCTTG
TCTGGTGATC TGAACGGTGT CATCAAGTAC TGGCAGCCCA ACTTCAACAA CGTCAATATC
CTCAATGGCC ATACTCAGGG AATCAGAGAC ATTGCGTTTT CACCTAACGA CTCTAAATTC
TTGACCTGTG GTGATGACTC CACCTTGAAA ATCTGGAACT TCAACAACGG TAAAGAGGAA
CGTTCCTTAG CTGGACATCA CTGGGAAGTC AAGTCTGCTG ACTGGCATCC CAACTTGGGG
TTGATCGTCA GTGGATCCAA GGATAACTTG GTCAAGTTGT GGGATCCTCG TAGTTCTACC
TGTGTCACAA CATTGCATGG ATTCAAACAT ACAGTAAACA AGTGTAGATT CCAGCCTACA
GGTACCAAAA GGTTGTTGGC ATCTGTTTCT CGTGACAGAT CATGTCGTGT TTTTGACTTG
CGTACAATGA AAGATATCTT AGTCTTGAGA GATAGCGAGA CCGATTTGTC ATGTGTCTCG
TGGCATCCTA CACATGCATC GATGCTTACT ACAGCTGCCT ACAACGGATC CATGAGTCAT
TACCTTCTCG ACTCCTATAT TCCTGATAGC AACACCAGTG AGCTTTCCAA AAAGTCTACC
TCTTACGGCT CGTCTTCCGT AGGCTCTATA GAAGCAGTGC ATAGAATCCC ATATGCACAC
GAAAGAGCCA TCCATGCCTT AGAGTACCAC CCCTTGGGCC ATTTGTTGTG TTCTGCTGGT
TCAGACAAGA CCGCTAGATT CTGGTCCCGC GCAAGACCCA ACGATCCAAT GTCGTACAAG
GACCCATTGT ACACAGACGA CAAGCATGGA GCATGGTACT ATTCGGTGAA CAACAACATC
AATGCTGTTA TCGAAGATCC GAGTGGCTCC AGTGGCACGG CTACAGACTC ACTACCAGTG
CCGTATGGAG AGGACAGAGA CCGTTCGCAT ACCCCAGGAC TCAATTTGCC AGGATTGGGT
TCGAGCTACG ACTACAACGG AAATGGCAAT AGCGGGAATG GCGATATTGC AGCCACTCCG
GCCCCTAGCA ACTGGGGTTC CATTCCTGGC TTACGGGGTC ATTAA
 
Protein sequence
MYGNRSNGNG YNSNSYGNNN GRPSYNNRYN NNSNDPKVQE KLAAYNQNIE QQLASQEKKT 
AHRRIVDHGN NMGRWYIHKN LGLSQRQQAI GSIRPESSYL IDLLPTLAYS SSSNLGAANN
KNNMAVMDIQ TKFVHLSSNK VKHSINAVKW TPEGRRLLVA SHSGEFTIWN GMTFNFETIM
QAHDSQILAL QYSHNDEWLL SGDLNGVIKY WQPNFNNVNI LNGHTQGIRD IAFSPNDSKF
LTCGDDSTLK IWNFNNGKEE RSLAGHHWEV KSADWHPNLG LIVSGSKDNL VKLWDPRSST
CVTTLHGFKH TVNKCRFQPT GTKRLLASVS RDRSCRVFDL RTMKDILVLR DSETDLSCVS
WHPTHASMLT TAAYNGSMSH YLLDSYIPDS NTSELSKKST SYGSSSVGSI EAVHRIPYAH
ERAIHALEYH PLGHLLCSAG SDKTARFWSR ARPNDPMSYK DPLYTDDKHG AWYYSVNNNI
NAVIEDPSGS KDRDRSHTPG LNLPGLGSSY DYNGNGNSGN GDIAATPAPS NWGSIPGLRG
H