Gene PICST_39580 details


Gene Information

Locus tag: PICST_39580
Symbol:
ID: 4851704
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 2595600
End bp: 2596730
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table:
GC content: 47%
IMG OID: 640393412
Product: predicted protein
Protein accession: XP_001387070
Protein GI: 126275321
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
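
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes the terminal stop codon. The helper names `span_length` and `codon_count` are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 2595600
end_bp = 2596730
gene_length_bp = 1131
protein_length_aa = 376


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def codon_count(length_bp: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if length_bp % 3 != 0:
        raise ValueError("length is not a multiple of 3")
    return length_bp // 3


# The coordinate span should equal the reported gene length.
print(span_length(start_bp, end_bp) == gene_length_bp)

# Assuming the last codon is a stop codon, the remaining codons
# should equal the reported protein length.
print(codon_count(gene_length_bp) - 1 == protein_length_aa)
```

Under these assumptions both checks agree with the record: the span is 1131 bp, and 377 codons minus one stop codon gives 376 residues.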


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTCCGT TCAGAACAAG AGGTTACAAC GGCTATGGAG TACAATATTC GCCATACTTC 
GATAACAAGC TTGCTGTGGC TACAGCTGCT AATTATGGAT TGGTCGGTAA TGGACGGCTC
TTTATATTGA ATATAGAGCC CAACGGCACC ATTGTAGAAC AGACATCGTG GGAAACCCAG
GACGGACTCT TCGATCTTGC GTGGAGTGAG GTTCACGAAA ATCAGGTAAC AGCAGCCAGC
GGCGATGGGT CCATAAAATT GTTTGACTTG ACGGTGGGAC AATTCCCTGT CATGAACTGG
AAGGAGCATA CGAGAGAAGT TTTCTCTGTC AACTGGAACT TGGTGGATAA AACTAACTTC
ATCTCTGCAA GCTGGGACGG ATCTATGAAA GTGTGGTCAC CACAGCGTCC AGATTCGCTT
TTGACCTTGA GCCATGCACA GGACTTCACC ACCAAATCTC TGCCTGTAGA GCTGACTGCC
AGACCACCTT TATCGCATCA ACAACAACAT CAACAGCTGC AACATGTGAA CACAGCTAAT
TGCATCTATA ACGCTACCTT TTCTCCGCAT TCACCATCAA CTGTAGTTAG TGTAAATGGA
TCTTCCCACG TTCAGATATG GGATATAAGA GCACCCAGAC CCTTACAAAT AGATTACGTT
GCCCACGGGG GTCTTGAAGC CCTTTCGTGT GATTGGAACA AGTACAAGCC CACGATTATA
GCATCAGCTG GTACTGATAA ATCAGTGAGA ATATGGGACT TAAGGATGAT CACCAAAATC
GACCAACCAC ACGCCCATGC TCCTATGCCT GCGTACCACA TCAGAGGTCC TACTCCCTTG
AACGAACTTC TTGGTCATCA GTTTGCTGTT AGAAAAGTAC AATGGTCTCC TCACGATGGC
CAGGAATTGA TCAGTACTTC CTACGATATG TCCGTGCGAG TTTGGAGAGA TGAGTCTAAC
GAGAGAGCCA GATTCTTGAA CATGAAAAAT GGAGGCTGCA AGGGTGTTAT GGGGCAGCAC
AAAGAGTTTG TCATTGGTTG TGACTACAGT TTGTGGGGAG AACCAGGTTG GGTGGCGTCC
ACAGGCTGGG ACGAAATGGT GTATGTTTGG GACAGCAAGA GGTTACAGTA G
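
The gene sequence above is printed in blocks of ten bases, so it has to be joined before its length or GC content can be compared with the Gene Information fields. The following is a minimal sketch of that step. It assumes the usual definition of GC content as (G + C) / length, which the record does not confirm, and the function names `clean_sequence` and `gc_fraction` are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Join a space- and newline-grouped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases, assuming GC content = (G + C) / length."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, used only as a small demonstration.
first_line = "ATGCTTCCGT TCAGAACAAG AGGTTACAAC GGCTATGGAG TACAATATTC GCCATACTTC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases on the demonstration line
print(round(gc_fraction(seq), 2))  # GC fraction of the demonstration line
```

Passing the full gene sequence block to `clean_sequence` and then to `gc_fraction` gives values that can be compared with the reported gene length and GC content.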
 
Protein sequence
MLPFRTRGYN GYGVQYSPYF DNKLAVATAA NYGLVGNGRL FILNIEPNGT IVEQTSWETQ 
DGLFDLAWSE VHENQVTAAS GDGSIKLFDL TVGQFPVMNW KEHTREVFSV NWNLVDKTNF
ISASWDGSMK VWSPQRPDSL LTLSHAQDFT TKSLPVELTA RPPLSHQQQH QQLQHVNTAN
CIYNATFSPH SPSTVVSVNG SSHVQIWDIR APRPLQIDYV AHGGLEALSC DWNKYKPTII
ASAGTDKSVR IWDLRMITKI DQPHAHAPMP AYHIRGPTPL NELLGHQFAV RKVQWSPHDG
QELISTSYDM SVRVWRDESN ERARFLNMKN GGCKGVMGQH KEFVIGCDYS LWGEPGWVAS
TGWDEMVYVW DSKRLQ