Gene PICST_42488 details


Gene Information

Locus tag: PICST_42488
Symbol:
ID: 4837223
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2279215
End bp: 2280618
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 12
GC content: 44%
IMG OID: 640388538
Product: predicted protein
Protein accession: XP_001382667
Protein GI: 150863998
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
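
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function name `feature_lengths` is hypothetical and not part of any database tool.

```python
def feature_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is the stop codon, which does not add a residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(feature_lengths(2279215, 2280618))  # (1404, 467)
```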


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.206932
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG ACAAGTCCCA GATCAAGATT AAGTTCTTCA CGAACGAAGA AGATGTCTCG 
TTGCAAGTTT CAGATGCTCC TTTGTATGTT CCAGTGTCAT TGAAGAGATA TGGCTTGTCA
GAAGTAGTGA ACCAGCTCTT GGGAAACGAT GGGGAGAATG ACGATTCGAA GCCAATACCG
TTCGATTTCC TCATAGATGG TGTATTGTTG CGTACTTCGA TCCAGGACTA TTTGACGAAA
AATGGACTTT CCAGCGAAAC GTTCTTGTCT TTAGAATACA CAAGAGCTGT ACTTCCACCT
TCTTTCCTTG CATCTTTCAA TAACGAAGAT TGGATTTCCT CTCTTGACAC GATAAACAAG
ACTTTGCCCA GCGTTACATT GTCGAACATG ATGATTTCAC AGCCCAAGAT CTTGTCCGGC
TCATATGACG GTATAGTTAG AACTTACAAC ATGTCTGGAA ATGTAGAGAA GCAATATGTG
GGCCATTCTG GTCCCATTAG AGCCGTCAAG TGGGTTTCAC CTACTAGAAT CGTTTCGGCT
GGTAACGACA GACAAGTAAG ATTGTGGAAA ACGTCTGCTG ACGATGGAAG TATACCCGAA
GAGGACGAAG AAGCTGAAGA CGGTAGAACG TTGGCTATTT TAGAGGGTCA CAAGGCTCCC
GTAGTGGCAT TGGCTGTCGA AAACACTTCC AACAGGATAT TGTCTGCTGG TTACGACCAT
TCTATTGGAT TCTGGTCTAC AAACTATAAG GAAATGACGA CTATACAGCC TTTAGAATAT
GATTCTAATG TTTTATCATC GTCGTCCAAG AAGAGAAGAA AGATGGCTCT TCAAGATTCG
ACTATTAGAC GTCGTTCTCC ATTGGCTCTT TTGGATAGCC ACACTCAACC TGTAGAAGAT
GTTATTTTCG ACAACACCGA CGCCACCGTT GGTTACTCTG TATCCCAAGA TCACACCATC
AAAACATGGG ATTTGGTTAC TTCTCGTTGT ATCGATACCA GATCTACCGG CTATTCATTG
CTCTCTATCG TGCAGTTACC CAAACTGAAG TTGTTGGCTA CTGGTTCTTC TGCTCGTCAT
ATCAACTTGC ACGATCCCAG AATATCCAAC AACACCACGG AACAGACCAC TTCCAAACTC
GTGGGCCATA CAAACTTTGT GGTCAGCTTG GCTGCTTCAC CAAATAATGA TAACATGTTT
GCATCTGGTT CCCACGATGG CACTGTCAAG GTTTGGGACA TAAGAACAGA TAAATCTTTG
TACACTATCA CTCGTGAATC ACCAGAAGCT GTCAAGGGTG CCGACAAGGT GTTTGCAGTT
TCGTGGGACA ACGAGATCGG TATCATCAGC GGTGGCCAGG ATAAGAAGAT CCAAATCAAC
AAGGGTAGCG ACATATCTAA GTAG
 
Protein sequence
MSDDKSQIKI KFFTNEEDVS LQVSDAPLYV PVSLKRYGLS EVVNQLLGND GENDDSKPIP 
FDFLIDGVLL RTSIQDYLTK NGLSSETFLS LEYTRAVLPP SFLASFNNED WISSLDTINK
TLPSVTLSNM MISQPKILSG SYDGIVRTYN MSGNVEKQYV GHSGPIRAVK WVSPTRIVSA
GNDRQVRLWK TSADDGSIPE EDEEAEDGRT LAILEGHKAP VVALAVENTS NRILSAGYDH
SIGFWSTNYK EMTTIQPLEY DSNVLSSSSK KRRKMALQDS TIRRRSPLAL LDSHTQPVED
VIFDNTDATV GYSVSQDHTI KTWDLVTSRC IDTRSTGYSL LSIVQLPKSK LLATGSSARH
INLHDPRISN NTTEQTTSKL VGHTNFVVSL AASPNNDNMF ASGSHDGTVK VWDIRTDKSL
YTITRESPEA VKGADKVFAV SWDNEIGIIS GGQDKKIQIN KGSDISK
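
The record lists translation table 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The sketch below shows one way to reproduce the protein sequence from the gene sequence under that table. The names `CODON_TABLE_12` and `translate_table12` are hypothetical helpers written for this illustration; the amino-acid string follows the NCBI genetic-code layout with bases ordered T, C, A, G. This is a minimal illustration, not the pipeline that produced the record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for table 12 in NCBI order (first base varies slowest).
# It differs from the standard code only at CTG, which encodes S.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence from this record.
first_line = "ATGAGCGACG ACAAGTCCCA GATCAAGATT AAGTTCTTCA CGAACGAAGA AGATGTCTCG"
print(translate_table12(first_line))  # MSDDKSQIKIKFFTNEEDVS
```

The in-frame CTG in the gene sequence (within `CCC AAA CTG AAG`) corresponds to the serine in `LPKSK` of the protein sequence, which is where the standard code would give leucine.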