Gene PICST_32842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32842 
Symbol 
ID4840353 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp752161 
End bp753945 
Gene Length1785 bp 
Protein Length594 aa 
Translation table12 
GC content38% 
IMG OID640391668 
Producthypothetical leucine rich repeat protein 
Protein accessionXP_001385504 
Protein GI150866036 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.548353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTTTT CCATTATAAA CAAAGTTAAG CTGTTTCCTG CTGAGATTCA ATGTCAAATA 
ATCGATTATT TAGCTTTTGA CGACTTGGAA AGCTTGCTCA GAATACAAAC ACCTTTTCAA
GACCATGTAG TTGACTTGCT TTTTCAGTAT ATAGTGCTAA ATGATTGTGA ACCCAACGAC
GAGTCTATCC AATGGAAATG GCATACAGTG GGTATGTTAG TTGATCTTTT CCAAAGATAT
CCGAAGATGA CATTCTCTGA AATCTTGGGT GATCTCACCA GCATATATGA GTTACATTTG
GCAAGACCTG GTATCCTTAA CAGATCAACT ACAGTTACTA TAAGGTGCGA TTGTATGGAA
GGCGAAGATG CCGCAGACCT ATTGAAAGAG CTCTGCAGTC ACTCGTACTC TATCAATATA
GAAGAATGTT CTATCGAGAG CAAATCTATG GTTGAGGCTA TTTCTCCCAC TTTGACAGGG
TTAGGTATGT TCCAGGAAAG ATTCAAAAAG GGTTATGATC TCAGCTATGA GCTTACCGAG
TTGGGAACTT GCTCTAATCT TAGACAATTG GAACTAACTG AAATGCCAAT TACCTCTTCT
CAGATAAGGT TCCTACCAAG ATGCCTAGAG AAATTGACTT TTGAATTAGA AGTCGAGAGC
TTTTCATCTG TCCATGTATT AGAACTTCCT AATTGTCTCA TATATCTTTC CATGAAGCTC
TCCCAAACTA GTAGTTCCCT TAAAAATGTA GAAATCAACG TCGAGTGCCT TTCAAATTTG
AGAATACTCG AGTTGAAATA CTTCCAAATA CATTCTTTGT CATCTTGGAT CCTCCCAAAG
GGAGTTGTTC AATTGTCAGT AATATCATGT GGAATTAAAG ATCTCTCAGG ATTGCTGAAC
TTGTGCAACT TGAACAAGTT GGTACTACAA CTAAATCCAA AAACTGAAAT CGACTTCACC
GTATTGCCCT CTAGTATCAG AAGGCTATGG TTGTATTGTG AAAGTGTACC TAACGAAATA
TGTCAGTTGT CATCTTACAT TCAGTTAAAT CTTGTAGTTA CAAGTCTAGG ATCTGACAGA
GATTTCTCAA AGATCAAGAA CTTGAATAGC CTATATCTCA ATGGAACGTC GTGGGCCACT
TCTACAAACT TCACGTCTCT CGGTGACTTA AAACTTCCGT CGCAATTAAA GGAAATATCA
TTTAACATGT TCCTAGGTCT ACAAAACATT CGTGATTTGC AATCTGGAAA TTGTTTATCG
TCGTTAGCCC TTAATGGAAA GTTTGGCCCA ATGCTAGTCA AGTCACTTCT AAGAGGCAAC
GTTTTTCCAG AATCACTAGT GGATATACGA TTGTTTATTG AGTTGACGAG TTCAGATTGG
GTGGTTGATT CTAGAACCTA CCCCAAAGAA TACATTTCGT TCGGAGCTGA AGGTTCACAA
AATTTAAGAT TTGGGAATTC ATTGTGTTTA CCAAACAAGT TACAAAGGTT GGCTATAGTG
TGTGATTCGT TAGTAGCAGA AGGTGATCTC AATCTACCTT CTAGTATCCA AGAAGTTCAC
TTAAAGGTAT TTGCATACAT ACAGATGTCA AAACTAGTAA TTCCCGAAGC TGTAAGAAGG
ATTCTGCTCC CAGAAGTTTT ACCTAAAGCA GATTTCCATC TGTATCATTT CCCTACATCT
TTAACAGATA TTTATGTTCC TGACGAGAGT TATAAAGTAA CACTTAGGAA TACCAATAAC
ATTAAATGGC TCAATGTATA TCCTCCAGTA GCTAAAGATA TATAG
 
Protein sequence
MSFSIINKVK SFPAEIQCQI IDYLAFDDLE SLLRIQTPFQ DHVVDLLFQY IVLNDCEPND 
ESIQWKWHTV GMLVDLFQRY PKMTFSEILG DLTSIYELHL ARPGILNRST TVTIRCDCME
GEDAADLLKE LCSHSYSINI EECSIESKSM VEAISPTLTG LGMFQERFKK GYDLSYELTE
LGTCSNLRQL ELTEMPITSS QIRFLPRCLE KLTFELEVES FSSVHVLELP NCLIYLSMKL
SQTSSSLKNV EINVECLSNL RILELKYFQI HSLSSWILPK GVVQLSVISC GIKDLSGLSN
LCNLNKLVLQ LNPKTEIDFT VLPSSIRRLW LYCESVPNEI CQLSSYIQLN LVVTSLGSDR
DFSKIKNLNS LYLNGTSWAT STNFTSLGDL KLPSQLKEIS FNMFLGLQNI RDLQSGNCLS
SLALNGKFGP MLVKSLLRGN VFPESLVDIR LFIELTSSDW VVDSRTYPKE YISFGAEGSQ
NLRFGNSLCL PNKLQRLAIV CDSLVAEGDL NLPSSIQEVH LKVFAYIQMS KLVIPEAVRR
ISLPEVLPKA DFHSYHFPTS LTDIYVPDES YKVTLRNTNN IKWLNVYPPV AKDI