Gene PICST_29180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29180 
Symbol 
ID4851912 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3160762 
End bp3162279 
Gene Length1518 bp 
Protein Length505 aa 
Translation table 
GC content42% 
IMG OID640393620 
Productpredicted protein 
Protein accessionXP_001386933 
Protein GI126276000 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.648326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTAG TGCAGGGGTA TAGTTCTAGC GAAGAAGAAG GGGTCCAATT GCCCCAGTTG 
CCGGTATATG ATATTCGGAC CTATAGCGAG AAACACTCTG CGAAATCTGA GAATGAGTCT
GAAGCTATAG CCACTGAAAA TTCGAGGAAA AGAAAGGCTT TTGGAGCTAC TATTGAAGGT
GCTTACTATG ACAGAGCTAC ATTTGAACTC CAAGCGAAAT TGGAGCGAAG AAACAAACTG
GCATCGCAAG AAGTGAAACT GAAAGCTAGG AAAATCAAAA AGAAAAGGTC TAAGAACGGA
AGCGATGACG ATTATTTAGG ACCCTGGGCC AGATATGAAA GCGAGTCTGA AGATCTAGAT
CAAGAAAATG AAGCTGAAGT TAAAACTGAA GAATATTACA ATAATGACAA GAAGAATGAA
CAGGAGAGTG ATAATGAAGC TTCCAATGTA GGCTCTGATA ACGAAAATGA AAATGATCCA
AAGTCAACAA CTGAGTTTTT GGGTTCACAA GAACACGATT ATCTTGGACG AACTTATATG
CATGTATGGC GAGACTTGCC TATTGATCTA AGCAAAGAAC CAAGTACTCA CGAATGCTTC
GTTCCCAAGA AAGTCATCCA TACATTCCTG GGACATCCCA GGGGCGTCAA CAAGCTTGAA
TTCTTTCCCA AATCGGGACA TCTTCTTCTA TCTTGTGGTA ACGACGGAGA AGTCAGACTC
TGGGACTTGT ACCACAAATT TGAGCTTCTC AGGGTGTTTC ATGGCCACAG TCAAGCTGTA
AAGGATGTTA CATTCAACTC GTCTGGCACT GAGTTTCTAA GCTGTGGGTA CGACAAAAAA
GTTATTCTTT GGGACACCGA GACGGGTGAA ATTAAAAAGA GTCTACGAGT AAAGGCTATT
CCGAATGTTC TTCGATTCAA TCCCAAAAAT GAAGACGAAT TCATAGTAGG ATTAAGCAAT
AACGATATTG AGCACTATGA TCTTTCTTCT TTGGACTTCC ATACTCCCGT TCAAACCTAC
AATCACCACT TAGGAGCCAT CAATTCTTTA ACTATTATCG ATGATAACAA TAAATTCATG
TCTACAGGTG ACGACAAAAC AGTACGGTTC TGGAATTGGC AGATCAACAT TCCCATCAAG
TTCATTTCCG ATCCGTCACA GCATTCTATG CCTGCCGCTG CAATTTACCC TGGAGGTAGC
TTCATAGCGT TGCAGAGTAT GGACAATTCG GTAAAGGTAA TTCAAGGACA CGGAAAGTTC
CGGTTCAACA AAAAGAAAAC TTTCCGAGGC CACAATGTTG CTGGTTACGG AATCGGTCTC
GATATCTCGC CAGATGGTAA GATCCTCATG AGCGGCGATG CCAAGGGGTG TGGCTATTTT
TGGGATTGGA AGACTTGCAA GCTTGTAAAG AAGTTGAAGG TTTGCGATAA ACCCATCAGC
TGTATCAAGT TCCATCCCCA GGAATCTAGC AAAGTTGTTC TAGCAGGAAT CACAGGGGAA
ATCTATTTCT GTGATTGA
 
Protein sequence
MSLVQGYSSS EEEGVQLPQL PVYDIRTYSE KHSAKSENES EAIATENSRK RKAFGATIEG 
AYYDRATFEL QAKLERRNKL ASQEVKLKAR KIKKKRSKNG SDDDYLGPWA RYESESEDLD
QENEAEVKTE EYYNNDKKNE QESDNEASNV GSDNENENDP KSTTEFLGSQ EHDYLGRTYM
HVWRDLPIDL SKEPSTHECF VPKKVIHTFL GHPRGVNKLE FFPKSGHLLL SCGNDGEVRL
WDLYHKFELL RVFHGHSQAV KDVTFNSSGT EFLSCGYDKK VILWDTETGE IKKSLRVKAI
PNVLRFNPKN EDEFIVGLSN NDIEHYDLSS LDFHTPVQTY NHHLGAINSL TIIDDNNKFM
STGDDKTVRF WNWQINIPIK FISDPSQHSM PAAAIYPGGS FIALQSMDNS VKVIQGHGKF
RFNKKKTFRG HNVAGYGIGL DISPDGKILM SGDAKGCGYF WDWKTCKLVK KLKVCDKPIS
CIKFHPQESS KVVLAGITGE IYFCD