Gene PICST_28605 details

Gene Information

Locus tag: PICST_28605
Symbol:
ID: 4851373
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1631826
End bp: 1633697
Gene Length: 1872 bp
Protein Length: 623 aa
Translation table:
GC content: 42%
IMG OID: 640393081
Product: predicted protein
Protein accession: XP_001387546
Protein GI: 126274414
COG category:
COG ID:
TIGRFAM ID:
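
The coordinates and lengths listed in the Gene Information fields can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS length counts one terminal stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1631826
end_bp = 1633697
gene_length_bp = 1872
protein_length_aa = 623

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one stop codon that is not part of the protein.
total_codons = gene_length_bp // 3
coding_codons = total_codons - 1

print(span_bp == gene_length_bp)           # True
print(gene_length_bp % 3 == 0)             # True
print(coding_codons == protein_length_aa)  # True
```

Under these assumptions the three reported values agree: 1,872 bp spans 624 codons, of which 623 encode amino acids.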


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAAAAC GTAGCAGGAG AACGCGAAAA ACGACGCCAA ACTACAAAGA AGTGGTTCCA 
GACGAGGTGG TCGAAGTTTC TGAGGAAGAA GAAGAGGATT TGGTTGAAGA TGACGAGACA
GAAGAAGAAA TCAGAGCTTT CAAGAAACCA AAATCTCAAT TGAAAACAAC TACTCAGACT
GAATTGAAGT TCAACGCTAC AAAACAAGTA CAACCAGCTA AAATAATCAA TGATTTAGTT
CCGAGGGTCA CTTCTAATCA AGTTGAACAA CTGCAACTGT TACAATGGAT CGATAAATAT
GCTCCGTCGA AATCTAACGA TATCTGTATA AATCCACAGA AACTCAGACA AGTACGACAA
CTTTTGTACG ATATGATCTT GAATAAGTCC CACAAACGGT TGTTGGTATT GACAGGACCA
GCTGGAAGTT CTAAATCAAC TACCGTAAAG CTTTTAGCCG ATGAGATCAT TGCTAGTTTA
CCTGATCACC AACTTGACGA ATACGGTTTG CTAGGTACGT CATCTGCTGA TCCTCATTGG
ATAGAGTATT TGGATGGGAA CAGTGTTGAA GGTACCAATC AATCGGATCT GTTTGAGGAG
TTTCTCACGG ATGCGAAGTA CCGGGTGGGC TCGAATATGG CTGTAGTGTT GATCGAAGAG
TTACCCAATG TCTTCCACTA CGAGACGCTA CTCAAGTTCA GAAACAAGAT AAGAGAGTGG
ATTTATTGTA ACGAAACCCT CCCGCCCCTC GTGATATGTT TGACAGATGT AGAATATACT
TCAGAACAAG GAACTCGTGA CTACTCGTAT ACTATAGACA ATAACCTCAC CATGGACACA
CTCTTAGGCA AGGAGTTGGC CAATCTGGCA CAAGTAGAGC ACATCAAGTT CAATTCGATA
GCCAATCGCT TTTTGAAGAA AACGATTGGG CAGTTAGTGA AGTCAGAAAG AAACGTATTT
AACAAAATTC CGTATAAGGA ACTTACAGAA TTTATGGACG AAATCATCAA GATAGGTGAT
ATTCGTTCAG TAATCTTCAA CTTAGAGATG TGGGCTACCA ACTACCTGAA GGAAGCCAAG
TGGTATAACC GTGAGAACCA GATAAATTTG TTCCACGCCA TTGGTAAGAT AATCTACCTG
AGTTCCGAGT TCAGTGAACT ATCTAGCGAT GAACGTGATT ACAATTCTGT AGAACAGGTT
TTGGCCAACT ACACCAACAA CCAGTTGCTA AACATGGCTA TCCTTGAGAA CTACCTGATA
TATAATGATG CCCACTTTCC AGTAGAATGC GGAAGTGCGA TTGTGGATGC TCTTAGTCTT
AATGACCTCT TGACTGGCGA AGCCAATCGA GAGTATGCTA TTAGGAGCAC AAGGCAAATC
TTGCAGGCGA TTCCTGCTAG TTCCAGCACC CGTAAGTCGA AGGTGAAGTT CCCCCGTCAA
TTCAAGATGG TCAGACAGTA CAACAAGACA TATGCCATGA TACGTTCGTA TCAGCGGTAT
ATCGATCATG AGCTGTTTCA GAATCTAAAC TTGCTTGATG GCTACTATCT TCCGCTTATT
TACAACAAAA GGCTGAAGAC GAGATTTCGC TACAACCGAT TGGGAGGTCG ATTTCACGAG
ATACATGCTG ACGAGTCTAT GGGAACTAAC GAAGAAGAAG AAGAGTCCCA ATTCTATAAT
AGGGACCAGT TCGAGATTGA CATTACAGAA AAAATCGCAG AGGAAGAAAG AGGTGACTAT
AGTGACCAAG ACGAGTTGAG CGAAGAAATC TCTGATTCTG AAATTGAAGG TGATCAAGGT
GATGACGACG ACGAAGCAGG GTTTCTGTCT GATCCCGAGT TGGATGTGCT TATATCACAG
GGAAGGCTAT GA
 
Protein sequence
MVKRSRRTRK TTPNYKEVVP DEVVEVSEEE EEDLVEDDET EEEIRAFKKP KSQLKTTTQT 
ELKFNATKQV QPAKIINDLV PRVTSNQVEQ LQLLQWIDKY APSKSNDICI NPQKLRQVRQ
LLYDMILNKS HKRLLVLTGP AGSSKSTTVK LLADEIIASL PDHQLDEYGL LGTSSADPHW
IEYLDGNSVE GTNQSDLFEE FLTDAKYRVG SNMAVVLIEE LPNVFHYETL LKFRNKIREW
IYCNETLPPL VICLTDVEYT SEQGTRDYSY TIDNNLTMDT LLGKELANLA QVEHIKFNSI
ANRFLKKTIG QLVKSERNVF NKIPYKELTE FMDEIIKIGD IRSVIFNLEM WATNYLKEAK
WYNRENQINL FHAIGKIIYL SSEFSELSSD ERDYNSVEQV LANYTNNQLL NMAILENYLI
YNDAHFPVEC GSAIVDALSL NDLLTGEANR EYAIRSTRQI LQAIPASSST RKSKVKFPRQ
FKMVRQYNKT YAMIRSYQRY IDHELFQNLN LLDGYYLPLI YNKRLKTRFR YNRLGGRFHE
IHADESMGTN EEEEESQFYN RDQFEIDITE KIAEEERGDY SDQDELSEEI SDSEIEGDQG
DDDDEAGFLS DPELDVLISQ GRL
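
The gene and protein sequences above can be compared by translating the nucleotide blocks codon by codon. The following is a minimal sketch, not the method used to produce this record: it assumes the standard genetic code (the Translation table field is blank), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last blocks of the gene sequence are used as input.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (an assumption,
# since the record does not state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used in the 10-base display blocks."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)

first_blocks = "ATGGTAAAAC GTAGCAGGAG AACGCGAAAA"  # first 30 bases of the gene
last_blocks = "GGAAGGCTAT GA"                      # final 12 bases of the gene

print(translate(first_blocks))  # MVKRSRRTRK
print(translate(last_blocks))   # GRL
```

The two outputs match the first ten and last three residues of the protein sequence shown above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.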