Gene PICST_53206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_53206 
Symbol 
ID4852003 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3396942 
End bp3400919 
Gene Length3978 bp 
Protein Length1105 aa 
Translation table 
GC content43% 
IMG OID640393711 
Productpredicted protein 
Protein accessionXP_001387222 
Protein GI126276278 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCCA ATGCCAATTC GCAGCATCTG ACCTTGGCAA AGTTTGCTTT CAACATCTAC 
GGATCGTTGA ATCACGCCAC TCCCAGCTAC GAAGGACTGG CATCCCCCTC TCTGTCGTAC
AGTTCTTCAC GATCGAAGAC CAATGAGCCT TACAACTACA GATTCAACAA ATTGGTGTAC
AATTGTGAGC GTGAAGTTAC CACACTATCG CAACTTAACT ATCCACTTTC GTTGTCGAAT
ACTTTCCAAT CGGACCAGAA CTTGTCACAT CATGTAATCA TCGGTGGTAG AAACTACTTG
AAGTTGCTAG CCTTGAACGA AGACCAGTCG CGAATTGTCC AAGACATCAA TGTTCTTGAT
CAGAGCTCGA TATACACCCA CAATTCACGT GTGCCGTCCA CAAACAAGTT GAACAACATC
AACACCGTCA AGGCGCAGTC AGACACCATA GGCTGTGGGT TATCCAATGG ACTTATCACT
GTGTATAAGG TAGGCTCCAA CGGTAAGTGC AGACTCATCC ACAAGTTTTC GGACCACAAA
CGTTGCATAA ATTCTCTAGA TTATGTGGGT ATTAGGAATT TGTATGATGC ACCGACCCAG
ATGATATCTG GATCGCAGGA TGGTTCTATA AAGCTTTGGG ACATGCGACT GTCTTCTCCC
AGACCTATGC TTACGATATC TTCAGGCAGC CATCTGGATC CCATTCGTTC ATGCCAATAC
TCTCCCCATT CACAGGGCCG TAACAAACTT GTAGTTCTCT CTGTCCACGA CTCGGGAGCG
TTGTGTAAAT TCGATTTACG ACTGTTTGGC TCCAACACCG CTGCCAACAC CAGTAGTAAT
GGCCATGGCC CAGAGAGAAA ATGGAATATC CATACTGGTC CAGCATTGTC ATTACACATC
CATCCCGAAA AGGAGTACGT TGTTACAGGA GGTAGAGACC AGAAGATTTG TGTGTTCAAC
TACGGCGATT CTCAGATATC CAATAGAATT ACACCAGACG AGATGATTAA CACTTATGGT
CCTGTAATGA AGGTACGATG GTGTCTATAT CCCGATGCAT CAACATCACA GTTTGGTGAA
CCTCTCGATA CGTTCCAGCA ATCTAACGAC TTCAACAGGT TTGAAGATAA GCTTTCGTAC
GATGAACGTG AGGCCATGTA CAGCTATCCT TCCTCCCTGC GTAGCAGTTC TCTATATAGC
TACGACTTGG CTTGTCTGTA CTTGAACGAT GACTCTACTG TAGCCATTTA CAATCTCAAC
AGGAAGTTCA TTCCTAAGGA GGTGATCACC ACGTCATCTA ATAAGCCTAT TCAGAACTTC
ATCTGGGCTA ACAACCCAGG ATCGTCGCGT AAGATCTGGA CCATAACAAA GTCTAACGTG
TTTTCCAGTT ATGATCTAGA TATGCACGAC AGCCTTCTCG AGTCTGAAAT TTCCAAACCT
TTGGACGAAC TCGCTAATGT CACTGTAGAC TGGAACAATG GATTTGGCGA TCTCTGCTTA
GCCAACCAGG AAAAATATGA GTTTGAAATC ACAGAAGTAG AATCACAGGC AAGCGACAAC
GACATGGGTG ACATAGATAC AGAGTATTCG TCCAGATATG AAAGAAGCAA CTCTAACAGT
GTCATTGATG AAAGTGATAC CAGAAGCATT GATCACCATT CACCTGAAAA CGATGCTGCA
ACTATAGCAG CTGGTGGCTC AGGAAAGGCG TTTGTTGGCT CTGTACCCAT TGCATCAAAA
ATCCACGGCT CTTTATCAGG CTCATTGATA GGGTCGTCTT CGGCAGAAAA GCCTCCTCTT
TTCAGATCAA GCACTCATTA CTCGATGCAC ATGGCCAAAT CGCCATCTCC AGTACCTCGT
AGAGGTTCCA CATCATTTGC TGCTCATTCA GAATCCCAGC CAAATCTTTC TAATATGCAA
GGTTTATCGA TGTCAAGGCC AAAACTTACA CGTAATCTTT CGCAGGCTAC CGAAGACTCG
AGTATATCTA TCGGTTCGGC CCCACAGTCA AATATCCACT TAAAGCTGAA ACGTTCATTT
CAAGTTAGCT ATGCTTCACC ATATTTGGTG CCCGTGTCGT TACCGTTGTC TCTCAACGAC
GAAAACGTGT TCGAGATTCT CTCAAATAAC TACTTGATAT CCATTCCAGA TGGGTTTACT
TTAGTAGACG TATGTTTATT GAATGCCAGT GTTGCAGCCA GTGTACAACG GTTTCGTGAA
TGTCAAATCT GGCGAGTATT GGCTGTAAGC TTAGAGGAGG ATTATGTTCA AATTGACAAC
ACTACATTCT TGAGTGATCC AGAATTGGAG CATAATGAAA CTAACCAAGA TGAAAAGATC
GACGATCAGA AAGATGCTAA ATCCATACTG TCAGATTTGG GCAACTTTGT AGGGTCATAT
AATTCGAATT CAACCCTGAC CACTAACTAC GGAGGGTTGG GTAGTCTCAG TGCTAAAGAT
ACCAGTCAGG AATCTATTGG CAGAGAGATA CGTTCAATAG TATCGTCTTC TGTAGAATCG
GATGTCAAGA TCCCTCCTCC CAACCCTCCA TTAGCAAAGG CAAACAATTC CAACAATCTC
ATGGATATGA TCAACCGAAG CAGAGTGAAC AGTATGAACC ATCTTCAAAG CATAAGCCCA
TCTGGGTCCC ATACATTTAT CCGCAATGCT ATCCATGAAA ACAAAAGCAA CGAAAACGCA
ATTGTAGACG ATGACGAAAC TGACGCAGCA GTACATGGAA CTGAAGGTTC ATCACATATT
GCTCATCATA GAGAAAGACG TTCATCTTCG AAGAAAACTA AACCTACTCT TAGACATCAT
AGAAGTTCTC AAATACTGTA TGAAGTCGAT AAAGAAACTC AAAGTTCTCC AATAGCCATA
GCATCACCCT CAAAAATTGG CATTGGCTCC GGCGATAACT CGCCCAATTC TGCATTTTTA
CATCGCCATG CTGATTCGTT CTCATCTTCG TTTGCAGGTT CAAAGTTGGC AGGACGAATC
GGAACTTCTC ACGTTTCTGA AGATTTGGAC AACGAGAATC TTAACATTCT CAACAATGCT
GTTCTCAACT CGAGTCCAAA CTCAGCTATG ACGACTCCTC ATCCGCAATC AAATAGTCCC
AACTATTCAA ACTTTTTTTC ACTGTCGCAC CAGTCAGCTC AACACCATTC CATGGGTACT
GGTTCTGGTT CAACTTCAGT ACCTTCTCGT CGTAATTCTG CTATTCCAGC ATATGGATTC
CATAGACCTA AGTTGTCGTC TACATTCATG TCTCCTATTT ATGATGAATT TGCTGAAAAA
CAGGAACAAC CGCGTAGCTT GAAGGCCGAG TCCTTGTTGA ACAACGATGT CTCTGATAGA
ACTACAACGA AGTCTGAGCT TACTAAGGCA ATTAAAGAAG AAGTAGATAA TTCCAGTGGG
GCCCCGCTAA AGAAAGCTTG GAAGTCGCTG AGCTTATTGG AAAAAGCATT AGCGCATGCT
TCGAACGAGG GGGATATTAT TCTTTGCTCT ACACTTTCAC TCTTGTTTTA CGACTCGTTC
AAGCAGGTAA TCCCCCAATC GTCCTGTTTG GATTGGTTAG GGCTCTACAT TGAAATCCTA
CAAAGAAAAA GGTTGTTTGT CAATGCTATT CACGTTGTCA ATAATGCTCC TGATGACGTT
AGAAGCAAGT TGAAAAACTT GACCTCTGGC GATGTGGATC TCCGTTTCTT CTGTTGCTGG
TGCCAGAAAC TCTTGGTTAA TGAAAAGTCC AAGGAGAAAT TGAAAAATGA TGTCAATGCA
GACTTCGGCT ATTGGTACTG TGACGAGTGT AGTCAGAAGC AACTGAATTG TATCTATTGC
AACGAGCCTT GTAAGGGATT GACGGTAGTT GTTAGTCTTA AGTGTGGCCA CAGAGGACAT
TTTGGATGTT TGAGAGAATG GTTCATTGAG GACGAGAATA ATGAATGTCC GGGTGGCTGT
GATTACAGTG TAGTATAG
 
Protein sequence
MSSNANSQHL TLAKFAFNIY GSLNHATPSY EGLASPSLSY SSSRSKTNEP YNYRFNKLVY 
NCEREVTTLS QLNYPLSLSN TFQSDQNLSH HVIIGGRNYL KLLALNEDQS RIVQDINVLD
QSSIYTHNSR VPSTNKLNNI NTVKAQSDTI GCGLSNGLIT VYKVGSNGKC RLIHKFSDHK
RCINSLDYVG IRNLYDAPTQ MISGSQDGSI KLWDMRLSSP RPMLTISSGS HLDPIRSCQY
SPHSQGRNKL VVLSVHDSGA LCKFDLRLSN GHGPERKWNI HTGPALSLHI HPEKEYVVTG
GRDQKICVFN YGDSQISNRI TPDEMINTYG PVMKVRWCLY PDASTSQFGE PLDTFQQSND
FNSSSLYSYD LACLYLNDDS TVAIYNLNRK FIPKEVITTS SNKPIQNFIW ANNPGSSRKI
WTITKSNVFS SYDLDMHDSL LESEISKPLD ELANVTVDWN NGFGDLCLAN QEKYEFEITE
VESQASDNDM GDIDTEYSSR YERSNSNSVI DESSLIGSSS AEKPPLFRSS THYSMHMAKS
PSPVPRRGST SFAAHSESQP NLSNMQGLSM SRPKLTRNLS QATEDSSISI GSAPQSNIHL
KLKRSFQVSY ASPYLVPVSL PLSLNDENVF EILSNNYLIS IPDGFTLVDV CLLNASVAAS
VQRFRECQIW RVLAVSLEED YVQIDNTTFL SDPELEHNET NQDEKIDDQK DAKSILSDLG
NFVGSYNSNS TLTTNYGGLG SLSAKDTTKA NNSNNLMDMI NRSRVNSMNH LQSISPSGSH
TFIRNAIHEN KSNENAIVDD DETDAAVHGT EGRIGTSHVS EDLDNENLNI LNNAVLNSSP
NSAMTTPHPQ SNSPNYSNFF SLPKLSSTFM SPIYDEFAEK QEQPRSLKAE SLLNNDVSDR
TTTKSELTKA IKEEVDNSSG APLKKAWKSL SLLEKALAHA SNEGDIILCS TLSLLFYDSF
KQVIPQSSCL DWLGLYIEIL QRKRLFVNAI HVVNNAPDDV RSKLKNLTSG DVDLRFFCCW
CQKLLVNEKS KEKLKNDVNA DFGYWYCDEC SQKQLNCIYC NEPCKGLTVV VSLKCGHRGH
FGCLREWFIE DENNECPGGC DYSVV