Gene PICST_31515 details


Gene Information

Locus tag: PICST_31515
Symbol:
ID: 4838489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 989398
End bp: 991734
Gene Length: 2337 bp
Protein Length: 778 aa
Translation table: 12
GC content: 38%
IMG OID: 640389804
Product: predicted protein
Protein accession: XP_001384496
Protein GI: 150865329
COG category:
COG ID:
TIGRFAM ID:
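
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the record above.
START_BP = 989398
END_BP = 991734


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2337 778
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the record describing the gene as unspliced.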


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.116727
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.678319
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCACCC ATAGACTCGA AGATGAAGAA GATCAATTAT CACCAAGAAA GATAAGACAT 
ATGGACAATT TGTCGTCCTC AGACTCAGAA TCGGTGATGT CTACCTCTGA CAGCGTCGAC
CCTGCAACTG GCAGCACGGG TAGCACCACT GTCTCCAGAT CTGGCAGCAC CGTAGGACTG
GCGTCTGTTA GGTCAAGAAG AGCTTGCGAA AGATGCCGTC GTCGAAGAAC GAAATGCACT
GGAGAACATC CATGCGAAGC TTGTATTGCT TCAGGGAACG AATGCTTGTT CCCCAGGAAG
CCAAAGAGAA TTATGGTGTT CGACACTGAC ATTGAACAGT ATCAATCAAA GATCGAGACA
TTGGAATTGG AAATTGAAAA ATTGAGAAAG GTGCCTGACA CTGACTATGA CCACAAGGCA
GACAAGTTGA CTCTTTCAAT TTTGCTAGGT TCGCCTTCTT GTGAAATGGT ATGTTGGAAC
TTGAACGAAT TTACCATTGC CAACAAGGGA ATCTTTAGCG ACATAACTGT GAGTCCTGAT
TTCTCTACTT TTAGAGAAGA GATGTCTTAC AATTTCTTAT TCGGCAATGT TTCCAGGGGC
AATGTTGATA TGGATTCAAT TAAGAATTTG AACTACGATA CTCTTATGCT GTTATACACC
TACGTGGTTT CATTTATTAG TTCGGGGTAT ATGACCGTTG ACCACGAGAA CTTCGAAAAG
AAATGCACCA AGTATTTCGA AAACGGTTTG TTTAAACCAT CAAGTGTTAA TTTCAAGACC
AAAGTTGATT ACTTTTTCTT GAAGGTTTTG GCCCTTATGT CACTTGGCGA AATTTACAGT
CCATTATATG TCCTTGGAGA AAGTAACGCA CCCGAATTAC CAGGACTCAA GTATTTCAAA
ATAGTCATCA AGTATCTTCC ATCAGAATTT AGCTTCTTCG GCAATCGTGA TGTCAACGAC
ACTTTGGAAA TAATTGAATT ATATTGCTTA ATTGCAATTT ATCTAAGAAT TTTGGATAAA
AAGATTGCTT CGGTTCTGTT TACATTGCAT GCTTTACAAT TGTGCATTTC ATTAAATTTA
CACAAGGACA GACATCTTAG GAGTTATGAA ATTAACGAGA AACCCCAACA TTATATCAAC
AGAGTTTGGT GGGGAACCTT TTGCTTGAAC AGATTCTTCA GCTCAAGAAT TGGACAACCC
GTGCTTGTCA GCATCGACAC AATAAGCAAC AACGCGTTAT TCGATGCCCC TCAACTTGCT
CTTGAAGCAA ATAATTCTGT CAACAGTAGT ATGAAATGCT ATATTGAATT GTCTAAGATA
GCGGACACAA TTACAAATGA GCTATATTCA ACATCATTCA ACAACAAACA ATATCTACAA
TCCATCTTGT CTATCATGGC AAGGCTTTTT GACTGGAGTG CCAATATTCC CGAAAGTTTA
AAATTGTCAT TTCCCATCAA AGAAACAGAG CCAATAAACA GATTAAGCTG TTCATTGTAT
TTGAACTATT TACATCACAT CTACCTTACT TGCATTCCTA TACTATTGAA TTTTGCAAAG
ATGCAAATAA GCACCTACTT CAAGTTAAAT CAGTTGATGT ACAATCCTCT CGTTATAGAT
GATCTTCCAA AAAACATCAG CAGGATTATT CAGTCAATCA TAAATAGTGG GCACCTAACC
ATGCATATTT TTAAGGCTTT ATACAAAGGG AAGTTTGTTC GGATTTTTGG ATTCACAGAC
ATTGATTATC TTTTCAGTTC ATCATTGATT TATCTAATTT GCATAATTTT GAGAATTGAT
CTGACTAATG AAAGGAGCCA TATTTTTCAA GAGCAATTGG AAAACTCTAT GGATTGGTTG
AATCAAATGC AAAAAGGGGG GAACTTGATT GCAAGGGGAA AGCTTAATCA AATTGTTTCA
TTGGTAGGTA ATCTCGAGCC AATGCTACTT GATTTGGGCC ATAATGTTTT GATACAAAAT
CTCAAGAAAT ATAAAGAAGT CCGAACCCCA ACAAAAAGAT CACCACGCTC CAGTCACTCT
GAAGGTTCAC TGATAGTTAA GAACCAAATT CCTAGCATTT TTACGCATAT GGAAAGGAGT
GTCGGATCGT CAGAGTCACT TAATAAAGAT TTGAAATCCA AGCAAGCGTC ATCTTCCAAT
ATTGTTATTT CCTCAATTGA ACTGGACCAA ACTGAGATAG TCGATAATCA CAGTCTATTC
TCCTGGGATA TGTTTAACAA TCAAGATTTT CCAATTAGTC AACAGATAAT TGAAAACCAA
GCACATTTTA GTCCAGTGAA CAACGATGAC TTAAGCATTT TCGATTTTTT TGAATGA
 
Protein sequence
MSTHRLEDEE DQLSPRKIRH MDNLSSSDSE SVMSTSDSVD PATGSTGSTT VSRSGSTVGS 
ASVRSRRACE RCRRRRTKCT GEHPCEACIA SGNECLFPRK PKRIMVFDTD IEQYQSKIET
LELEIEKLRK VPDTDYDHKA DKLTLSILLG SPSCEMVCWN LNEFTIANKG IFSDITVSPD
FSTFREEMSY NFLFGNVSRG NVDMDSIKNL NYDTLMSLYT YVVSFISSGY MTVDHENFEK
KCTKYFENGL FKPSSVNFKT KVDYFFLKVL ALMSLGEIYS PLYVLGESNA PELPGLKYFK
IVIKYLPSEF SFFGNRDVND TLEIIELYCL IAIYLRILDK KIASVSFTLH ALQLCISLNL
HKDRHLRSYE INEKPQHYIN RVWWGTFCLN RFFSSRIGQP VLVSIDTISN NALFDAPQLA
LEANNSVNSS MKCYIELSKI ADTITNELYS TSFNNKQYLQ SILSIMARLF DWSANIPESL
KLSFPIKETE PINRLSCSLY LNYLHHIYLT CIPILLNFAK MQISTYFKLN QLMYNPLVID
DLPKNISRII QSIINSGHLT MHIFKALYKG KFVRIFGFTD IDYLFSSSLI YLICIILRID
STNERSHIFQ EQLENSMDWL NQMQKGGNLI ARGKLNQIVS LVGNLEPMLL DLGHNVLIQN
LKKYKEVRTP TKRSPRSSHS EGSSIVKNQI PSIFTHMERS VGSSESLNKD LKSKQASSSN
IVISSIESDQ TEIVDNHSLF SWDMFNNQDF PISQQIIENQ AHFSPVNNDD LSIFDFFE
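
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps to the protein sequence under that table. It builds the standard codon table and overrides CTG; the `translate` helper is hypothetical, handles only unambiguous A/C/G/T codons, and is not the tool the database used.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
# Translation table 12: CTG encodes serine instead of leucine.
CODON_TABLE["CTG"] = "S"


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks from the record can be
    pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCCACCC ATAGACTCGA AGATGAAGAA"))  # MSTHRLEDEE
# A fragment from the third sequence line that contains CTG.
print(translate("ACCGTAGGAC TG"))  # TVGS
```

Under the standard code the second fragment would end in leucine (TVGL); the serine in the listed protein sequence at that position reflects the table 12 reading of CTG.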