Gene PICST_84931 details


Gene Information

Locus tag: PICST_84931
Symbol:
ID: 4840420
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009046
Strand:
Start bp: 1343010
End bp: 1347191
Gene Length: 4182 bp
Protein Length: 1049 aa
Translation table: 12
GC content: 40%
IMG OID: 640391735
Product: predicted protein
Protein accession: XP_001385952
Protein GI: 150866375
COG category: [R] General function prediction only
COG ID: [COG1026] Predicted Zn-dependent peptidases, insulinase-like
TIGRFAM ID:
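
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: the variable names are made up for this example, and it assumes the coordinates are 1-based and inclusive and that the unspliced coding region ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 1343010
end_bp = 1347191
protein_length_aa = 1049

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced ("Is gene spliced: No"), so the coding
# region is three nucleotides per residue plus one stop codon.
cds_length_bp = 3 * (protein_length_aa + 1)

# Whatever remains of the listed gene span lies outside the coding region.
flanking_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, flanking_bp)
```

Under these assumptions the computed span agrees with the reported gene length of 4182 bp, and the span is longer than the coding region implied by the 1049 aa protein.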


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CAAAGTCAAA GGCCAGATTT GATTCGTCAC TTGTGAACTT GTGAACAGAC TTCGCTTACA 
GGAGCATTTC ACTGGAAAAA ATTGATTGTC GTTTGATTTC TCAAAAGAAT TTGAATCGGA
ATTTTTCACT TGAAAACTGC TCAAGATTGT TTGCTCAACC GTGAGTCTTG TTAATTGATA
TTACTATTTG TCCGTTTGAC TTTTGAAGTT GTTCCCATTC CTGAATTTGT TGAGATATTG
AGACTATAAT CGCTATATTG CTTCTTTCTA GATCATAGAC TATAGAATCA TAAAGTACAT
TGTTTTTTAG ATTTTTCAGA ACCCTCAGCA TGACAGGCAA GTCCCATTTT GTCAAGCAGA
CTTCGTTTGA AGTCGACTAT TCGCCCACGG AGATCACTAA ATGGCGCTCT TCCAGAACTG
GATTGCAACT CACGTATATC AACCAGCCCT CGCCTATTGT CAATGGTTAC TTCGCTGTGG
CTACAGAAAT CGACAACAAT ACTGGAAGTC CCCATACTTT GGAACACTTG GTCTTTATGG
GCTCTCGCAA ATACCCCTAC AAGGGGTTGT TGGATACACT TGGAAGTCGT TTGTACTCAA
CTACCAACGC CTGGACTTCC GTAGATCAGA CTGTCTATAC CCTTACCACT GCTGGCTGGC
AGGGTTTCAA GACTCTTTTG CCAATCTACT TGGATCACTT GCTCAGCCCT ACTATCACCG
AAGAAGCCTG TCTTACTGAA GTGTACCACA TCGATGGTGA TGGTAAGCAG AAAGGAGTTG
TTTTCTCAGA AATGCAAGGC ATTGAGAATC AGCTGTGGTT CATCACCTAT CAGAAGATGC
AAGAGACTTT GTTTTCTGAG AGCTCTGGTT ATTCTTCTGA AACTGGGGGC TTGACCACAG
AATTGCCTAC TCTTACAAGA GAAACGATCA AAAAGTTCCA CGATAGTTCA TATAGACCCG
ACAACTTGTG CGTAATCATC ACAGGCTCCA TAGACGAAAA TGAGCTTGTA GACATCATGA
CTCAGTTCGA TAATGAATTG GCTCCCTTAC CCGATACTCC CAACAAGAGA CCATTTGTGG
ACTCTAAACA TGATCCGCCA TTGTCAGAAA CAATTATTAA AGAAATCGAA TTTCCAGATG
AAGACGAATC CATGGGAGAG CTCTTGATCT CGTGGATTGG CCCAGATGGA AATGATACCT
TACAAAGTGT AGCTATAGAC ATGATCGCCT ACTATTTTAC TGATAGTCCG ATCTCGTTGC
TTAACAAACA TATGGTAGAA ATCGAAGATC CTCTTGCTAC TGACATCGAT TATTCCCCAG
ATAGTTTTGT AAGAACCATC ATCAACTTCA CCTTAGGTGG AGTTCCTGCT AATCGATTGC
AAGAGGCTGA TTCCAAGCTA AAGGAGTTGA TCTTGAGTCA AGTCAAACCA GAAAATTTTG
ACTTGTCGTA CATGAGAGAA ATTGTGCAAC AGCAGAAGTT AAAGTACATC TCCAGAGCTG
AAAAGAACTC GTCTACATTC TCTACCATTG CTACCTTAGA ATTCTTATAT GGTAATGTAG
ACGGCTCTGA CTTGAAGAAG TGGACCAAGG ACTTGCACGA ATATGAGGTT ATCTACAACT
GGACGACTGA ACAGTGGTGC AATTTGATTT CTGAACAGTT TGTTGAGAAC CACTCTGCTT
CCATTCTCGG TAAACCATCC TCTGCCTTAA ATGATGAGTA CAAAAAACGA AACAAGAAGT
TGAGAAAAGA CATCATTGCT AAGTATGGGG AAGAAGGCTT GAAGAAGTTG GGGAAAGAAT
TGGAAAGAGC TCAGAAAAAG AATGACATTC CTATTCCAGA CGAGTTGTTG CTCAAGTATG
AGAAACCAGA CCCTTCAAAG ATCGATTTCA TTGAGACTAC TTCGTACAAA GCTGGGTTTA
CGGAAGGAAT GTTTAATCCT AAGACAAACA ACTATGTAGA TGATGACTTC AGTGAAGCTT
TGAAGAGGGA CACTCCAAAG GATAGCTTCC CTTTGTTTTT CCATTTTGAA GACTTCAAGT
CTCAATTCAC TACGATTAAC TTGGTCTTGT CTTCTACTAA GATCTCTCCT CATCTCTTGA
AGTATACTTC AATTGTAGAA GAACTTTTTT CGTTATCCAT TCAGCTTCCA GATGGAACAT
ATACCCCATA CGACAAGGTG ATTTCAGAAA TCAACAACGA CTTGTTGGAA TTTCAATTGG
ATAACGGCTA TGAAAACCAA TTCCTTGAAC TCTTAGGAAT CAGAATTAAA TTTGAGTCAG
CCAAGTACAA GAAAGCTATT CAGTGGTTGC TAAATGTGAC CAAGTATGTC GTATTCGAAG
AATCCAGAGT CAAGATCATT GTAGAGAAAA TTATCAACTC ATTGCCAGAC AAAAAAAGAA
ACAGTGAGTT GATGATGTAC TCGTCCCAAC ATAGACATAT GTATAACGAG GAATCTTTGA
GAAAGTCTCA AGATTCTATC CACACTGAAA CCTTCTACAG GAACTTATTA GAAATGATTG
AGGGAGGTAA TTTTAGTAGT ATACAAAGCG ACTTGGAAGC TTACATTAAG CAATTGTTTA
CTTTGGATAA TATGAAGGTA TTTGTCTTGG GAAATGTCAA GAATTTGGAT GGACCAGTTT
CATCGTGGTC TGATTTTGTT GAGAAATATG AACAGCCCCA AAATTCACCA GATCCCTTTC
ACAAGTTGCC AAGATCGTAC CAGTTCAAAT CCCAGCTAGG CCACATCTGT GCTAAGAATG
CCTTCCTTGT TCTTAGTCCA ATTGCTGATT CCACTCACTT GATTACATCT ACTCCTATTC
CTAACGACTA CTTGGATGAA GATATTTTCA AAATTGCTCT TGCTAGCGAA GTCTTGAATG
TTGTTGAGGG TCCTTTGTGG AAGGGTATCA GAGGAGCTGG TTTGGCTTAT GGTGCCTCTG
TGAAGAGGGA TATAGAAACT GGTTTGTTAA GCTTCACTGT TTATAGAGGT GCTGATGCTA
AGAAATCTTG GGAAGTTGCG AAGTCCATAG TAGATGATTA TGCTCTGGGA AAGGTCGAAG
TCGATGCTAT CACAATAGAA AATTCTATTG CTTCCATTGT GAATGAGTTG GCCAATGGTG
AGAGTAACAA TTACTATGCT GCTACTAGTA AGATCAGTGA TAATTTGTTC AAAAAACGTG
GTCCAGCTTA CATTAAGCTC TTTTTGCTGA AGTTGAACGC CTTAACTAAG GATGACGTAG
TTTATGCCAT TGAAAAGTAT TTCAAGCCCA TGTTCGATTC AGAGCTGTCT TTGGTTTTCT
CCAGCATCCC TTCTGAGAAA GAAGAGGAAA TGGAGAAATA TTTCACCGAA CTTGGCTACA
GAGTTCACAT TGAGGTTATA GACGCAGAAC CAGTAGAGGT CGAAGATGAT CTGGAGGATC
TGGAATCTGG CTCCTCTGAA GAAACCAATT CAGAGGATAC AAGCGACGAA GATCAATAAT
ACAGTTGGAA CTGATGTCCA ACTCTACAAT TGTACATACA GATATATAGA TTTTTCTATT
AATGTTATTT GTATTTCTGG AAGCTTCGTA TTACAAAGGC TATATGAATG TTCTTAAAAG
GTTTTTCTGA TCAGGAATGA TAGAGAAGAA CAAGGTATTA CTTGTTGCTT CACTTACTGA
ATATGCAAAA TGAAGGCAAG CATCGGATAA ATGTTGTCTT CTAGGATTGT TTCACGTCTA
CGACCGTGCT AACCTAGTCC TAACTATTGT TGAAACTTAG CCTTGATTTC ATCTGGCAAT
GGAACTCCAA CATCGTTGTT GGCAGGTTGA GCGTCACCAT TCTTTTGCTC TTCTTTAGTG
TCTTCTTTAG TGTCTACTTT AGTGTCTTCT TTGGTGCCTT CCTCTTTAGT CCCTTCTTTA
GTTTCTTCTT TAGTCCCTTC TGGTTTTTCT TCTTCCTTGT CTTCCGGAAG TGGAGGGTGT
CCCTCACGGT TTTCCAGGAT CAATTTTCCA TTTTCATCGA TTAAACCCTG TTGAGCAAGC
TTGTTCTTGT TCTTATTGAA TTCGTATTCC TTTTGGTTAT CCGCTAACTG CTGCATACGG
TTCTCGGTAT CAATATCCTT TAATATTTGA TGGATTTTCT TGATGTAAGA AAGCACGGTA
GCATCGCTAT CCTTTGAATC CTTTTTCAAC AACTCAGTGA AC
 
Protein sequence
MTGKSHFVKQ TSFEVDYSPT EITKWRSSRT GLQLTYINQP SPIVNGYFAV ATEIDNNTGS 
PHTLEHLVFM GSRKYPYKGL LDTLGSRLYS TTNAWTSVDQ TVYTLTTAGW QGFKTLLPIY
LDHLLSPTIT EEACLTEVYH IDGDGKQKGV VFSEMQGIEN QSWFITYQKM QETLFSESSG
YSSETGGLTT ELPTLTRETI KKFHDSSYRP DNLCVIITGS IDENELVDIM TQFDNELAPL
PDTPNKRPFV DSKHDPPLSE TIIKEIEFPD EDESMGELLI SWIGPDGNDT LQSVAIDMIA
YYFTDSPISL LNKHMVEIED PLATDIDYSP DSFVRTIINF TLGGVPANRL QEADSKLKEL
ILSQVKPENF DLSYMREIVQ QQKLKYISRA EKNSSTFSTI ATLEFLYGNV DGSDLKKWTK
DLHEYEVIYN WTTEQWCNLI SEQFVENHSA SILGKPSSAL NDEYKKRNKK LRKDIIAKYG
EEGLKKLGKE LERAQKKNDI PIPDELLLKY EKPDPSKIDF IETTSYKAGF TEGMFNPKTN
NYVDDDFSEA LKRDTPKDSF PLFFHFEDFK SQFTTINLVL SSTKISPHLL KYTSIVEELF
SLSIQLPDGT YTPYDKVISE INNDLLEFQL DNGYENQFLE LLGIRIKFES AKYKKAIQWL
LNVTKYVVFE ESRVKIIVEK IINSLPDKKR NSELMMYSSQ HRHMYNEESL RKSQDSIHTE
TFYRNLLEMI EGGNFSSIQS DLEAYIKQLF TLDNMKVFVL GNVKNLDGPV SSWSDFVEKY
EQPQNSPDPF HKLPRSYQFK SQLGHICAKN AFLVLSPIAD STHLITSTPI PNDYLDEDIF
KIALASEVLN VVEGPLWKGI RGAGLAYGAS VKRDIETGLL SFTVYRGADA KKSWEVAKSI
VDDYASGKVE VDAITIENSI ASIVNELANG ESNNYYAATS KISDNLFKKR GPAYIKLFLS
KLNALTKDDV VYAIEKYFKP MFDSESSLVF SSIPSEKEEE MEKYFTELGY RVHIEVIDAE
PVEVEDDSED SESGSSEETN SEDTSDEDQ
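
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The following is a minimal sketch of how short stretches of the gene sequence above map to the protein sequence under that table. The helper names (`build_codon_map`, `translate`) are hypothetical, and the two DNA fragments are copied by hand from the gene sequence.

```python
from itertools import product

BASES = "TCAG"  # NCBI ordering: first base varies slowest, third base fastest

# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
TABLE_1 = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Table 12 differs from the standard table only at CTG (L -> S).
TABLE_12 = "FFLLSSSSYY**CC*WLLLSPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_map(amino_acids):
    """Pair each codon, in NCBI order, with its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, amino_acids))


def translate(dna, codon_map):
    """Translate in frame 0, ignoring spaces and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = codon_map[dna[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


yeast_alt = build_codon_map(TABLE_12)
standard = build_codon_map(TABLE_1)

# First ten codons of the coding region, starting at the ATG.
start_fragment = "ATGACAGGCA AGTCCCATTT TGTCAAGCAG"
# A fragment containing a CTG codon.
ctg_fragment = "CAGCTGTGGTTC"

print(translate(start_fragment, yeast_alt))  # MTGKSHFVKQ
print(translate(ctg_fragment, yeast_alt))    # QSWF
print(translate(ctg_fragment, standard))     # QLWF
```

The first fragment reproduces the opening residues of the protein sequence, and the second shows why the table matters: the listed protein reads QSWF at that position, which matches table 12 but not the standard code.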