Gene PICST_31214 details


Gene Information

Locus tag: PICST_31214
Symbol:
ID: 4838640
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 220450
End bp: 222009
Gene Length: 1560 bp
Protein Length: 461 aa
Translation table: 12
GC content: 42%
IMG OID: 640389955
Product: predicted protein
Protein accession: XP_001384345
Protein GI: 150865218
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0533] Metal-dependent proteases with possible chaperone activity
TIGRFAM ID: [TIGR00329] metallohydrolase, glycoprotease/Kae1 family
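
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are 1-based and inclusive, and that the gene span includes the stop codon; neither assumption is stated in the record. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 220450
END_BP = 222009
PROTEIN_LENGTH_AA = 461


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein, optionally with a stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = span_length(START_BP, END_BP)    # should match "Gene Length"
cds = coding_length(PROTEIN_LENGTH_AA)  # coding nucleotides incl. stop codon
non_coding = span - cds                 # remainder within the gene span

print(span, cds, non_coding)
```

Under these assumptions the span equals the listed gene length of 1560 bp, and the bases left over after accounting for the 461-residue protein would be intronic, which is consistent with the "Is gene spliced: Yes" field.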


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGATG AGAAGCAATG TTTGAAGGCC AGTTATTTAT CTCCTTCTTA TATTTTGCTA 
GGAGATGAAT ATGATCTTGT TCAGGCATCT CAAAATATAT CCGGGTTCAA ATTTATAGAA
ATTCACGTGA TCAACCGGAA CTAGAAAAGA ATATCTGAAA AACTTTTAAA CCCCAGAAGT
TTCTTTTCCT CCAGGATTTG ATTGTAATTG CATTTACCTC CAAAATATGA TATGATAGGC
AACAGAATAT TGAAGAACAG TACACGGATT ATACGGCTGC CGATTTCGAC TACCTTTCGC
CGAACGTACA AGGTTCTAGC TATAGAAACG TCATGTGACG ATTCTTGTGT AGCTCTCTTA
GACCGATATC TGCCTCTAGA GCCCCCCAAA GTGATTGACC AGATCAAAAA GACATTAGAT
TCTGCTGATA TAGGTGGAAT TATGCCTACG GCAGCGTATG ATTTCCATCT CTCTACCATA
GGAGGTTTGG TAGATGAGCT CTGCAAGAAA CATGGAATGA ATGCTCGTAA TCCACCAGAT
TTGATATGTG TAACCCGAGG TCCTGGAATG ACAGGATCTT TATGTTCGAG CACACAATTT
GCCAAAGGGT TATCTGTTGC ATGGGATGTA CCAATTGTAG GTGTTCACCA TATGTTAGGG
CATTTGCTTA TAGCCCAGCT TCCTAAGACC GAGCAGCCAT GGTTGGGTGC TCCTAAGTAT
CCTTTTCTTA GTTTACTTTG TAGCGGAGGT CACACGATGT TGATATTGCT GAAGTCGATT
CAGGAGCACG AGATCATTGT CGAAGTGAAT GACATCGCTG TGGGAGATTC TCTTGACAAA
TGCGCTCGCG AACTTGGGCT TTATGGGAAT ATGCTCGGAC AAGAACTAGA AAAGTATATC
AATAATTTCC CTGAGGAACT CAAACAAGAG TTCGACAATG TTGATATAGA AACCAGGGAC
AACGAGTACA AGTTCAAACT CAAGATGCCA TTCAAAGGAC CAGGAACTGG ACGAGTTCCT
AAGAATATCC AGTTTTCGTT TGCTCAGTTT TTGAGTGCTA TTCAATCGTA CCGGATTCAT
TATTTAAACA ACGAGCAGTT TGACAATAAA ACGAAGCAGA TGATCGCTTA CAAGACACAA
GAGACAGTAT TTGATCATAT AGTGGACCGT ATCAACGTAG CATTCCAGAA ACACGGCTTG
GACAGAAGCG TGTATAGAAA CGCCGATGGA AAGTTCGTAG GTATCCAAGA CTTCATCTGT
TCCGGAGGTG TAGCAGCAAA CAGGCGTTTG CGTCAAAAGT TGAGTTCGAA TCTTGAGTAT
AAGGAAGCGT TACGAACCGA CCAAGATTTA GCGTTCCATT TCCCGGACTT ATCGCTTTGT
ACGGATAATG CCATCATGAT CGGAGTTGCC GGAATCGAAA TCTTTGAGAA ATTGAGAGTC
AAGTCGGACT TGAACATCAC TCCTATAAGA AGATGGCCCA TGAACCAGTT GCTTGATGTG
GATGGCTGGG TAAAGGTGGA TGACGCCGAG TTCAACAAAG TGTGCAAGTT TGAAAACTAA
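
The GC content field can be recomputed from the gene sequence as displayed. This is a minimal sketch, not the database's own method: the function name `gc_content` is hypothetical, and the record does not say how the percentage is rounded. Whitespace is stripped first because the sequence is shown in space-separated blocks of ten bases.

```python
def gc_content(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Toy input in the same blocked layout as the record (not the real gene).
example = "ATGCGGCCAT"
print(gc_content(example))
```

Pasting the full gene sequence into `gc_content` gives a value that can be compared with the GC content field in the Gene Information section.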
 
Protein sequence
MSDEKQCLKA SYLSPSYILL GDEYDLVQAS QNISGQQNIE EQYTDYTAAD FDYLSPNVQA 
LLDRYSPLEP PKVIDQIKKT LDSADIGGIM PTAAYDFHLS TIGGLVDELC KKHGMNARNP
PDLICVTRGP GMTGSLCSST QFAKGLSVAW DVPIVGVHHM LGHLLIAQLP KTEQPWLGAP
KYPFLSLLCS GGHTMLILSK SIQEHEIIVE VNDIAVGDSL DKCARELGLY GNMLGQELEK
YINNFPEELK QEFDNVDIET RDNEYKFKLK MPFKGPGTGR VPKNIQFSFA QFLSAIQSYR
IHYLNNEQFD NKTKQMIAYK TQETVFDHIV DRINVAFQKH GLDRSVYRNA DGKFVGIQDF
ICSGGVAANR RLRQKLSSNL EYKEALRTDQ DLAFHFPDLS LCTDNAIMIG VAGIEIFEKL
RVKSDLNITP IRRWPMNQLL DVDGWVKVDD AEFNKVCKFE N
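
The record lists translation table 12, the alternative yeast nuclear code, which differs from the standard code in that CTG encodes serine rather than leucine. The sketch below builds that table from its compact 64-letter form and translates the first three ten-base blocks of the gene sequence. The function name `translate_table12` is hypothetical. Because the gene is spliced, translating the whole genomic sequence in one frame would not reproduce the protein sequence; only the start of the first exon is used here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
# This is the standard code with CTG changed from L to S (table 12).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, stops appear as '*'.

    Raises KeyError on ambiguous bases such as N.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE_12[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence, as displayed in the record.
first_blocks = "ATGTCGGATG AGAAGCAATG TTTGAAGGCC"
print(translate_table12(first_blocks))  # MSDEKQCLKA
```

The output matches the first ten residues of the protein sequence listed above.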