Gene PICST_28842 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_28842 
Symbol 
ID4851589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2228700 
End bp2229896 
Gene Length1197 bp 
Protein Length398 aa 
Translation table 
GC content44% 
IMG OID640393297 
Productpredicted protein 
Protein accessionXP_001386776 
Protein GI126274954 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1404] Subtilisin-like serine proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.991697 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.128685 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATTCA CTAAGTTGTT GTTCATACTT GCTTTCTACT TCAACTTCAT ACACGCCGAA 
AACTACTTGA TCTCGTTGAA GAACAATGAG AGTTTAGAGG CATTCTTCAA ATATGATATT
CTCAGACCAG CTACTGAACA AGTGAGGGCG CTTCTTACCA ATTCATTCTC AATCGGCAAT
TTCACTGGGT TTGTTGGGGA CTTCTCCAAA ACCAACCTTG AGAGACTCAA AAGATGTCCC
TTAGTGAACG AGATCACACC AGATGTGATA TTTAAGGCTT ATGGAACTAC TACTCAAGAG
CAAGCCCCAA GACACCTCGC TCGTCTCTCC AGCAAAAAGA AGCTCAAGTC AGGAAAAAGC
TACCAATATG TTTACAATGA CGACTATACT GGATCTGGGG TGTATGCCTA TGTGTTAGAT
TCTGGTGTTG CTATTGGTCA CCCTGAGTTC CAAGGTAGGG CTCGGTTTGG CAAAGACTTC
ACCAGCCAAG GCTCTGGTGA TTCTAATGGG CATGGAACAC ACGTTGCTGG TATTATAGGT
TCTTCTACTT ATGGGGTATC CAAGAATGTA GAGATTATAG AAGTCAAAGT ATTGGATAGT
CTGGGCTCGG GTTCTCTCAG TACAATTATC TCAGCGCTAG AGTTCTCTGT GAACCATAGA
AAAAGAAGTG GAAAGATGGG AGTAGCCAAT CTTTCATTGG GGTCGTTTAG AAATGGAGTC
TTGAACAGTG CAATCAATGC TGCTGCAGAT ACCGGTCTAG TTGTGATAGT TGCAGCTGGA
AATTCCAATA TCAATGCCTG CTTATCTAGT CCAGCTAGTG CTGAAGGTGC AATTACTGTT
GGAGCTATAG ACGACTACAA CGATTCTTTG GCATCTTTCT CTAATTGGGG GGAGTGCGTT
GATATTTTTG CCAGTGGAGC CTATGTTAAG AGTGTGAATG CTGCAGACTA TAATAATCCA
GAGACTCTCT CAGGCACTTC CATGGCATCT CCTGCCGTCT GTGGACTCGC TGCAAATCTA
CTTAGTGAAG GGGTCCCTCC CCACAAGATC AAGAGCAAGC TTCTTAGCCT ATCACTCAAG
GACCAGATCA AAAGATCTTC CTTGTTCCTC AGAAGAGGCA CTCCAAACAG AATAGCTTAT
AATGGAATTG ATGACGAATA CAGGGATGAC ACGGACTCCG ACTCCGACGA TGATTAG
 
Protein sequence
MLFTKLLFIL AFYFNFIHAE NYLISLKNNE SLEAFFKYDI LRPATEQVRA LLTNSFSIGN 
FTGFVGDFSK TNLERLKRCP LVNEITPDVI FKAYGTTTQE QAPRHLARLS SKKKLKSGKS
YQYVYNDDYT GSGVYAYVLD SGVAIGHPEF QGRARFGKDF TSQGSGDSNG HGTHVAGIIG
SSTYGVSKNV EIIEVKVLDS LGSGSLSTII SALEFSVNHR KRSGKMGVAN LSLGSFRNGV
LNSAINAAAD TGLVVIVAAG NSNINACLSS PASAEGAITV GAIDDYNDSL ASFSNWGECV
DIFASGAYVK SVNAADYNNP ETLSGTSMAS PAVCGLAANL LSEGVPPHKI KSKLLSLSLK
DQIKRSSLFL RRGTPNRIAY NGIDDEYRDD TDSDSDDD