Gene PICST_32361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32361 
Symbol 
ID4839541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1313079 
End bp1314776 
Gene Length1698 bp 
Protein Length565 aa 
Translation table12 
GC content46% 
IMG OID640390856 
Productpredicted protein 
Protein accessionXP_001384943 
Protein GI150865635 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5207] Isopeptidase T 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.136395 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTACAT CCAGAAAAAC CGTTCCTGGA GGAAAGTCCG TGCCGAATGC CGCTGGAACC 
TCGAATGGAA GTGTTTCTTC CAACGGCCAT GGTAAAAATG GTACTTCTGA ATTAGACTAT
GAAAGGGAAT CCCAGCATAG GTTGCGACCA CAAGACAACT ACTCCACCAT CGCAGCCTGC
TCCCACCTCA AGTCCGTGTT AGAATCATCA GCCCGAGAGA CGGCCCTCAT CACATACAGA
CAAGCTGTCA ACATCTCTCG GCCAATCGAT AATGACCTTA TCTACACCGC GAAGAAGGAT
GGCTCTGTAG TGTCGCACCA TCGTTTGTTA GTACGAAAAT CCTCATCCTT GCGGTGCACA
GACTGTTCGC TCAACAACTT CCACCACAAT TTTACCTGTT TGCAGTGCCC GCATGTGGGC
TGTTTCAATG ACGTCCACAA CCATGCTTAC ACCCACTATA AGCTCACCCA ACATGTCTTC
GCTATCGACA GCCACTCGGG CTTGCTTTAC TGTTTTCCGT GCGGAACCTA TGTCAACCAT
CCTGCTCTAG ATAAAGTCAG ACAGGAGGTG TTGTTGAGTG CTACAGACTA CAGCGATCTA
ATTAAAAGCG AGGTAGATGA AGAATTTGAT TATAGTGATG TAGATGCTCA CTACTCAGAC
CCCAGCCGCT TGGGCGTAGA CGGGTTGAAG GGCTTTGTCA ACTTGGGTGC CACTTGTTTC
ATGAGTTCCA TCCTCCAGAC CTTCATCCAC AACCCCATCA TCAAAAACCA TTTTTTTAAC
AACGACTTGC ATTACTTCAA CTGCGAAAAG AGCATGGCAC AGGGTTCGAC TCTCGACGAA
AATAACGCAT GCATAACATG TAGCATCGAT AACATATTTC AGCTCTTCTA CACCTCTAAC
AGCATTGAAG GCTTTGGAAT GACGAACCTC TTGACCACAG CGTGGTACAA GAAAAAGTCG
TTGGCCGGAT TCCAAGAACA AGATGCCCAC GAGTTCTGGC AGTTTATCTT GAACGAGTTC
CACTCAGACT ACGAAAGGAT CAGATCCAAC ACTGGTTTAA GTCCAATGTC AACTTCAGAC
TGCAATTGCA TTACACACTC TACATTCTCA GGAGAACTAC AAAGCTCTAT AAGATGCCTC
TCGTGCGAAT CTGTGACCAA GACTATCGAC CCGATGGTAG ACTTGTCGCT CGAAATCAAT
CACTTGAAGC TGAACCATCC TGGAAGCCAG ATAGATTTGT ACGACTGCCT CGACCTCTTC
ACCAGCGATG AGAAGTTAGA TGTTATGTAC ACCTGTCAAT CGTGTGGTGA CAAGACCAAG
GCTATCAAGT CGTTGAGTGT CAAGTCGCTT CCGCCTGTTC TATCCATCCA GTTGAAGCGA
TTCAAGCATA ATTCGTTGAA CGACACTTCG TCCAAAATCG AAACTCCTAT AAAGATTCCT
CTCTATTTAA ACATGACTAG GTATTCTATA GGTCATGATC CCCACGATTC AGAGCAAATT
GATGAAGACA AAATCTTCGA GCTCTTCGCC TTGGTGTGCC ACATCGGCTC GGTGAATACG
GGCCACTACA TAGTACTCAC CAAAGATGGC AATGGCCAGT GGTTCAAATT CGATGACAGC
GTTGTCTCGA TGGTTTCGCA AGAGGAGGTA ACCAATACAA ACGCATACTT GGTGTTCTAC
ATCACCCACA AGATCTAG
 
Protein sequence
MSTSRKTVPG GKSVPNAAGT SNGSVSSNGH GKNGTSELDY ERESQHRLRP QDNYSTIAAC 
SHLKSVLESS ARETALITYR QAVNISRPID NDLIYTAKKD GSVVSHHRLL VRKSSSLRCT
DCSLNNFHHN FTCLQCPHVG CFNDVHNHAY THYKLTQHVF AIDSHSGLLY CFPCGTYVNH
PALDKVRQEV LLSATDYSDL IKSEVDEEFD YSDVDAHYSD PSRLGVDGLK GFVNLGATCF
MSSILQTFIH NPIIKNHFFN NDLHYFNCEK SMAQGSTLDE NNACITCSID NIFQLFYTSN
SIEGFGMTNL LTTAWYKKKS LAGFQEQDAH EFWQFILNEF HSDYERIRSN TGLSPMSTSD
CNCITHSTFS GELQSSIRCL SCESVTKTID PMVDLSLEIN HLKSNHPGSQ IDLYDCLDLF
TSDEKLDVMY TCQSCGDKTK AIKSLSVKSL PPVLSIQLKR FKHNSLNDTS SKIETPIKIP
LYLNMTRYSI GHDPHDSEQI DEDKIFELFA LVCHIGSVNT GHYIVLTKDG NGQWFKFDDS
VVSMVSQEEV TNTNAYLVFY ITHKI