Gene PICST_42365 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_42365 
Symbol 
ID4836996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1882210 
End bp1883349 
Gene Length1140 bp 
Protein Length379 aa 
Translation table12 
GC content40% 
IMG OID640388311 
Productpredicted protein 
Protein accessionXP_001383131 
Protein GI150864354 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5160] Protease, Ulp1 family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.201544 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACATA TTGTCTTCGA CTCTGCCAAG GGCTTTTGTT ACGACAGTTC TGGTGCGGGC 
CCAGGTAGGC ATCTGGCTAC CAGATCCAAT GGATTTGATC AGCTACCTTG GGACGACTAC
TTGCCTGATG AAGTAGACGA AAGTGAAGAA GAGACACCTG GTAATGACGA CGATTCCAGC
ATTCACGCGA CTAGAAAGAA ACGGCTCACT AAACTGGAGA AAAAGGAACA GAGACAAATG
AAGAAAATTA ATGCTGCTCG CAATAAGCAT ATCAAGAGAC AGCAAGAGCT ACTGGGTTCC
GGAAAAGACA GGGACATGTC TGAACTCGAA ATCTTCAATC CATTCCTAGC CAAAAGCAGT
ATAAAATCAA TACATAGTAA TATCCTCAGA ATGGCAGAAC AACCCAAATC AATCGATTTC
AAGTTGTTCC AGTACCATTC TATCGCACTT TATAGCTCGG ATCTAGACCA TATTCTTCCT
GGTGAGTGGC TCAATGACAA CAATATTTCA CTTATTTTCG AGCTTATTAA CCAGCTCTTC
CTCAAGAGTC AAGATCCGGC TAAAAAATTC AACTACCAGG TCCAGATGTT GTACCCATCC
TTGGTACAGC TATTTTTGCA TTTCCCAGTC ACCGATGACT TGGAAAATAT TCTTCCTATT
AATGAATTGA AGCAGCTGAA GTTCATATTT ATACCGATCA ACTTCATTGA CGACTACGAA
GACATTGATT TGGAAGATGT TAATAATGGC GATCACTGGG CACTTGCGCT TTTGTCGATT
TTGGAGAATA GACTCTATTT GTACGACTCC ATGGCTATTG ATGGAGACGA ATTTGCATCG
CAATCTGAGA CCAATTTGTT GAACGAATTG ATAAAGAGAT TGAAATCGTG TAAAAGCATA
TTCAAGGCAG GCGACAAGAC CAAGATAGAT ATCATAAGGA TGAAGTGTGA CCAACAGGAT
AACTTTGATG ACTGTGGAGT ATATCTCATT ATGATAGCAT GCTTTTTAGT AAAGCAACTA
CTCTTCTCCG ATTCAGCGGA AGGGGCTGTA GACTTGGATA TTGGAAATGT CCGTTTCAAT
GCATTAAGTG CAAGGCTCTA TATGATGAAA TTGATTCATA AACTATATAA ATCATTATAG
 
Protein sequence
MPHIVFDSAK GFCYDSSGAG PGRHSATRSN GFDQLPWDDY LPDEVDESEE ETPGNDDDSS 
IHATRKKRLT KSEKKEQRQM KKINAARNKH IKRQQELSGS GKDRDMSELE IFNPFLAKSS
IKSIHSNILR MAEQPKSIDF KLFQYHSIAL YSSDLDHILP GEWLNDNNIS LIFELINQLF
LKSQDPAKKF NYQVQMLYPS LVQLFLHFPV TDDLENILPI NELKQSKFIF IPINFIDDYE
DIDLEDVNNG DHWALALLSI LENRLYLYDS MAIDGDEFAS QSETNLLNEL IKRLKSCKSI
FKAGDKTKID IIRMKCDQQD NFDDCGVYLI MIACFLVKQL LFSDSAEGAV DLDIGNVRFN
ALSARLYMMK LIHKLYKSL