Gene PICST_36132 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_36132 
Symbol 
ID4838432 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1798927 
End bp1800177 
Gene Length1251 bp 
Protein Length416 aa 
Translation table12 
GC content39% 
IMG OID640389747 
Productpredicted protein 
Protein accessionXP_001384298 
Protein GI150865185 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2377] Predicted molecular chaperone distantly related to HSP70-fold metalloproteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGACTAA ATTCAGGCAC TTCAATTGAT GGCATTGATG TTGTTTTATG TAACTTCAAG 
CAAAGCTCTG TTGATTCACC TCTACACTTA TCTGTACTCA AATATGATGA AATGGATATG
CCACCAGCTT TGAAGAGTAG AGTATTAAGA ATGATTAAAG AAAACAAGAC AAAACTCGAA
GAGGTTTCAG AAATAGCTGC TCTCCTTGGA ATGGCCTTCG CAAAGGCTGC AGATGATTTT
TGTCAGAAAC ACGGGATCGA GAAGAGCATC ATTGATATAA TAGGTTCGCA TGGTCAAACT
ATCTGGTACG TACCTGATTC GAAGCCCGGC CAATGTCGGT CGGTAATTAC TCTGGGAGAA
GCTTGCTATA TAGCAGAAAA GATGGGGAAA ACAGTTGTAT CTGAGTTTAG AATTTCGGAG
CAAAGTGTAG GAAGACAGGG GGCACCAATG ATTGCATTCT TCGATAGTCT TCTCTTAGTT
CATCCTAAAA AGTTTAGAAT ATGTCAGAAT ATTGGAGGAA TTGCAAATGT TTGCTTTGTT
TTTCCTGAAA AGGATGGAGG TTTGGATAAG TGTTTTGACT ATGATACAGG ACCAGGTAAT
GTCTTCATAG ATGCCGCTAT GAGATATTTT ACCAAAGGTA CTCTTGAATA TGATAGAGAT
GGAAAGTGGG GGAAAAGGGG TGTTGTGCAC TTACCGCTAG TTGATGAATT CTTGACTGGT
GAATACTTTT TAAGAGAGCC TCCAAAAACC ACGGGAAGGG AATTATTTGG TGATTCAGTT
GCATTTGAAT TAATAGAAAA TATGATAGCC AAGGGTCTTA GCAAATATGA TATAATAGCC
ACGTTGACAA GGATAACGGC TCAATCTATT GTCAACGAGT ACCACAAATA TTCTCTGGGG
CATATTGACG AAATTTTCTT GTGCGGAGGA GGAGCTTTGA ATCCAAATAT TACAGAATAT
ATTCAAAGCT CTTTTCCAGA CACCAAAATC AACCTTCTTG ATGTCACTGG AATTAGTGGA
AGTGCAAAAG AATCAATCAC TTTTGCATTC CAGGGTCTTG AAGCTATTTT AGGAAGGTCA
TTGATAATAC CTGATAGGGT TGATAGTCGA ACTCCGGTGG TGGTTGGTAA GGTAACCCCA
GGTAAAAATT ACAGAGCATT GCAGAAGATG GCTGTTGAGT TTACTTCAAC TTGTAACTGT
GATGGGTACT TACCATCTGT TAGAAAAATG GTAATAGATA GAAATGCATA G
 
Protein sequence
MGLNSGTSID GIDVVLCNFK QSSVDSPLHL SVLKYDEMDM PPALKSRVLR MIKENKTKLE 
EVSEIAALLG MAFAKAADDF CQKHGIEKSI IDIIGSHGQT IWYVPDSKPG QCRSVITSGE
ACYIAEKMGK TVVSEFRISE QSVGRQGAPM IAFFDSLLLV HPKKFRICQN IGGIANVCFV
FPEKDGGLDK CFDYDTGPGN VFIDAAMRYF TKGTLEYDRD GKWGKRGVVH LPLVDEFLTG
EYFLREPPKT TGRELFGDSV AFELIENMIA KGLSKYDIIA TLTRITAQSI VNEYHKYSSG
HIDEIFLCGG GALNPNITEY IQSSFPDTKI NLLDVTGISG SAKESITFAF QGLEAILGRS
LIIPDRVDSR TPVVVGKVTP GKNYRALQKM AVEFTSTCNC DGYLPSVRKM VIDRNA