Gene PICST_50233 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_50233 
Symbol 
ID4841063 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp365493 
End bp366653 
Gene Length1161 bp 
Protein Length386 aa 
Translation table12 
GC content40% 
IMG OID640392378 
Productpredicted protein 
Protein accessionXP_001386463 
Protein GI150866761 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0652] Peptidyl-prolyl cis-trans isomerase (rotamase) - cyclophilin family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0164202 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGG AACCAGTGAG ACCTCATGTC TATTTGGATA TCTCCATCGG TGCAAGGGAT 
GTTGGCCGTA TTGTAATCGA ATTATTCGAT GATTTAGCAC CCAAATCCAC TGAGAACTTC
ATCAATTTAT GTGATGGGGT ATCTCTCGAT GGCGAGATAC TAGGATACAA GAATAATGTT
TTTCATAGAG TGATCAAGAA CTTTGTCATC CAAGCAGGTG ATTTGAAGTA TGGGCAATTC
TCTTCAGTTG ATGCCTATTA TCAAGAAGAT ATAGGGAAAG GTAACATATC CACTGTAGAT
CCTCCCAACA TGATAGAGGG CGAAAACTTG TCGGAAGCCC TAGATGCACC ATTCAAGGTA
TGCATGGCTA ACAGTGGAGA CAAAAATGCA AACGGCTCTC AATTCTTCAT AACTACTTAT
CCCCTGCCGC ATCTTACTGG ACGTCACTCA GTCTTTGGAA GAGTGATACA TGGGAAATCT
GTAGTCAGAG AAGTCGAAAG AGTTAACACA AATAAGGAGA ATATCCCTAA AAAGGAAGAG
ATAGTATTGA TCAAGGATTG TGGAAAATGG GATGAAAGCA TGCCTGTTCC TATTTTCAAC
GCCAGCTACG ACACCAGAGG TGGAGATATA TACGAAGAGT TTCCAGACGA CGACGAGCAT
ATAGACAAGG AATCATCAGA ATCAGTATAT GAAGCTGCTT CCAGGATCAA AGAAAGTGGT
ACCTTGCTAT TTAAAGCTGG AAAAAAACAA GAAGCTTTCT TAAAGTACAG AAAGTGCATG
AGATACATTA TGGAATACAT TCCTGACCAG GATCAAGAGC CTGAATGGTA TGAAAAGTAC
ATTGATTTGA AGAAGAAAGT CTACTTGAAC TTGTCTTTAG TATGTCTCCA GTTGAAGAAC
TATGTGAAAG CAGTAGACTA TTCGTCGTAC TTATTGGAAA TGGACAATGC TTCCAGTCAA
GAAAAGGCAA AGGCTCACTT CAGAAAGGGA TCAGGCTTAA TAGAGTTGAA GAAGAATAAT
TTGGCACTTG TAGATCTAGA AGCAGCTAAC AAGTTAGTAC CTGATGACGC TGCTATCAAC
AGAGAACTTA CCAGATGCCA AGATTTGATA GAACGCCAAA AAAAGGAAGA GAAAGCTAAA
TACGCCAAGT TCTTCAAGTA G
 
Protein sequence
MKVEPVRPHV YLDISIGARD VGRIVIELFD DLAPKSTENF INLCDGVSLD GEILGYKNNV 
FHRVIKNFVI QAGDLKYGQF SSVDAYYQED IGKGNISTVD PPNMIEGENL SEALDAPFKV
CMANSGDKNA NGSQFFITTY PSPHLTGRHS VFGRVIHGKS VVREVERVNT NKENIPKKEE
IVLIKDCGKW DESMPVPIFN ASYDTRGGDI YEEFPDDDEH IDKESSESVY EAASRIKESG
TLLFKAGKKQ EAFLKYRKCM RYIMEYIPDQ DQEPEWYEKY IDLKKKVYLN LSLVCLQLKN
YVKAVDYSSY LLEMDNASSQ EKAKAHFRKG SGLIELKKNN LALVDLEAAN KLVPDDAAIN
RELTRCQDLI ERQKKEEKAK YAKFFK