Gene PICST_33238 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33238 
Symbol 
ID4840457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp94377 
End bp95543 
Gene Length1167 bp 
Protein Length388 aa 
Translation table12 
GC content43% 
IMG OID640391772 
Productconserved hypothetical protein 
Protein accessionXP_001386231 
Protein GI150866582 
COG category[R] General function prediction only 
COG ID[COG3568] Metal-dependent hydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACGT CCGCTGAAGA AGGCACACAC CTTAGTGAGA GTACACCATT GTTGACTAGC 
GGCTCGGGTC TGGAGGAAAC TCGAACTAAT CCTCGTAGTT CTGGTAGATC CTCTCGTTTC
AGACTATTGA TTATCCCCAC GTTGATCATT ATTGTCTTGG TCTACGCTAC CAATTGGTAC
ATTACTAGCC ACAGCAGTGT TAACCAAGCC TTGCCATTTT TGGGAAAGCC TTTGAAACTT
CGCGTCTACA CCAACAACAT TAGACTTGAT AATCGTTACC CAGTCAAGGG CGAGCAGCCA
TGGTCCAAGC GTAAGAAGCA AGTCATCAAC TCCATTGACT TCAACACAGC TTTGGGGCAT
GCCAACGTGG TATGCTTACA GGAAGTATTG CACAACCAAT TGGTTGATAT CCTTGAGGGG
TTGAATAAGA ACGCTGAGCA GATCTGGACC TATTATGGAG TGGGTCGCAA CGATGGTTTA
GAAGCTGGCG AATATGCTCC TATATTGTAC AAGAACTCTG ATTGGATTTT GCTCGATAAC
CAGACGTTCT GGCTTAGTGA AACTCCTTGG AAGCCAAGTA AGGGATGGGA TGCAGCCCTT
GAGAGAATTG TCACTATGGT CACATTGGAA TCTAGGATTA ATCCTTTGAT CAAGGTGAAT
GTGTTCAATA CACACTTTGA CCATCGGGGT GTATTGGCTA GGAAGAAGTC GGCGGAGTTG
ATTGTTGACA AGATGGAAAA CTTTAACGAT AACCCATCGT TTCTTTGCGG TGACTTTAAT
ACCCAGCCCA AGGATCAACC TTACCATGTT TTATCTGATG CTGGATTCAA AGATAGTAGA
AAGTTGGTTG ACTATGATTA CTCATATGGC CATAGTACGA CGTTCACCGG CTTTAATAAG
GAGAAGGAGG ACTCTTCTAT TATTGATTAC ATCTGGTCAC CATACTTTTC CCAAGGAAAT
TTCGGAAACG ATACCTCGCC GGTTAAAGAT TATGAAGATG AAGTAGCCAA TGAGATGAAC
AACTACTACA ACCTTGAACA CCATTTGGTT TATGATATAG TAATCAAGCA ATTTGGGATC
TTGCACAATT ACTTCAAAGG TTTTTACTTC TCTGACCACA GACCTGTCGT CGCCAGCTAT
GAGATAACTA GAACACATCT TCTTTAA
 
Protein sequence
MSTSAEEGTH LSESTPLLTS GSGSEETRTN PRSSGRSSRF RLLIIPTLII IVLVYATNWY 
ITSHSSVNQA LPFLGKPLKL RVYTNNIRLD NRYPVKGEQP WSKRKKQVIN SIDFNTALGH
ANVVCLQEVL HNQLVDILEG LNKNAEQIWT YYGVGRNDGL EAGEYAPILY KNSDWILLDN
QTFWLSETPW KPSKGWDAAL ERIVTMVTLE SRINPLIKVN VFNTHFDHRG VLARKKSAEL
IVDKMENFND NPSFLCGDFN TQPKDQPYHV LSDAGFKDSR KLVDYDYSYG HSTTFTGFNK
EKEDSSIIDY IWSPYFSQGN FGNDTSPVKD YEDEVANEMN NYYNLEHHLV YDIVIKQFGI
LHNYFKGFYF SDHRPVVASY EITRTHLL