Gene PICST_84214 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_84214 
SymbolSDT1 
ID4839592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp959905 
End bp961072 
Gene Length1168 bp 
Protein Length287 aa 
Translation table12 
GC content40% 
IMG OID640390907 
Productsuppressor of deletion of TFIIS 
Protein accessionXP_001385193 
Protein GI150865821 
COG category[R] General function prediction only 
COG ID[COG1011] Predicted hydrolase (HAD superfamily) 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED
[TIGR01993] pyrimidine 5'-nucleotidase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.538947 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0127046 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TCGTCGAAGA AAGATATTAA GAGTTTATTA GATAGAAAGA ACTTGTTGAA CCTTTCTAGA 
ACTTTTCAAT TTTTGTTTCT ATTTTTCGAT TTCAGCATCC ATTTCCAGAT ATTTTTCACT
CCTTAGTCAT CTGAACCATA ATATAAATCT ATAATTCGAT TTCATCACAA CAAATACTAA
ATTTCATCTT TTTCAAATAC AAATTTATAC ATGAAATCAT TCTGAAAACT CAGAATCTTC
TAACCTCTCA GCCCTCACAC AGCAGATCTA CACACGATGA CTATTTCAAA ACTCGAAGTC
CAGACGAATC CCGTTCACTA CACCAACCCA GAGCTCACGG AACAGGAACA ACTTCCAGGA
ACTATTGTCC ACTTGCCCTT TGGCTACGGG CCCATGCCAG AAAGCTTGAC CAACAAGAAG
ATCTTCTACT TTGACATCGA TAACTGTTTG TACCATCGTT CAACGCTGAT CCACGAATTG
ATGCAAGTCA AAATCCACAA CTATTTCAAA GACAACCTAC AGCTCAACGA CGAAGACGCC
CACAAGTTGC ACATGAACTA CTACAAGACC TACGGGTTGG CTATTGAAGG TTTGGTAAGA
AACCACCAGG TGGATGCTTT GGACTACAAT GCCCAAGTTG ATGATTCTTT AGACTTGAAA
TCTGTTTTGT CGTACAATGC TGAATTGCGT AAAATGTTGA TTGCTATTAA GGCAAGTCAT
CAGTTCGACT ATTTCTGGTT GGTGACGAAC GCGTACAAGA ACCACGCCTT GAGAGTGGTA
TCGTTCTTAG GATTGGGTGA CTTGTTTGAA GGCTTGACCT TTTGTGATTA CTCTAAGTTC
CCTATCATCT GTAAACCTAT GGCCAAGTTC TTTCATGGTA CACTTAACGT TACCAATGTG
GACTATAATG ACGCCGAGGT CATGAAGAAA CAGTACTTTA TCGACGACAG CGAGCTTAAC
GCAAAGGCTG CTCACAAGTT GGGCTTTGGA AATGTGATCC ATTATGTGGA AATTGACCTG
GACTACGATA GAATCAAAGC AAAGCCCGAT TTTGAAGAAT ATTATGGAGC TGGCGATAAT
AGCGACAAGT CCAAAATCAG AATACTCCGC CACATACTTG AATTGCCTTC TGTCTTGTAG
ATCATATAGA ATAATAAACA CAATATAG
 
Protein sequence
MTISKLEVQT NPVHYTNPEL TEQEQLPGTI VHLPFGYGPM PESLTNKKIF YFDIDNCLYH 
RSTSIHELMQ VKIHNYFKDN LQLNDEDAHK LHMNYYKTYG LAIEGLVRNH QVDALDYNAQ
VDDSLDLKSV LSYNAELRKM LIAIKASHQF DYFWLVTNAY KNHALRVVSF LGLGDLFEGL
TFCDYSKFPI ICKPMAKFFH GTLNVTNVDY NDAEVMKKQY FIDDSELNAK AAHKLGFGNV
IHYVEIDSDY DRIKAKPDFE EYYGAGDNSD KSKIRILRHI LELPSVL