Gene PICST_30500 details


Gene Information

Locus tag: PICST_30500
Symbol:
ID: 4837816
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 219272
End bp: 220444
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 12
GC content: 43%
IMG OID: 640389131
Product: predicted protein
Protein accession: XP_001383672
Protein GI: 150864723
COG category: [R] General function prediction only
COG ID: [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)
TIGRFAM ID:
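
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes a single terminal stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(219272, 220444)
print(length)                  # 1173
print(protein_length(length))  # 390
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.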


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.363686
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCTTCG ATTTGGAGAA AAAGATAACC ACTGCCCATA TACCTCGGGC CAGTGGTTCG 
ACTATTTTAG CCTCAGACAC CTTGAATGTC GTCTACAACA AGTACAAGTC AACGGCTGCA
ATTCCTACAG ATGCCATCAG ATACAACTTG ATTTTCTCCC ATGGAACAGG AATGAACAAG
TCGATCTGGC ACTATCACAT CAAAAGCTTG TTTGAGTGGT CGCAGAAGAG CAACGGAAAA
ATCTACATCG ACTCCGTCAT TGCCATCGAT GCTGCTGGCC ATGGTGATTC AGGAGTTATC
AACAGGAATA AGCTCGGCTG GATTTTCAGA TGGGACGAAG GAGGCAAAGA CATTATTGAA
GTGGTCAGAA ACGAACACAG AACTACGGGG GATTTCCAAA ATAACTTCAA GTCAAGAAAC
ATCCTCATTG GACATTCCAT GGGAGGCTTT CTGTCATTGT TGGCTGCTTT CTATGAGCCG
GACTTGTTCG ATGCAACTGT GCCAATAGAA CCTGTCGTTT ATCTTGACTC CAGATCAACT
CGTAAATTTT CTCAGAGATT TCTGATCATA GGCAAGATGA TCATCAATGA ATTCGACACG
AAACAGGCAT TTGAAGATTT CTTCAAGGTG CACTCGTTTT ACAAGAACAT AGACCCCAAG
GTAATGGACG ACTTCTTGAA TGATGAATTA TTGGAAGTGA TCGACCCTAA AACCAAAGAC
GTCAAGTACC GCATCAAGTC AAGTTCTCAA GCCCAGATGG CAGGATATGT ATCTTCTGCT
TTGGTGTTGC CTCTAGGCAT GGATATTTAC AAACACATCA GAGTCCCCAT TGCCCATGTC
ATTGGTAAGA ACGCTAAATG GAACCCTCCC GAATCCACTG AATTTTTCAG AGGCAGTGTA
AATCCAGATT TTTTAGCAGC TACATACGAT ATTGAAGGAG GTGAACATTT GGTCAATGCG
GAAAAGCCAG ACGATTTACT CGAGGTTCTC AAAGATTTCA TCTTGAAGAG AAAAGTTGAG
TTCAAAAGTA CTGCTGCGCA ACTTCCAGAG CAAAAAGCAG AGGGTTTGAG ACAAAAGGTA
TTTGAATCTG AGATCCCCAA GTTGCTTAAT GGTGATTTAG GCACATTGTA CGGAATACAA
CACACCGCCT TGGCCAAAGC TTCCAAGTTG TAA
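
The gene sequence is printed in space-separated blocks of ten bases, so the spacing has to be removed before any analysis. The sketch below shows one way to do that in Python and to compute a GC percentage; the helper names `clean_sequence` and `gc_content` are hypothetical, and how the database itself derives its GC content field is not documented in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence listing, copied with its spacing intact.
first_row = clean_sequence(
    "ATGTCCTTCG ATTTGGAGAA AAAGATAACC ACTGCCCATA TACCTCGGGC CAGTGGTTCG"
)
print(len(first_row))                    # 60
print(round(gc_content(first_row), 1))   # GC percentage of this row only
```

Applied to all 1173 bases rather than a single row, the same function should give a value close to the 43% reported in the GC content field.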
 
Protein sequence
MSFDLEKKIT TAHIPRASGS TILASDTLNV VYNKYKSTAA IPTDAIRYNL IFSHGTGMNK 
SIWHYHIKSL FEWSQKSNGK IYIDSVIAID AAGHGDSGVI NRNKLGWIFR WDEGGKDIIE
VVRNEHRTTG DFQNNFKSRN ILIGHSMGGF SSLLAAFYEP DLFDATVPIE PVVYLDSRST
RKFSQRFSII GKMIINEFDT KQAFEDFFKV HSFYKNIDPK VMDDFLNDEL LEVIDPKTKD
VKYRIKSSSQ AQMAGYVSSA LVLPLGMDIY KHIRVPIAHV IGKNAKWNPP ESTEFFRGSV
NPDFLAATYD IEGGEHLVNA EKPDDLLEVL KDFILKRKVE FKSTAAQLPE QKAEGLRQKV
FESEIPKLLN GDLGTLYGIQ HTALAKASKL
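
The record lists translation table 12, which in the NCBI numbering is the alternative yeast nuclear code: its amino acid assignments differ from the standard code only in that CTG is read as serine rather than leucine. The following Python sketch builds both tables by hand (no external library is assumed) and translates two short fragments taken from the gene sequence above; the names `TABLE_1`, `TABLE_12` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = ["".join(c) for c in product(BASES, repeat=3)]
TABLE_1 = dict(zip(CODONS, STANDARD))
# Table 12 differs from the standard code only in CTG, read as serine.
TABLE_12 = {**TABLE_1, "CTG": "S"}


def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring whitespace and stopping at a stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGTCCTTCG ATTTGGAGAA AAAGATAACC", TABLE_12))  # MSFDLEKKIT

# In-frame fragment of the gene sequence that contains a CTG codon.
fragment = "GGCTTTCTGTCATTG"
print(translate(fragment, TABLE_12))  # GFSSL
print(translate(fragment, TABLE_1))   # GFLSL
```

The second fragment shows why the table matters here: the listed protein sequence contains GFSSL at this position, which is the table 12 reading, whereas the standard code would give a leucine at the CTG codon.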