Gene PICST_33291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33291 
Symbol 
ID4840462 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp228544 
End bp229632 
Gene Length1089 bp 
Protein Length362 aa 
Translation table12 
GC content39% 
IMG OID640391777 
Productpredicted protein 
Protein accessionXP_001386061 
Protein GI150866452 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACTCTC CTATACTTCG GTATGATTCC GAATCTACTT TAGAAAGTCC CTTCCATCCT 
CCGCGCAACC AATCAAAGAG ACACCACTAC AGTGAAGAAT CAGACAACGA TATGTCCCCC
TTGAAAAGAA GACAGATTCT ACATGAACCT TCTGAGCTTG CGAACGAACG AAAATTCGAA
ACGAAGTACA ACTTTAATGA TTGGATCACT AGACATCCAA TAGAAGCTTC AACACCCATG
AGACCTACGA AGGGAAAGTT TCCAACGGAC TTGTACAAAG AAGAATCCAC TCCATTTTCT
TATAGGGAAG AAGCTTCAAC TTCTGAAAAA AGTCTAGGTC TATCTCTATC CGGAAAACTC
TACAGGGACT TCAAAACGCC TCCTACTACT GCCTTCGACA TTTTTGAAAG GACCCAAGTT
ACAGAAGAGA GCATTATCTC GCCTGAACTT GAAGCAGTTA GAAGAAGCTT CTTTGCGGAT
TGTTCACTCG GAATTGAAAA TGATCATATA ACCCAACAAT CGAAGGAAGA AAAAACTGGA
CAATCAAATA CTGAAGGCTA TGTAATGGAT CTTATAGGGT CTTCGCTTCT TCCTCAAAAT
CTAAGACCTT ATCGATACTA TATCAATAAA GATGTGTCAC CCATTTACAA GACAACACGA
AGACTAGAAG CTGAAGATAA GAAAAATAGC TTAATCGATT CAGAAGCAGA GCTAGTGCCA
ATAACCACTT CTTTCTTTTT AGAGAAAGAA CCTGGGACTT TTCCTATTCA ACCAATTTTC
CCAAATTGGT TGGTCTCAAA ATTGGCAACT ACCGAAAAAT CAAACTTCAA AAACACAATT
CCTTCCACTA AGACTGAAAG AGAGTACTTT CCAGAAGGTG AAGACAATCA GACTGTGCTC
TCTTGTTCGG ATATCCCAAT TCTGAGTGGT ACTAAAAGAG CATGCAAGCG TTCAACTTCT
CAAAGGTTTT TTGATTGGAT TGAAAAGTTG AAGAGAGAAC ACGAAGAAAG GAGACTCAAA
CGTAAAGCCG TAGGCAAAGG TGTGATTCAA CTTACACGTC AACCACAGTA TTCTTATAAA
GACAAATAG
 
Protein sequence
MYSPILRYDS ESTLESPFHP PRNQSKRHHY SEESDNDMSP LKRRQILHEP SELANERKFE 
TKYNFNDWIT RHPIEASTPM RPTKGKFPTD LYKEESTPFS YREEASTSEK SLGLSLSGKL
YRDFKTPPTT AFDIFERTQV TEESIISPEL EAVRRSFFAD CSLGIENDHI TQQSKEEKTG
QSNTEGYVMD LIGSSLLPQN LRPYRYYINK DVSPIYKTTR RLEAEDKKNS LIDSEAELVP
ITTSFFLEKE PGTFPIQPIF PNWLVSKLAT TEKSNFKNTI PSTKTEREYF PEGEDNQTVL
SCSDIPISSG TKRACKRSTS QRFFDWIEKL KREHEERRLK RKAVGKGVIQ LTRQPQYSYK
DK