Gene PICST_31672 details


Gene Information

Locus tag: PICST_31672
Symbol:
ID: 4838861
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1399208
End bp: 1401406
Gene Length: 2199 bp
Protein Length: 732 aa
Translation table: 12
GC content: 42%
IMG OID: 640390176
Product: predicted protein
Protein accession: XP_001384227
Protein GI: 150865134
COG category: [R] General function prediction only
COG ID: [COG2940] Proteins containing SET domain
TIGRFAM ID:
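
The coordinates and lengths listed above can be cross-checked with a few lines of Python. This is only a sketch under two assumptions that the record does not state: that the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields.
start_bp = 1399208
end_bp = 1401406
gene_length_bp = 2199
protein_length_aa = 732

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
computed_gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
computed_protein_length = computed_gene_length // 3 - 1

print(computed_gene_length, computed_protein_length)  # prints "2199 732"
```

Under those assumptions both computed values agree with the listed Gene Length and Protein Length.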


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.418717
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATA TAGACAACTC TGAGGTGGAT TCTCCATTCA CGGCCGATAC AACACCAGCT 
CCCAAACAAA GAACTCCGCA GCTCTTCCTT GATGTAGAGG ATAAGACGGA AGAGGCCCGG
TCCAAGTTTT CGGAGTTGGA AGCATGTACC TATTCTAGCA AGTTGATTGG TTCCTCGGGC
CAACAGGAGT ATATGACGTG TGACTGTGTG GAAGATTGGG ATTCAGAGTC TCAGCGCAAC
TTGGCCTGTA GCGATGATTC CAACTGTATA AACCGTGTAA CCTCTGTGGA ATGTATCAAT
GGCCATTGTG GTTGTGGAAA GAACTGTCAG AACCAGCGTT TCCAAAAAAG GCAATATGCC
TCAGTTTCTG TTTTTCAGAC AGAACTCAAG GGTTATGGTT TACGAGCAGA TGATGTTATT
CCCGAAGGTG GATTCATCTA TGAATATATT GGCGAAGTCA TAGACGAAGC TAGCTTTAGA
GCCAGAATGA TCGATTATGA TTCCAAGAAT TTCAAGCATT TCTACTTCAT GATGCTCAAG
AACGACTCGT TCATCGATGC CACCATCAAA GGTTCATTGG CCAGATTCTG CAACCATTCA
TGTAGTCCCA ATGCTTTCGT TGATAAATGG GTCGTTGGCG ACAAGTTGAG AATGGGGATC
TTTGCCAAAA GATCCATTCT GAAGGGTGAG GAAATTACTT TCGACTACAA CGTGGACAGG
TACGGAGCAC AATCACAGCC ATGCTACTGT GGAGAAGCCA ACTGTATCAA ATTCATGGGT
GGAAAGACTC AGACTGACGC GGCGTTGTTC TTGCCAGATG GTATAGCTGA AGCCTTGGGA
GTTACTCCCA AGCAGGAAAA GCAGTGGTTG AAAGAAAACA AGCATCTTAG AGCTAAACAG
CAAAGTGATG ATGCTGCAAT CAATGAAAAG TTCGTCAGGG AATTGATTGT AGAAGAATTA
AAGGAAAATG ATGTTTCGAA GGTGATGGGA GCTTTGATGA AATCGCAGGA TCTCAATATC
ATCAAGAGAT TAGTAGAAAG AATTCACAAA ACCAACGACG ATTTCATCAA TTCCTTGATT
ATCCGATTCC ATGGCTACAA GACATTATCG ACTATAATCA AGGAATTCAA ATCGGAGGAA
GACTTGATTG TACCGATTCT TGAGATCTTG GGCAAGTGGC CAAAAGTAAC AAGAAACAAG
ATTTCATCCT CTCAAATCGA AAGTGCAATC CAAGACATCA AGTCTTCGAC CAGTAACGAA
GAAATCCGCA CATTGTCCAC TGAGTTATTG AATGAGTGGA GTAAATTACA GATGGCCTAC
AGAATCCCCA AGAGTAAGAA CTCCTACTCT TTGCTGTATG GAAGAAACAC CAGATCGCCT
GAACCGGAAG AACCCAATGC AAGCAGTGAA CTCAAAGAAT CTAGTGAAGA AGCATTGCCT
GCTGGTTGGG AAGTGGCATA CGATCCAAAT ACTGGTAACA ACTATTACTT TCATAGAGAG
TTGAGTCTCA CCCGTTGGGA TAGACCCACA GCATCTGTGC CTACTGGACC AAAGACTCCA
CAAGGAAGAG GAGCTGGATC AAGAGGAACA TTGCCTAAAG GACCGAGAGA TAATGGATTC
CAGCAGAAGG ATATGAACAG AAGAGAAGAA GAAAGATTGA AGCAGGAAAA GGAAGAGCAG
TTCAACAAAC TCCAAGAAAA GGAGAAGCAG TTGTTATTAT TGATTCAACA GGCTGGAGAA
CAGGAAAGAC AGAAACAGTT AGAGGAAAAG ACACAAAAGG TAACAAAGAA CAAGAAGTCG
TCTAATGGTC AACACCGCCA TCACAAATCG CACGATGACT CAAAAAATAG TAGCTCCAAG
TCCATTTCAG TAGAGGATAA ATGGAGGCAT GTTTTGGCCA AACATGTTCC CAATTTGATC
AAAAAGCATG AGAAGGAAGT GGGTAGAGAA AATATAAAAG GTTGTGCTCG TGATTTGGTG
AAGATTCTTG TGGCCAAGGA GTTGAAGAAA GATGCCGAAA TCTCTCCTCC CAAAGAGTTG
GACAGTAACA AACTTAAGAA AATTAAAGAG TACTCGAAAG TGTTTATGGA CAAGTTCTTG
ATTAAGTATA GAGCAAAACA TGAAAAACAC AGAGACGATA AAAAGAGGGC CCATGAAGAC
GATGGTGAAG AAAACGATGT CAAGAGGGTA AAGGAATGA
 
Protein sequence
MSDIDNSEVD SPFTADTTPA PKQRTPQLFL DVEDKTEEAR SKFSELEACT YSSKLIGSSG 
QQEYMTCDCV EDWDSESQRN LACSDDSNCI NRVTSVECIN GHCGCGKNCQ NQRFQKRQYA
SVSVFQTELK GYGLRADDVI PEGGFIYEYI GEVIDEASFR ARMIDYDSKN FKHFYFMMLK
NDSFIDATIK GSLARFCNHS CSPNAFVDKW VVGDKLRMGI FAKRSISKGE EITFDYNVDR
YGAQSQPCYC GEANCIKFMG GKTQTDAALF LPDGIAEALG VTPKQEKQWL KENKHLRAKQ
QSDDAAINEK FVRELIVEEL KENDVSKVMG ALMKSQDLNI IKRLVERIHK TNDDFINSLI
IRFHGYKTLS TIIKEFKSEE DLIVPILEIL GKWPKVTRNK ISSSQIESAI QDIKSSTSNE
EIRTLSTELL NEWSKLQMAY RIPKSKNSYS LSYGRNTRSP EPEEPNASSE LKESSEEALP
AGWEVAYDPN TGNNYYFHRE LSLTRWDRPT ASVPTGPKTP QGRGAGSRGT LPKGPRDNGF
QQKDMNRREE ERLKQEKEEQ FNKLQEKEKQ LLLLIQQAGE QERQKQLEEK TQKVTKNKKS
SNGQHRHHKS HDDSKNSSSK SISVEDKWRH VLAKHVPNLI KKHEKEVGRE NIKGCARDLV
KILVAKELKK DAEISPPKEL DSNKLKKIKE YSKVFMDKFL IKYRAKHEKH RDDKKRAHED
DGEENDVKRV KE
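
The record lists translation table 12, the alternative yeast nuclear code, which reads the codon CTG as serine rather than leucine. The sketch below shows how that affects translation of this gene. It is a minimal, hand-built illustration rather than the pipeline that produced the record: the `translate` helper and the constant names are hypothetical, and the two test fragments are copied from the gene sequence above (the start of its first row, and an in-frame stretch of its twelfth row that contains a CTG codon).

```python
# NCBI translation table 12, written in the conventional TCAG codon order.
BASES = "TCAG"
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLSPPPPHHQQRRRR"  # first base C (CTG is S here, L in the standard code)
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, TABLE_12_AAS))


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGTCAGATA TAGACAACTC TGAGGTGGAT"
ctg_fragment = "AGATCCATTC TGAAGGGTGA G"

print(translate(first_row))     # prints "MSDIDNSEVD"
print(translate(ctg_fragment))  # prints "RSISKGE"
```

Both results match the listed protein sequence, which begins MSDIDNSEVD and contains RSISKGE in its fourth row. With the standard genetic code the second fragment would instead read RSILKGE.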