Gene PICST_73391 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73391 
Symbol 
ID4839799 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1256577 
End bp1258780 
Gene Length2204 bp 
Protein Length712 aa 
Translation table12 
GC content45% 
IMG OID640391114 
Productpredicted protein 
Protein accessionXP_001385934 
Protein GI126138822 
COG category[R] General function prediction only 
COG ID[COG5354] Uncharacterized protein, contains Trp-Asp (WD) repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGCAATATTC CAGAATGTCC TTGACAGAAG CTGAGTATCA CGAGCTTGAG AAGCAAGTTA 
ACTTGGACGA CATTGACTTT TCCGACTTGG AAGAGCAATA TGAAGTCGAT GTCGGCTTAG
ACAATTATGT TGTAGTAGAC GGTGCCCCAA TTGCACCAGA GGCCAAGGTC CCTGTATTGA
TCAAGGTCTT AAAGAAGTTG TTCAACACTG TAGGAAAGGT TGTTGAAGGC GACGAAGGAA
TTTACATGCC TTTGGAAGAC GGAAAATCGA AGGGTTACTT GTTTGTTCAG TTTGAAACTT
CGGAAATGGC CGAAGCTGCA ATCAAGCAGT TGCATGGCAA AAAGTTAGAT CAGAAGCACA
GATTGCTTGT CAACAGATTG TCTGATATCG AGAAATACGG TGTAGAAGGC AATGTTGCTG
CCGAATTTGT AGAACCAGAG TTACCTCCAT TCAAGAGCCA CGGCTACTTG AAGTCATGGC
TTCAGGATCC TCAGGGCAGA GACCAAATAG CCTTGCACCA TTCTGAGACA TTTGGTGTTT
TCTGGAACAA GAAGAAGAGC GACCCCGAAC CTGTCTTTGA GCCAAGAAAG TTCTTCACCT
CGAAATACGC CAAGTTCTCT CCAAAGGGTA CCTACTTGTT CTCTATCCAT CCCCAGGGTG
TTCAGTCTTG GGGGGGAGCT GATTTCTCCA GCATTGACAA ATTCATGCAC AACCAAGTCC
GTTTGGTTGA CTTTTCCCCC AACGAAAAGT ATATGGTGAC TTTGTCGCCA CTTCCGATCA
CTGCACCTGA TTCTGCTGCT GAAAGAGCTG TTTTCCCCTT TGGACCAGAG TCTTATGGCC
ACAAGTTGGT CATCTGGGAC TTGACCACTG GTGAACCAGC TAGAACTTTT GCCTTGCCTC
CCCACTTAGA AGGACAGAAG GAAATGCCAT GGCCATTGGT CAAATGGTCC CATGACGATA
AGTACTGTGC TCGTCAAGGT CCTGGTGCTT TAGCTGTTTA CGAAACCCCA TCGTTCCAGT
TGTTGGACAA GAAGTTGATC AAGATTGACG ACATTGTCGA CTTTGAGTGG GCACCTGCTG
GCGTCCATCT TGCCAATAAC AAGTCCGAGA ACGGCCACCA CTTGTTGTCG TACTGGACTC
CTGAATCCAG TAACCAAACT GCTAGAGTCG CTGTGATGCA AATTCCAACG AGACAGATCC
TCAGAACCGT CAACTTGTTC CAGGTTAGCG ACTGTAAGAT GCACTGGCAA AGCGAAGGTA
AACTCTTGTG TGTAAAGGTA GACCGTCATA CCAAGTCTGG TAAGACCATC TTTACCAACT
TAGAGTTCTT CAAGACCACT GAAAAGGACA TCCCTGTAGA GAAATTGGAG TTGAAGGAAA
TCGTTATCAA CTTCGCCTGG GAACCAAAGT CCGAGAGATT TGTTATCATC TCCAGATTAG
ACGATGGCAA CCTCAACTCC GCCATCCCTA AGAATATCAT TGACTTCTAC GCTCCAGATG
TGAATGGTAA GGGTAAGTCT GCCACATCTG TATACAAGTC GTACAAGACC ATTACTGACA
AGCACTCTAA CACTGTCTTC TGGTCACCAA AGGGTAGATA TGTTGTTGTA GCCACCATCT
CCAGAAGTAA CGGTGAGATC GAGTTCTTTG ATGTTTCGTT TGATGACTCT AACAAGAACG
CTCCAGCTAA CGTCAAGTTG TTGAAGAATG ACAAGTTCTC TGGTATGACG AACATCAGTT
GGGACCCATC TGGTAGATTT GTAGCAACTT GGTCGTCTTC GTGGTTACAT ACTATCGAAA
ACGGTTACAA GTTATACGAG TTCACAGGTA ACTTGTTGAG AGATGACTCC ATCGATCAAT
TCAAGGAATT TATCTGGAGA CCAAGACCAG CCTCTTTGTT GAACTCTGCC GACAGAAAGA
AGGTCAGAGC CAACTTGCGT GAGTACAGTG CTCAATTCGA AGAGTCTGAC GCCATGGAAG
CTGATGCTGC TCTTAGAGAA TTGATCTATG CCAGAAGAAG AGCCTTGGAA GACTGGAAAG
CCTACAGAGC TAAACACGCC AGCAAGGCTG TCAAGGCTAA TGAGGTTCAA GCTGAAATCA
TTGAAGAGAT CAAGGAGGAG ATCATTGAAG AGAAGGAAGA GATTGTTGAG TAATCTCACT
AATTGTCTTC ATACTATACA ATCATACATT ACTAATCTTA GAAA
 
Protein sequence
MSLTEAEYHE LEKQVNLDDI DFSDLEEQYE VDVGLDNYVV VDGAPIAPEA KVPVLIKVLK 
KLFNTVGKVV EGDEGIYMPL EDGKSKGYLF VQFETSEMAE AAIKQLHGKK LDQKHRLLVN
RLSDIEKYGV EGNVAAEFVE PELPPFKSHG YLKSWLQDPQ GRDQIALHHS ETFGVFWNKK
KSDPEPVFEP RKFFTSKYAK FSPKGTYLFS IHPQGVQSWG GADFSSIDKF MHNQVRLVDF
SPNEKYMVTL SPLPITAPDS AAERAVFPFG PESYGHKLVI WDLTTGEPAR TFALPPHLEG
QKEMPWPLVK WSHDDKYCAR QGPGALAVYE TPSFQLLDKK LIKIDDIVDF EWAPAGVHLA
NNKSENGHHL LSYWTPESSN QTARVAVMQI PTRQILRTVN LFQVSDCKMH WQSEGKLLCV
KVDRHTKSGK TIFTNLEFFK TTEKDIPVEK LELKEIVINF AWEPKSERFV IISRLDDGNL
NSAIPKNIID FYAPDVNGKG KSATSVYKSY KTITDKHSNT VFWSPKGRYV VVATISRSNG
EIEFFDVSFD DSNKNAPANV KLLKNDKFSG MTNISWDPSG RFVATWSSSW LHTIENGYKL
YEFTGNLLRD DSIDQFKEFI WRPRPASLLN SADRKKVRAN LREYSAQFEE SDAMEADAAL
RELIYARRRA LEDWKAYRAK HASKAVKANE VQAEIIEEIK EEIIEEKEEI VE