Gene PICST_33460 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33460 
Symbol 
ID4840618 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp698987 
End bp701440 
Gene Length2454 bp 
Protein Length817 aa 
Translation table12 
GC content42% 
IMG OID640391933 
Productpredicted protein 
Protein accessionXP_001386147 
Protein GI150866515 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.242264 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATTT GCATAGCTAG GACTTTGGCT CGAAGTCCCG TGCGACGTTT AGGGCGACCG 
GTTCGACTAA ATTTGCGAAT AGCATATTCT TCCAAAACCA AACCGCATCC AACTACTTCT
AAGGAAGAAG AACCTACTTT CTTTGAAGAA AACTCCGAAA ATAACAAGAA TAGTATTCCA
TTCAAAGACA AGTTGTCAAG CTTTATAGTT AACTTCATTA AGCAAGAGTC GCAGGCATTG
GCATCTTCAT TTGACAACGA TCTTCTCTAT GGAAATGCCA TAGATGTCAA TACAGACTAC
AGTATCATTC TCGAGCCACA ACTTCAGAGC GAAATAGACA ATGCCTTGGA TAACTACAAC
AAGCTCTTTA TACTATTGTC ACATAACCCG ATATCCATTC CTGCTCAGGC ATATTTGCAT
TTTTTGGAGA AAATTGACAC TCCCCTAGAT CTGAAACTTA GGTCTCTTTT GTTGAAGAGA
CTTCTATACC ACCAGCAGTA TGAAACGTGC TGGCGGATTT GCATAGACAC ATATACTTCT
TTGACTGATA TTGAAGATTT CATAGACTTG GCAGCTGTAA GTTTAAGGGA GAACAACAAT
TCCACCTTTG GGCTAAATCT GCTTCTTGTT GCTTCCCATT CTCAGGTGTT CAACCAGAGA
CTTCATAACC ATATACTTGA CACCCTAAGT TTCAAGTTCA AGATTCCTCG TCCAGATCTT
GAACATAAAT TGAGTTTCTA TGACGAGCTC CAGACGATGC AGACCCTAGA AGATTTGGCG
TTATTTAGGG AGAATAATGG GCTGTATATG TCAAATGATG TCGACTACGA AGTCATGTAT
TTACGAAAAC ACATTCTGCT CATCCGGAGC GATACAAGCA TAAAAGTGAG CGACTGCTAC
GGTTTGCTTC ACGAAAGAGA AAACTTGGTT AGAATGCCAG GTTGGCTCAG TTTCATTTCA
CCATCATTCT TGGGATCCAC GACATTGAAT AATGCTGTAG GATCTTTAGT CTCAACCCCA
TCCATATCTA CGAAAATCGT AGAAAGCATC AATCACATAT TGAGGACAAA CAAACATATT
GGACTCAACG AAGCTGATGT CGTGTACATA CTTAATTCAC AGGCCAATAT TAAGGCCTAC
AACATCTACC TGTTATATAT TGCATCCAAT CCCCTGTTGC CCAACAAGCG GATTGTCAAC
CTTCTCATGG CACAAGTTAT TTTACAACTA CATTACGTTC AGATTCGGTC GATACTCTTT
CGTCACTACC AAGTCCTAGA CGACGATGTC TTGGTTGAGG CATTGGTAAG AGTATTGGGA
GAGTCTACTA AAGACTTTGA AGAGATTATA GGTAAACTCT TCAAGAACTC TACTTTTTCA
CAATCACTAA CAATTTCTAC TACTATTGTC GACTTAGCTG TTAACTCTGG CTACTCGGTG
TCACAGATTG AGCGAATGTT GCTCATTTTC CACGGGTTCG ACAAGTCTGG CAAGTTGTTA
GGAAACTTAC TCAAGTCTGA GACACTCCTG TCATATTCTG AACAACAACA TATAGAACTA
TACTCCAAAT TAATCATGGC ACCAGAGATT AGCTCGACCA AGACGCTACT TGAATTGAAC
CGTTGTATAC TTCGCCGTGG TTTGATTGAG GAGACGTTGA TAAGTGGCAT TCTTGAACGA
GTACTCAATA GCACCCTACG AAAAGATTTC ATCTTGGCAA GAGCTCGCGA TCTGAAAAGA
AAACTTCCCC AGGGTTTCCA ACGTATTCAC ATGTTAGCCA ATGTGAGTGA GCGAGCCAAT
TTCCACAACT CACTCAGAGC CTTGGGCCAG ACCTATTCTC TCTTAGGTGC TAAAGACATG
GCACGAGTTG TGGACATCAC CAGCAACTAT ATCTTCTCGC GCCACTTCAC GTTTTGTAGA
GACAAGTTTG GGCGTGATTA CTTGATTAAT AATGTCGTAT CTGAGATGAT GCGATTTGTA
GAACGAGAAT CACGTACAAA GCCGAAAGAA ACGATTTTCA AAGTCAGAGA CTTGTTAACA
GAGTTGAAGA GCGACTCCAA GGTGATCCGG TGCCATTTGT TCAGAATGAT GGTAAGAGAG
GATCCTTCAA AGGCGATACA GTTGCTTCAG TTCTACAGTG ACAACAAGTC CAGCTTGGCT
GGGATCATAC CGTATATGAT TTCGGGAATT CTTTCTACAG AGAAGTTGGA GAAGAATCGC
AAACTTCAGG TTCTAGATCG ATTCCTTTCT GAGCTTGTGG TTTTGGGATA CAGGCATAGA
ATTACGCAGA AGACGGGCCA CGAGTTGGTG AGACTCTTGA AACAGGATAG TGTTTCAGGC
AAAGCGCTTA CGCCGCAGCT GGTGAGCTGG ATTCTTGAGT TTTCACGTAA CAATAAGGCT
CTTAACAGAG TGCTACAAGT ACATTTCCGC AGAGACAAAA AGAACACGTT GTAA
 
Protein sequence
MFICIARTLA RSPVRRLGRP VRLNLRIAYS SKTKPHPTTS KEEEPTFFEE NSENNKNSIP 
FKDKLSSFIV NFIKQESQAL ASSFDNDLLY GNAIDVNTDY SIILEPQLQS EIDNALDNYN
KLFILLSHNP ISIPAQAYLH FLEKIDTPLD SKLRSLLLKR LLYHQQYETC WRICIDTYTS
LTDIEDFIDL AAVSLRENNN STFGLNSLLV ASHSQVFNQR LHNHILDTLS FKFKIPRPDL
EHKLSFYDEL QTMQTLEDLA LFRENNGSYM SNDVDYEVMY LRKHISLIRS DTSIKVSDCY
GLLHERENLV RMPGWLSFIS PSFLGSTTLN NAVGSLVSTP SISTKIVESI NHILRTNKHI
GLNEADVVYI LNSQANIKAY NIYSLYIASN PSLPNKRIVN LLMAQVILQL HYVQIRSILF
RHYQVLDDDV LVEALVRVLG ESTKDFEEII GKLFKNSTFS QSLTISTTIV DLAVNSGYSV
SQIERMLLIF HGFDKSGKLL GNLLKSETLS SYSEQQHIEL YSKLIMAPEI SSTKTLLELN
RCILRRGLIE ETLISGILER VLNSTLRKDF ILARARDSKR KLPQGFQRIH MLANVSERAN
FHNSLRALGQ TYSLLGAKDM ARVVDITSNY IFSRHFTFCR DKFGRDYLIN NVVSEMMRFV
ERESRTKPKE TIFKVRDLLT ELKSDSKVIR CHLFRMMVRE DPSKAIQLLQ FYSDNKSSLA
GIIPYMISGI LSTEKLEKNR KLQVLDRFLS ELVVLGYRHR ITQKTGHELV RLLKQDSVSG
KALTPQSVSW ILEFSRNNKA LNRVLQVHFR RDKKNTL