Gene PICST_55159 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_55159 
Symbol 
ID4837480 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1841296 
End bp1843065 
Gene Length1770 bp 
Protein Length540 aa 
Translation table12 
GC content42% 
IMG OID640388795 
Productpredicted protein 
Protein accessionXP_001383126 
Protein GI150864350 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.922766 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GCGATCCCCA ACTACGAAAA ACCGCTTTAC GAAAGATTCG CTTCTATCTA CCACACTTTG 
TTGGATCTCA AGAACAACCG TAACAAATAC ATCAACTCCA AACAGGTGTA CCATATCTAC
GAGCTGTTTC TCGACTTGCT CCACGAATTG AAGATCACGC GTAAGGACGA AGAACTCAAG
GGCATGACAT TGAACTTGCC CAACGCTAAC GACTTGTTGA TCGATGACAT CTGGCAGTTG
GTGTCACTCT GTTTCGTCAC ATGTGGCTTG ATCAAGTTTG CCCCAGCCAC CTATTCGTCT
TTGTCTACTG TCAGCAAGTT GTTGGACCAC TTGAGAGGTT GTCAGGTTTA CACATTGGAT
GACTTGCAGC CAATCAAAAG TCGTTTGGAT GAAGTGAAGA GTATCATCGA TAACAACAAC
GATGATGACG ATGAAGACGA TGAAGGAAAT TCGCAAAAGA ACATACACAA GATCGAAGAG
AACTTGTTAT TGAGAAACAA GTTGAACAAG TGTGAAGCGT TGTACCGTGA GTTGGAGGCC
AACTTTAACA ACATTCCTCT TGATTTGGAG CAGACTTACA ACGAATTGAT TTCTATGAGA
AAGACCATAT TGAACTACTT GACCAACTAT GACGACAACG AAGTTGGCCC TGGTTCATCC
AGCTCTCGTT ACAACAAGAT TGTAGCCAAA GTCAATCAGT TCAAGTTGAA ACTAAAGGAA
ATTGAATCAA CGAGAGACCC CGTAGATGGC AAGTTCCACA GTAAAGAAAT CAGCGATTTC
GATGACAATA AATTGAACTC AGCGCAGGCT GTTCTCAATG GTTTAATTGA TGACTGCAAC
AATTTGTTAA GCGACTTGTT GATCCAAAAT GACACGGGCA ATTTGTCTGT TTTGCTTGAG
ACGTCATTAG ACTTAAAAGA CGATGAAACA ACAAGAGGAC TCAAAAACAT CAACAAACGA
TTCGGTGCCT TGTACCATCA GCTTCTTGAT CTCAAGGTAA CTTTGGAAAA CTTGTTGGTC
ACTAGAAGAT GGACCATGAG AGAGACGGAT TTGTACACGT ACCAGAAATC GCTTAAGAGC
ATCGACGACG AGAGAATAGC GTTGAACGAT CAAGTCTCAA AGCTTCCTCG TGAAGAGGCT
GCTACTTCTA CGATTCGTAC AAATATCAAT CAGCATTTGC GTAAGAACCA CATCCTCATA
TTGTATCTTT TGAGAAGATG CTATGCCTTG ATCTACAAGC TTTTGGAAAG TTCTGAGCCT
GTCAGTGAAT CGTTGCAGCC GATCCATAAC CAGCTCTCTA CTGTTAGAAA GTGTTTATTA
GAAATCAAGA GAGTAGATGG CTTGAACAAC CTCAGAGAGT TGTACCCATT CCAGTTCAAA
TTGGCATCGT TAGACAATTT GAGAAGCGAT GGTAAGTTTA TCATCAACAA CACCATTCCT
GAAGGACAAG GGACGTTGAA CGCGTTACTT GCTGAGTGTT TTGATATCAT CTACGAACTC
AAGATAGAGT TAGAGGAAAA GGAGGACAAT GAAGACATCG AAGAAGATGA TCCCGCTGCT
GTCTCGAAAG TGTCGCACAT AATGTCGGAC AACAACTTGC TGGAGATGAC AGATGACGAA
GACATTCAAT CAGACGATGA AGTTGAGTTG AAGAGAAAGA GATTCATGGG CTTCAACGAG
GCTGATTACG ACCAGGAATC CGAGTCTGCT TTTGACGATG ACGATTACAG CTTGAGCGAG
TCAGAGTTCG AGGGTAACGA CTACTACTGA
 
Protein sequence
AIPNYEKPLY ERFASIYHTL LDLKNNRNKY INSKQVYHIY ESFLDLLHEL KITRKDEELK 
GMTLNLPNAN DLLIDDIWQL VSLCFVTCGL IKFAPATYSS LSTVSKLLDH LRGCQVYTLD
DLQPIKSRLD EVKSIIDNNN DDDDEDDEGN SQKNIHKIEE NLLLRNKLNK CEALYRELEA
NFNNIPLDLE QTYNELISMR KTILNYLTNY DDNEVGPGSS SSRYNKIVAK VNQFKLKLKE
IESTRDPVDG KFHSKEISDF DDNKLNSAQA VLNGLIDDCN NLLSDLLIQN DTGNLGLKNI
NKRFGALYHQ LLDLKVTLEN LLVTRRWTMR ETDLYTYQKS LKSIDDERIA LNDQHLRKNH
ILILYLLRRC YALIYKLLES SEPVSESLQP IHNQLSTVRK CLLEIKRVDG LNNLRELYPF
QFKLASLDNL RSDGKFIINN TIPEGQGTLN ALLAECFDII YELKIELEEK EDNEDIEEDD
PAAVSKMTDD EDIQSDDEVE LKRKRFMGFN EADYDQESES AFDDDDYSLS ESEFEGNDYY