Gene PICST_55575 details


Gene Information

Locus tag: PICST_55575
Symbol:
ID: 4837011
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1479070
End bp: 1482060
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 12
GC content: 39%
IMG OID: 640388326
Product: Protein required for cell viability
Protein accession: XP_001382504
Protein GI: 150863879
COG category:
COG ID:
TIGRFAM ID:
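
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the listed gene length is consistent with it) and that the unspliced CDS ends in a single stop codon. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 1479070
END_BP = 1482060
GENE_LENGTH_BP = 2991
PROTEIN_LENGTH_AA = 996


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # the stop codon encodes no residue


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 2991 996
```

Both computed values agree with the Gene Length and Protein Length fields.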


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.154341
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.828998
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTCCTA AGATCGAGGA ACTTCCTCCG AAAGTGGAGA ATGTGTCCAA GTCCAAGAAA 
AAGAATCCGT TTGCTACACC AAAAAAGACA ACTGTTCGCA GGAACGCTCT GGAAGTCTAT
CCCACCCACA AAGGTTTGAA CAAACTACAA TACATAGGTG ATAAACCTAT AGACCTTCTA
TTCCATGACC TCGAAGTAAA ACTCGAAGGA AATTACCAAG ATCTCACTAT TGATGTGTTA
TACCAACGTC TTATTCAGGT GAAAGTGGAG GAAGCCGATG ATGTAGACTA TATGAAGAGA
TTCCAAGTCT TGGAATATTT ACTTGACAAA TTGATTGAAA TCCAGAATTT ATCAAACGAG
AACGATCTCA AAGACAAGAA TTTGATAAAA ATCTCACTTC ATGATATACG AACCTTCAGC
AAGGTTGTGA ATTTGATTAT TGTGCATGGT GTCTATCCTG CTATTACTGC GTTTAAAATT
GGGATACCGT TTGAAAAGAG AAGGTTGAAC CATTTCAATG TCAGCATGGG CAAGAACCCG
GTAAAAATCG ATAAAATACC TATAAATTCG AAGCTGTCGA CTCCATTTGA ACGTATACAG
AAACTATTGA TGTTGATGTA TACCAAATTG TACGTGGTCT TTCAAGTACA ATCGGACGTC
AAGGACTTGC TTAGCAAAGG TACTGGAATA TCAGACTTTC TCACTATTGC AATTACATTG
ATCACAGTTC CATATTTTTC GAAAGATGTG ATAGCGAAGG TTCTTTCTGA TTTTCCTAAC
ATCATAAAAT TAGTAGAAAC ATACGAATTG TACCAGACAT ATACACTTCT TTTATCGACG
CAGTCACCTT CATACTTTAA GCTGTTTGTG ATGCAGAAAC TTCTGACCAT CCACTACGAT
ACACCCACTG GAGTGTTAAC TCTCATTGAA TTCGTTCTTG GATTACGTGA TAATGACGAA
ATAGAAGTGG AAAAGTACGA ACATGTGTCG AATGTAGTCT TGCTGAAGCC GAAGAGCATA
TCCACGGTAG ATTACTTTAC AAACATTGGG AACCAATGCT ATAATCTTCT TGTAAATATT
AACAGACCAA TGGTTACCAG CTGTGTGGTT TTCATCTTGG AGAATCTCTG GAATCGGAAC
CAAATGGTCA CCAGAGACTT CTTCTTGAAA CGGATTTGGA ACAATTTCAG CCCACCAAAC
AGCAATTCAG ACGAGATATT GGTTACTGAA GCTCAACTCA ACAATAATGT CAATGTGTTG
ATTTCATTGA CCAAGAAGGG CTTACCGGTG GAGTTGTTGA AGGTGGTTTT TGAGCCAATT
ATTCTTTCAG TGTGGTCATA CTTGAATTTC CTTAAGAAAA ATAAAAAGTC CACTGAAATC
ATAAGTGGTA TACTTGTAAG TTACTTCACG ATGGTAAAAG ACTCAGAAAT AGAAACAAAG
GACGTTTATG GGTTGGATGC AATCGCAAAG AATTTATTGT ACGATGCTGA AGATCACGAA
TTTGCCATTG GTCCGAATGA ATTAGTACAG ATTCAGAGAA AACAGAGGAA AATTGAAAAT
TCAAGCAAGG ACCAAAAAGT GAACATGTTT ATTTCTGAAC TAGACATAAG TTGTGAAAAT
TTTGTGGCTC TTTTGGATAA TTTGGATGAC GATTTAGTTC AAGCTATATT CCTCAGTACT
CTCAAGCGGT GGTTACGTAG CGGAGACAGT TCAAATGGCA ACGAAAACCC CTTTATTGTT
CTTATTGATT TACGATTGCT TGAGTCCATT GGAAACAAAT TCAAGGATAG TTTGGCCAAG
ACTCCATTCG AAGTACTTCA GATAGTGCAA AACTTTTTAT CTCCCCAGGC AAGAGAAAAG
TTGCAAGTTG AACATGTCAA CTTGGTTTCA CAGAGCGGTG ATGTGGATTC AGATGACGAG
GATGATTTTG ATGAAAATGT CGAGGCTCAA GCACTTCCAA TAGTGTTGGA GCTTTTGTCA
GCAATTCTTT CCGAGACTGA AGTTTCTTTG GACGAGAAGT CATTCGAAAG TTTGCGCGCT
ATTCAGAAGT CGTTGGCAAG ACTCTCGGCT ACAGATGTTC CATCTAGTAT CAAAAGTGCG
TCGACTTCAC TCAACGAAAG AATCGATGAC TTATTGAACG GAGACATACC AGTTCAGAGT
GAAGAGGAAG CTGAAAAGGC TGACCTAAAG CGTGCAGTCA CCAGTCTCAA CGATCCGCTA
GTCCCAATTA GAGCTCATGG GCTCTATTTG CTTAGGCAAC TTATTGCAAA TAGGAGTAGC
GTGATATCTC TCGAGTTTGT AGTGGATTTG CATTTGGTTC AATTAAAAGA TCCTGATCCC
TTCATCTTCT TGAACGTCAT AAAAGGACTT GAGAATTTGA TTGAGTGGGA CGAGAAGCGT
ATGCTTCTGA TTTTGTGTGT TTTGTACTTG AATGAGTCCA AAGAAACCGA TCTTGACGAG
AGATTAAAAA TAGGAGAGGT TTTGTTGAGA TACATACAAG GTGCCAACGA GATGTTTTCC
GGAGAGTCTG CAAAGAGAAT TGTCAGTACG GCATTACACT TGATAAGAAG AAAAGTACCC
GAGGAAGAAA ATGAAGACAA CCGATTGAGA ATGTCTAGTA TGTCGTTGTT GGGTACTTGT
TGTAAGGTCA ATCCACTTGG TATTGTTGAC CAATTAGAGA ATGCATTGGA CTGCGCACTT
GGAATTTTAC AATTTGAAAC TGATAAGGAT AGTGCCATCA TGCGTCGTGC TGGTATAGTA
TTGATCCACG ATTTGATAAT TGGTACTTCC AACCAAAAGG AAGTACCATT TCCGGAAAGT
TATAGATTCA AAGTGGTCAA TACTTTGCGC TACGTTAAAG ATACTGACAA TGATATTTTG
GCTAGAGAGC AGGCAGAGAC GGTGTTGGAT TCTATAGAGG AATTGTCCAG TCTTGCCTTT
GAGCAGCTCG AAGAAGACAG TGAAGATCAG TTCAAGTCCA TGAGAGTATA G
 
Protein sequence
MPPKIEELPP KVENVSKSKK KNPFATPKKT TVRRNASEVY PTHKGLNKLQ YIGDKPIDLL 
FHDLEVKLEG NYQDLTIDVL YQRLIQVKVE EADDVDYMKR FQVLEYLLDK LIEIQNLSNE
NDLKDKNLIK ISLHDIRTFS KVVNLIIVHG VYPAITAFKI GIPFEKRRLN HFNVSMGKNP
VKIDKIPINS KSSTPFERIQ KLLMLMYTKL YVVFQVQSDV KDLLSKGTGI SDFLTIAITL
ITVPYFSKDV IAKVLSDFPN IIKLVETYEL YQTYTLLLST QSPSYFKSFV MQKLSTIHYD
TPTGVLTLIE FVLGLRDNDE IEVEKYEHVS NVVLSKPKSI STVDYFTNIG NQCYNLLVNI
NRPMVTSCVV FILENLWNRN QMVTRDFFLK RIWNNFSPPN SNSDEILVTE AQLNNNVNVL
ISLTKKGLPV ELLKVVFEPI ILSVWSYLNF LKKNKKSTEI ISGILVSYFT MVKDSEIETK
DVYGLDAIAK NLLYDAEDHE FAIGPNELVQ IQRKQRKIEN SSKDQKVNMF ISELDISCEN
FVALLDNLDD DLVQAIFLST LKRWLRSGDS SNGNENPFIV LIDLRLLESI GNKFKDSLAK
TPFEVLQIVQ NFLSPQAREK LQVEHVNLVS QSGDVDSDDE DDFDENVEAQ ALPIVLELLS
AILSETEVSL DEKSFESLRA IQKSLARLSA TDVPSSIKSA STSLNERIDD LLNGDIPVQS
EEEAEKADLK RAVTSLNDPL VPIRAHGLYL LRQLIANRSS VISLEFVVDL HLVQLKDPDP
FIFLNVIKGL ENLIEWDEKR MLSILCVLYL NESKETDLDE RLKIGEVLLR YIQGANEMFS
GESAKRIVST ALHLIRRKVP EEENEDNRLR MSSMSLLGTC CKVNPLGIVD QLENALDCAL
GILQFETDKD SAIMRRAGIV LIHDLIIGTS NQKEVPFPES YRFKVVNTLR YVKDTDNDIL
AREQAETVLD SIEELSSLAF EQLEEDSEDQ FKSMRV
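
The record lists translation table 12, which in NCBI's numbering is the Alternative Yeast Nuclear Code: CTG is read as serine rather than leucine. The following minimal sketch translates the first 120 bases of the gene sequence with that table and reproduces the first 40 residues of the protein sequence. It is an illustration, not the method the database used; the helper names are hypothetical, and the codon table is built from the NCBI amino-acid string for table 12 with bases ordered T, C, A, G.

```python
from itertools import product

# NCBI translation table 12 (Alternative Yeast Nuclear Code).
# Codons are enumerated TTT, TTC, TTA, TTG, TCT, ... (base order T, C, A, G).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"  # CTG -> S here; the standard code has L
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), TABLE_12_AAS)
}


def translate_table12(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two lines of the gene sequence, copied from the record.
first_120_bases = """
ATGCCTCCTA AGATCGAGGA ACTTCCTCCG AAAGTGGAGA ATGTGTCCAA GTCCAAGAAA
AAGAATCCGT TTGCTACACC AAAAAAGACA ACTGTTCGCA GGAACGCTCT GGAAGTCTAT
"""

peptide = translate_table12(first_120_bases)
print(peptide)  # MPPKIEELPPKVENVSKSKKKNPFATPKKTTVRRNASEVY
```

The serine at position 37 comes from a CTG codon; under the standard code the same codon would give leucine, so the choice of table matters when re-deriving the protein from the gene sequence.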