Gene PICST_88189 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_88189 
Symbol 
ID4838311 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp371456 
End bp375286 
Gene Length3831 bp 
Protein Length1256 aa 
Translation table12 
GC content40% 
IMG OID640389626 
Productpredicted protein 
Protein accessionXP_001383348 
Protein GI150864510 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.30399 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0222067 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CATGAAATGA GTGTCTCGGA CATAGAAACT GAGCGGACCC GTTATTATGA CGCCTACGAC 
GCTGACGACG CGGAGTTGGA TGGTTTTATC GTTTCCGACG AAGAAAACGA CAATCTCGAA
AATGAGAATC GCAATGAAAG AGATCCAGAT TCACGGAACC AAGACGATGT AGACGAATAC
GGCCAGCCCC AAATAGACAG GATACGAACC ATGACAGAAG CATTGGCTCC AAATGAATCT
CAGGCCCTGA AAGTGTTGAG AGCTCATATT TCCGTTTTGG TGTCTGCTTT GGGTGGTCCA
GATCACACTT CCAATATCTC CCCACCTCCA TATAAACTCG GCCACGATGC TTTGGCATGT
TTGAAAGATA TCAAAAGATG GATTAAATCC GTAGACGACA GACAGGACAG TTATCAGGTA
GCTTTGGCAT GTGCCGAATC AGGATTGGTA ATTAACGACT TGATTGTAAT TCTCTGTCAA
TGGGATTCAA AGAATAAGAA AAAAGAAGAT TTCAGAAACA AACGAACTAT GGAAAAAATT
ATGTTGGCAT GTTTGGAATT GCTCGTTTTA TTGACTTGGT CTGTGGAGTT GCGCCCGGAG
CTGTCAGACA AGCAGAAGTT GCTCTATTAC GATGTGAAGA AAGCACAGAT CAAATACAAG
AGAGCTATTT TAACCTACAA CAAGGGCCAG ACTCTCAAGG CAGTAATTCG CTTGGTTCTT
CCCATAATTT CAAAGGAAAG AGTTGACCGT GAACCAAAAG ACAATGCTAT CATGAGATTA
GTTCTTTTCT TCTTCAGAAA CATTTTGTAC GTAGAGCCCC CCAGTCCCAG TATTTCCAAA
AAGTCATCCA AGACAATTGT TGTTACCGAT AATATGCCTG AAGGGGTCTC ATACGATGAC
ATTTCGTTGT CCGCCACAAT TTCTGCATTC AGCAAGAACA GAGTATTGAT GCTCTTTCTC
ACTCTATCCA GTGGAATAGG TGTAGATTTC GATAGTCGGT TTCTCGGCCC TACTCTTTTG
GAATGTATCC ATTTGTTAGT CAGAGGTGTC GATCCAAATG ATATCTTGAA ACTAAAGCAA
ATGCGCATTC CAACAGAGAT CAACGACCCC GGCAACTCCT CACTGCCATT CCACAATGTA
CCACCAGCTT CATCTACAAC AGGCTTGCAA TTGCAAGATC TTCTAGCAAA AGAATCCAAG
ATCAAAAATA GTCAAACGCA GAGTATGTCC ACAAGACATG GCCGCTTCGG ATCGTTGCTT
TCCATCAGAG GTGATCATTC AATGTCTTTC GTAGTTTCTG GACAAGAAGC ATTGATCAAT
GCTGGACAGA CAATGCAAAA ACTTGATCGT TCTAAGAAAT GGAAGAATCG TTCATATTTC
AAATACGATT CGGACGAGTA CACAAAGTCA TCGAACACTA CCTATATGAA CTATGGGGGC
CTCGTCATCC TTCATGAGTT TATCGAACTG TTCTTAGCTG GCGGTTGCTT TAACATTTTA
ATAGAAAAAT TATCGTCTGT TTTCTCCAGT TCAGACTCCA TTCTTGAAAA AGAATACGAA
ACTGCTACAT TTTTCTTGAC GATTGCATGG TTCTTCCAGT ATAAGCGAGA AAAGACCGTT
CTTTACTCTT CTGGGACAAC TAACTTGCAA CCAATAGGTG AAGAAGATGA CAGGAACGAT
TTTGGCTCTG TTGGTGCAGC TCTCAGTCAA GTGAATTTTA TCTTGCTTGT AAAGTACTGT
GTTGATTCTT TCAGTATTTC CCCTAAGCGA TGGAGTTCAC TTCATGTAGT GTTGATCTGT
TTGAAAGAAC TTTTGGAGAT TTCTAATACG CTCTTTACTA GATCTTCGAG TTCGGCAGGG
GACGAGGAGC AGAACGAGTT GGATCGAGAG TTGGCAGAGG GAATTATAAG TCACTTACTC
GTAACACAGG ATTTCCTTTC TATTCTTTAT CATTTACCTC AGACTGCTTC TAGGCATTCT
CCTGAGTACC TAAAGGTTTG TATTTCTGTG GTTCACATCT TGCTCAAGAC ATTAAAGAAT
TTTGCCGAAG AGGATGTTAA GTTATTTATA CAAACAACTA GAAGGAACTC GAAGAAAAAA
GCAAATAAAG AATCAAATGA TCAATCCGAC GCTGTTGAGC AAGTAGATGA ATCGGATACT
GAAGATAAAC GTACTCATGC CCGAGTCACC AGAGAAAGAA AAATCAACTA CGAGCGCACA
GAGGTTAAAT TCTTCCACCA AGACACTGTT TCAACCTACA TTGAGTATTT ATCGCGATAC
GAAGATCTTA CCCACTATGA AATCAAGAAA TGTCTTACTT ATTTCCACCG ATTATTTGTG
GTAAGAAAGG ATTTCAATGG ATTGTACAGA CTAGATTTCA TGCAAGTGCT ACACAAGCTT
CGGGATTACT TACCATCTCA AAGCAATATT CGAGGGCAGG TAGATGAATT CATATACTAT
TTCATGAAGA AATTCAAGGC TAGCATCGAG AGATTCCCCA ACCCTATTGA AATTTTGTTT
CCAAGATTTG AAGATGCAGA ATCTAAAACT TATTTGGCAA CAGGAGAACT ATACATTCAA
ACAGAAAGAG AAATTAGAAG TGCTAATGTC AAAAGATTTG AACCTGGCAA ACCATTGGAG
TTTGTTAGAC ATTTTGAGGA CAACGAAAAG TACAAGATTT TGGTAAGTGC CCTATATGAA
CAAGGACTTC TGAATATGTT GGTTTGTCTA GTGGAAGACT TGCAACGTAT TCATGGAATT
AAGCAATTGG ATGAGGATGT TGATGAAGTG CTCCAACTTA AAGAAGACTT CAGGCAATAT
GTTCTTACGA ATTCTCATTT GCGGTTGTTG TTGAGAACTG TTGGTTTAAT TCCTGGTTAT
TCCTTGAGTC AGGAATGTCA AGTTCCGAGT AAGTTGACAA GTTCAGATAT TGACAGTGCA
ATAACGTTGA TAAAGAAATG GATGGGCTTA CAACCTGCTA CCTTCGAGGA TGGCAAAGAC
CCATCGTATT TCTTGAGAAG CACAGGGGAC ACAGTTGATA GACCAATTGA TTACCTGGAC
AGCGAAGACG ATATTGCATT CGAGATCACA CCAAAAGATA GACCAAACGA GAGCCACTAT
GATATGCTTA CTGAATTGGA TGAATTGGAA CGTGCTATCA GTGGAGCTGA AAATCGTGAG
AAAGGTAAAG CTAGAAAAAG AGGAAATAAG TCCAGCTCCA CTATCAAGAA AAAGCTGTTG
CAATCACGTA GTCGTCGTCC TCCAAGGTTT AATGTGGATT CCGATGATGA AAGTGAATTG
AGAAAAGAAG TGAAGTCTTC AGAATTCGTT CATGATTCTG ACGACGAATC GGACGATGAA
GCTTTCTTTG AAAGAGAGGA GCGTTTGAGG CAAATGTTGA ATTCATCTGG TGGTATCGTT
AATGCACAGC AGTTGAGTGA ATTCAAGAAG GTATGGGCTA GTTTAGAGTC TACAGGTGGA
ATTGCTACAG CTAGTCAAGT TGTTCGTGCA GTTGAATCTG TAGCTAACAT TTCAGATGGT
GTTGATACTC ATAGTCACCA AGACGACAGT CAACTTGGAC GTACTGGACT TACAGTTACT
CAGCCAGAAT CTCAAATTTC AGAATCTCAA CCTTCTGATT CCGAGAACGA AGACTCTAGT
GAAGAAGTAT CTGAGGTCCA AATCCGAAAG AGATTAAGAA TTGAAGATGA TGAAAATGAC
AGTGACAATA ATGTGTCTGA GAATTACGTC AGTGCTGAAG AGGGAGAAGA AGCGACACCA
ACCGTTAAAC GTAAGAAGAG GCTAGTTATT AGTGATGATG AAGATGAGTA A
 
Protein sequence
MSVSDIETER TRYYDAYDAD DAELDGFIVS DEENDNLENE NRNERDPDSR NQDDVDEYGQ 
PQIDRIRTMT EALAPNESQA SKVLRAHISV LVSALGGPDH TSNISPPPYK LGHDALACLK
DIKRWIKSVD DRQDSYQVAL ACAESGLVIN DLIVILCQWD SKNKKKEDFR NKRTMEKIML
ACLELLVLLT WSVELRPESS DKQKLLYYDV KKAQIKYKRA ILTYNKGQTL KAVIRLVLPI
ISKERVDREP KDNAIMRLVL FFFRNILYVE PPSPSISKKS SKTIVVTDNM PEGVSYDDIS
LSATISAFSK NRVLMLFLTL SSGIGVDFDS RFLGPTLLEC IHLLVRGVDP NDILKLKQMR
IPTEINDPGN SSSPFHNVPP ASSTTGLQLQ DLLAKESKIK NSQTQSMSTR HGRFGSLLSI
RGDHSMSFVV SGQEALINAG QTMQKLDRSK KWKNRSYFKY DSDEYTKSSN TTYMNYGGLV
ILHEFIESFL AGGCFNILIE KLSSVFSSSD SILEKEYETA TFFLTIAWFF QYKREKTVLY
SSGTTNLQPI GEEDDRNDFG SVGAALSQVN FILLVKYCVD SFSISPKRWS SLHVVLICLK
ELLEISNTLF TRSSSSAGDE EQNELDRELA EGIISHLLVT QDFLSILYHL PQTASRHSPE
YLKVCISVVH ILLKTLKNFA EEDVKLFIQT TRRNSKKKAN KESNDQSDAV EQVDESDTED
KRTHARVTRE RKINYERTEV KFFHQDTVST YIEYLSRYED LTHYEIKKCL TYFHRLFVVR
KDFNGLYRLD FMQVLHKLRD YLPSQSNIRG QVDEFIYYFM KKFKASIERF PNPIEILFPR
FEDAESKTYL ATGELYIQTE REIRSANVKR FEPGKPLEFV RHFEDNEKYK ILVSALYEQG
LSNMLVCLVE DLQRIHGIKQ LDEDVDEVLQ LKEDFRQYVL TNSHLRLLLR TVGLIPGYSL
SQECQVPSKL TSSDIDSAIT LIKKWMGLQP ATFEDGKDPS YFLRSTGDTV DRPIDYSDSE
DDIAFEITPK DRPNESHYDM LTELDELERA ISGAENREKG KARKRGNKSS STIKKKSLQS
RSRRPPRFNV DSDDESELRK EVKSSEFVHD SDDESDDEAF FEREERLRQM LNSSGGIVNA
QQLSEFKKVW ASLESTGGIA TASQVVRAVE SVANISDGVD THSHQDDKSQ PSDSENEDSS
EEVSEVQIRK RLRIEDDEND SDNNVSENYV SAEEGEEATP TVKRKKRLVI SDDEDE