Gene PICST_85730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_85730 
Symbol 
ID4840916 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp576114 
End bp579066 
Gene Length2953 bp 
Protein Length871 aa 
Translation table12 
GC content40% 
IMG OID640392231 
Productpredicted protein 
Protein accessionXP_001386506 
Protein GI150866793 
COG category[L] Replication, recombination and repair 
COG ID[COG0249] Mismatch repair ATPase (MutS family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.144196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTTTGGC AGTGTTCAGA TCGAATACAA GAGTGCCGAG GAACCTGGTT CTTTTTTTTT 
TTAGACGAAT ATCGCATTCA TGTCTGGTAA GCTTACCTTC CATATCCAAG ACACTCCAAG
CACAGACTGC TAGAAACGAA GAACCCATAG CTGAACAACT TTTAGGAGCT ACACTTGACT
GGGACCGTGG AGGACGAACA AAAAAGAATG GCTTAACTCC TCTATATTCA CTGATGAAAA
AAATCATCGA TCTGAACCAA GGATGCGTTT GTTTAATTCA AGTAGGAAGT TTCTATGAGC
TCTATTTTGA GCAAGCTCTG GACTACGGTC CCCAATTGGG ACTCAAAGTT TCCACAAGAA
AAACCAACAA CTACACCATA CCTATGGCAG GGTTTCCTAC TTATCAATTG CAGAAATTTG
TTAAGATTTT GGTGCAAGAC TTGGGTGAAA ATGTAGCCAT TATAGACCAA TTCCCCACAA
GAAAGATCTC AGAAACGATA ATACACCGCA AAATATCACG AATTGTTTCC CCAGGGACTC
TTGTAGATGA AACATTCATG AACTATAACC AGAACAACTT CTTGCTAGCA ATCTCGTTTC
CTGCTAATTG TACAAAGGTT CCAGCCGATC CTGAGACTGC GGTAGGACTC TCATGGATTG
ATGTAAGTGT TGGGGAGTTC TATGTTCAGA ACACAACTTT GGGGAATATG ATTTCTGACA
TTTCCAGGGT TAATCCCAGT GAAATCATCA TCTCCAAAGA GTTTCAGGAT ATGAACATTA
TCGACGGTAA TTGGTATCCA CCTCTTCAGG AATTGCGTCG GTTCTTCTTA CGCTACCACA
AGACGACATA CAACGATCTG AAGTTGAAGT TCAAAAGTGG GTTACAAACC ACAAGAAAGA
TGTTGGAAAG CTTTACCGTT AGAGAAGAGG CAGCAATGAA CATGATCTTG TCGTATATTG
ATGTAAATTT ACCTGAATCC AATCCTTCCT TGGATCACCC CATCACATAC TGGAATCAGA
GTTGTCTACA GATGGATGCC CGAACTCGTG AAGCACTAGA ATTGACCGAA AGATCCACCA
GTGGCAGATC TTCTGTTGTA GGATCTCTCT TGACGACTGT TAAGCGAACT ATCACACCTT
CTGGATCTAG ATTGTTAACT CAATGGATAA AGTCTCCAAT TCTCGATGTT AACGAAATTC
GCCGCAGACA AGGTTTTGTC CAGACTTTCC TTGAGAACCA CCAAGTGACA ACCTCTTTGA
GGTACCAGTT GCTGCAGCTT GGTGACTTTA TTAGGTCGTT ACAAAGACTA GCGTTCGGTG
CAGGTGATAG CGTTACCCAC TTGCTAGCAA TTGCCGATAG CATAGCAAAG TTACAGGAAA
TAGAAGTATT CTTAAGAACA GAACATTCCA ATAACAAAAA GGGATTGAAA ATTTTGGACA
AATTTTTGAA GGAGTTTGTT GTTCCTTCAG ACATTTCAGA AGAGATCATA TCAACACTCC
ATATTCAGAT AAATGACGTT AATTCAAGTT TCCTGGACAT GTCAGAGGAA GTAATAGGAG
AATTCGAAGA TTCTGAAGAT TCTGAAGAGT ATCCACATAT TGATACAGGT TCTTATAGCA
ACAAATCTAT TGACAAGTAT AGATTCGTAC CAAAACCTAA AGGGGAAAGT GTATTCAGCT
TCTCAGTGAG GCGTGATTAT AACAAATCTT TGTTAGACTT GCACAACCTG ATGGATATCT
TGAAAGATAA AGAGGATAAC ATGATATCAG CTGTTAGAGA AGAATTGGGC AAGATCGATC
CAAAACTACT TGTTTCCAAA AAAGAACAAC ATGGAAGGTA TCTGAATATT TTGCACATTT
CTGGTAAACA AAAACTGATA GAAGAGGTCT ATGTTCATCT TGGTAATGAT GTCAGAGACA
AGAAGAAGGC TTCACTCTTG TATAAGCCAA CTGAATGGAA CAACTTGCAG GTGATCATTG
AAGAAAAGAA AGAGCATATA AGAGAATTGG AACGCCAAAT CGTTGATCTG CTTAGACAGA
AAGTGCTAGA TAAAGCATCT GATATCAGAA AAGTCAGTAA GATGGTTGAT TTCTTGGATG
TGACATCTTC TTTTGCAATT TTGGCGGAAG AGAATAACTT AGTATGTCCA AAATTTGTGA
AGACTTCTCT GATTAACATT GAGAATGGCA GACATTTTGT GGTTGAATCA GGCTTGAAGT
CAGTTGGTAA GATGTTTACA CCTAATGATA CCAAGATCAC TTCCAGTGCC AACCTTTGGG
TCGTTTCAGG ACCCAATATG GGAGGTAAGA GTACGTTCTT AAGACAGAAT GCAATTATAG
TCATTCTAGC ACAAATTGGT TCTTTTGTTC CTGCTTCAAA AGCCAATCTC GGAATTGTAG
ATAAGATATT TACCAGAATA GGAGCCTCAG ACGATTTATT CAACGACTTA AGTACTTTCA
TGGTTGAGAT GGTAGAGACT AGCAATATCT TGCGCAATGC AACCTCTCAT TCGTTAGCCA
TTGTTGATGA AATTGGAAGA GGAACCAGTG GAAAAGAAGG ACTAGCGCTT GCATATGCTA
CTTTGTACAA CTTGTTGCTG GTCAACAAAT GCCGTACACT CTTTGCAACT CATTTTGGTA
AGGAATTAGA GCAATTATTG AAAGCAAATA AAGTGAGCCA AAGTAAGATA CGCTACTTCC
GTACCAGAGT AATTCAAGAT GATGACGACA AAAACCCCTC AGGACTTGGT CTTGTCATAG
ACCATACTTT GGAAAAGGGT ATCAGCGAGA GATCGTATGC ACTTGAAGTG GCCCAGATGG
CAGGATTCCC GCCAGAAGCA TTAAAAAATG CTCGAATGGC ACTTGATCTA CTAGATTAAG
ATTTAGACAG ATAGCTTAGC ATATACCATA TAGCATACAT GCCATCAGAG CATATATTTA
CAGAATCTAT AAT
 
Protein sequence
MKKIIDSNQG CVCLIQVGSF YELYFEQASD YGPQLGLKVS TRKTNNYTIP MAGFPTYQLQ 
KFVKILVQDL GENVAIIDQF PTRKISETII HRKISRIVSP GTLVDETFMN YNQNNFLLAI
SFPANCTKVP ADPETAVGLS WIDVSVGEFY VQNTTLGNMI SDISRVNPSE IIISKEFQDM
NIIDGNWYPP LQELRRFFLR YHKTTYNDSK LKFKSGLQTT RKMLESFTVR EEAAMNMILS
YIDVNLPESN PSLDHPITYW NQSCLQMDAR TREALELTER STSGRSSVVG SLLTTVKRTI
TPSGSRLLTQ WIKSPILDVN EIRRRQGFVQ TFLENHQVTT SLRYQLSQLG DFIRSLQRLA
FGAGDSVTHL LAIADSIAKL QEIEVFLRTE HSNNKKGLKI LDKFLKEFVV PSDISEEIIS
TLHIQINDEV IGEFEDSEDS EEYPHIDTGS YSNKSIDKYR FVPKPKGESV FSFSVRRDYN
KSLLDLHNSM DILKDKEDNM ISAVREELGK IDPKLLVSKK EQHGRYSNIL HISGKQKSIE
EVYVHLGNDV RDKKKASLLY KPTEWNNLQV IIEEKKEHIR ELERQIVDSL RQKVLDKASD
IRKVSKMVDF LDVTSSFAIL AEENNLVCPK FVKTSSINIE NGRHFVVESG LKSVGKMFTP
NDTKITSSAN LWVVSGPNMG GKSTFLRQNA IIVILAQIGS FVPASKANLG IVDKIFTRIG
ASDDLFNDLS TFMVEMVETS NILRNATSHS LAIVDEIGRG TSGKEGLALA YATLYNLLSV
NKCRTLFATH FGKELEQLLK ANKVSQSKIR YFRTRVIQDD DDKNPSGLGL VIDHTLEKGI
SERSYALEVA QMAGFPPEAL KNARMALDLL D