Gene PICST_89086 details

Gene Information

Locus tag: PICST_89086
Symbol:
ID: 4838401
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 215562
End bp: 218096
Gene Length: 2535 bp
Protein Length: 736 aa
Translation table: 12
GC content: 43%
IMG OID: 640389716
Product: predicted protein
Protein accession: XP_001384344
Protein GI: 150865217
COG category: [L] Replication, recombination and repair
COG ID: [COG0323] DNA mismatch repair enzyme (predicted ATPase)
TIGRFAM ID: [TIGR00585] DNA mismatch repair protein MutL
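
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, and the function name is illustrative only.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
gene_length = span_length(215562, 218096)

# Assumption: 736 aa excludes the stop codon, so the coding part is 737 codons.
coding_length = (736 + 1) * 3

print(gene_length)                  # 2535
print(gene_length - coding_length)  # 324
```

Under those assumptions the computed span matches the listed gene length of 2535 bp, and the 324 bp not accounted for by the protein is consistent with the gene being marked as spliced.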


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCTATTC AACGTCTCTC AGAGTCTGTG ATCAATAGGA TCGCAGCTGG GGAAATCATC 
ATCCAACCAG TCAACGCTTT GAAGGAAATG CTAGAAAACT CCATTGACGC CGGAGCTTCT
TCTATAGATA TTGTTGTCAA AGACGGCGGC ACCAAATTGC TCCAAATCGC TGACAACGGC
CATGGTATTG CCAAAGAAGA CCTACCGTTG CTCTGCGAAC GTTTCGCAAC TTCGAAGTTG
TCCAGGTTTG AGGACCTAGA ATCGATCCAG ACGTACGGAT TTCGTGGGGA GGCACTTGCT
TCCATTTCCC ATATTGCTCG GTTATCTGTT GTGACGAAAA CAGCCACTTC TGCTGTCGCA
TACAAAGCGT TTTATGCGAA TGGCAAACTT TCAGGACAGA ACTTCAAGTC TTCTGCCAAC
ACAGAGCCCA AGCCCGTTGC CGGTAAGGTC GGAACCCAAA TCACAGTAGA AGACCTTTTC
TATAATCTCC CACAACGTCT TAAGGGTCTC AAGTCGAAAA GTGACGAATT TTCACGCATT
CTAGACGTAA TTGGCAGATA CGCTATCCAT TGCAAGGATG TAGGTTTTAG TTGTAAGAAA
CACGGAGAAC CTTACCAAAT TCTTCTGACG CGTGCCCAGT TACCTATCAA AGAAAGAATA
CGAACGATAT TTGGCAATTC AATCGCTACT GACATCTTGG AAGTAGATTT GGATACGAAT
ATTGAAAAGG AATATGGCAC AGACAATCTG AAGTACGGCT TGATTTCAGT CACGGGTGCA
ATTACAAACT CCAACTACAA CAATAAGAAA CGGATTCCAC CAGTTTTCTT CATCAATAAT
AGACTTGTGG CGTGTGAACC GTTGAAACGA GCAGTTTCTG GAGTATATCA ATTCTTCTTA
CCCAAAGGCT CGTACCCGTT TATCTACTTG AGTTTGCAAA TTGATGCCCA GAATGTCGAT
GTAAATATTC ACCCTACGAA GCGAGAGGTG AGGTTTTTAC ATGAAGAGGA GATCATAGAG
TTGATTGTAG ATAAGGTCCA TTTGATACTT TCTAGCGTAG ACACCTCACG AAAGTTCAAG
ACCCAGACCA TTCTTTCTAA TACCGGCACA GCCAAAAGAC CCATTGATGA GTTTTCGGCT
CTATCAACAC AATCACAGAA GAAATATCGA CAGGAGAATA AACTCGTGCG AGTGGATAGG
CAGCAAACCA AGCTATCGGC GTTCATTGCA GGGCAGTCGG AGACAAGTTA CAAAGAGAGT
ATTCTCAAAG AAACAAAGAG AAAAGAAGAC AAATCCAACG AGCAGATAGT GGAAGAATTG
GAAGAACTGG ATAAAGAAGT TGATGAAGCA GAAGTTCTTG CAGAAGAAGA AGAAGAAGAA
GAAGAAGGCG AAGGCGAAGA GGAGAAAGAA AATGAAGATG ACGAAGACGG CGACAAAGAT
TTACAACACA ACCATGACGG CGAATTCGTC GTAGATACCG TTACAACAGG AACCATAGAT
GAACACAGTA AAACCAAGGA TACAGAGACC ACAAATACGT CAGACATCGA CACAAAGGTC
ACCACCAACT CCAGAAGAAG AGTGAGGGTC TCTTTGGATA GTATTATTGA ATTGAGGAAA
CAAGTTAATG AGGAAGTCCA CAGACCACTC ACCGACATAT TGAATAATGC CGTTTATGTT
GGCATAGTAG ACGAAGAGAA ACGGTTATGT TGTTTCCAAT ACGACGTCAA ATTGTATCTC
TGCGACTACG CTTCACTTCT TCATGAGTTC TACTATCAAG TGGCACTTTA TGAATTCTGC
AACTACGGAG AAATCCTTCT ATCTGAGAGC ATACCGTTGG AGGACATTCT TTCACCACTC
TATGCAGAAG AAAGAGAGAA GACACTTATA GACAAAGATA CCATCATCGA CACCATATGG
GCCATGAGGA ATATGTTTGC CGAATACTTC AGAATTGGAT TTGTAGAGAA CTCCAAGGGA
ACAAAGTGTT TACAATCATT GCCAATGTTG GTTAAAGACG TCAAACCTGC ATACCCTAAG
TTACCGTACT TTATCTACAG ATTGGGGAAC CGCATTAACT ACAATGATGA GAAGGAGTGT
CTTGGAGGCA TCATGAGACA GATTTCACTT CTTTACGTGC CTGAGCCTAT TTTTAGTGGT
TCTAGTGATC CACCTAGTGA TCCATCTAGT GATCCATCTA AAGCAGATAC AGAGTGTGGG
GAAGATCCAC CTACCACTTC TGATAATTCT ATCGTCAGCA AAGTTGGATC CGCGGATAGT
TCTACAGTTG ATTCTACTAC TTCTACAGCT AGTTCTAGCA GCGAGGCTCG ACAATGGCTC
GACCACACTT TGGAAGACGT TTTGTTTCCG CAGATCAAGA CTCGGTTTCT TGCACCCAGC
CAGCTAATGA AGGATGTGGT CCAGATTGCC GATCTTCCTG GGTTGTACCG AGTATTTGAA
CGGTGCTGAA TTCATAGAAC AACAAAATAG GCAACTGCTA TGTACATTGT ATAGAAGTAG
AACCGATAAT TACCT
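
The sequence is printed in blocks of ten bases, so the spacing has to be removed before any calculation. As a minimal sketch (helper names are illustrative, not part of the database), GC content could be computed as follows. Only the first line of the gene sequence is used here, so the result is not the 43% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence only, copied from the record.
first_line = "ATGCCTATTC AACGTCTCTC AGAGTCTGTG ATCAATAGGA TCGCAGCTGG GGAAATCATC"
seq = clean_sequence(first_line)

print(len(seq))                   # 60
print(round(gc_percent(seq), 1))  # 46.7
```

Passing the full gene sequence through the same two functions would give the whole-gene value to compare against the GC content field.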
 
Protein sequence
MPIQRLSESV INRIAAGEII IQPVNALKEM LENSIDAGAS SIDIVVKDGG TKLLQIADNG 
HGIAKEDLPL LCERFATSKL SRFEDLESIQ TYGFRGEALA SISHIARLSV VTKTATSAVA
YKAFYANGKL SGQNFKSSAN TEPKPVAGKV GTQITVEDLF YNLPQRLKGL KSKSDEFSRI
LDVIGRYAIH CKDVGFSCKK HGEPYQILST RAQLPIKERI RTIFGNSIAT DILEVDLDTN
IEKEYGTDNS KYGLISVTGA ITNSNYNNKK RIPPVFFINN RLVACEPLKR AVSGVYQFFL
PKGSYPFIYL SLQIDAQNVD VNIHPTKREV RFLHEEEIIE LIVDKVHLIL SSVDTSRKFK
TQTILSNTGT AKRPIDEFSA LSTQSQKKYR QENKLVRVDR QQTKLSAFIA GQSETSYKES
ILKETKRKED KSNEQIVEEL EESDKEVDEA EDTETTNTSD IDTKVTTNSR RRVRVSLDSI
IELRKQVNEE VHRPLTDILN NAVYVGIVDE EKRLCCFQYD VKLYLCDYAS LLHEFYYQVA
LYEFCNYGEI LLSESIPLED ILSPLYAEER EKTLIDKDTI IDTIWAMRNM FAEYFRIGFV
ENSKGTKCLQ SLPMLVKDVK PAYPKLPYFI YRLGNRINYN DEKECLGGIM RQISLLYVPE
PIFSGSSDPP SDPSSDPSKA DTESSSSSEA RQWLDHTLED VLFPQIKTRF LAPSQLMKDV
VQIADLPGLY RVFERC
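
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below builds that table from the standard code with this single override and translates two short fragments copied from the gene sequence. Because the gene is marked as spliced, translating the full gene sequence directly is not expected to reproduce the protein sequence; the function names are illustrative only.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(overrides=None):
    """Return a codon -> amino acid mapping, optionally with reassigned codons."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AA))
    table.update(overrides or {})
    return table


# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = codon_table({"CTG": "S"})


def translate(seq, table):
    """Translate in frame 1; whitespace is ignored, unknown codons become X."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(table.get(seq[i:i + 3], "X") for i in range(0, usable, 3))


# First 30 bases of the gene sequence.
print(translate("ATGCCTATTC AACGTCTCTC AGAGTCTGTG", TABLE_12))  # MPIQRLSESV

# A fragment containing CTG, under table 12 and under the standard code.
print(translate("GAAGAACTGGAT", TABLE_12))       # EESD
print(translate("GAAGAACTGGAT", codon_table()))  # EELD
```

The first result matches the start of the protein sequence, and the EESD reading of the second fragment matches the protein where the standard code would give EELD.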