Gene PICST_68406 details


Gene Information

Locus tag: PICST_68406
Symbol:
ID: 4840512
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 1078049
End bp: 1081148
Gene Length: 3100 bp
Protein Length: 999 aa
Translation table: 12
GC content: 40%
IMG OID: 640391827
Product: predicted protein
Protein accession: XP_001386399
Protein GI: 150866717
COG category: [L] Replication, recombination and repair
COG ID: [COG0249] Mismatch repair ATPase (MutS family)
TIGRFAM ID:
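
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the coding region ends in a single stop codon; the function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 1078049
END_BP = 1081148
PROTEIN_LENGTH_AA = 999


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range."""
    return end - start + 1


def cds_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_aa + 1) * 3


gene_length = span_length(START_BP, END_BP)      # reported Gene Length
coding_length = cds_length(PROTEIN_LENGTH_AA)    # coding region incl. stop
flanking_length = gene_length - coding_length    # bases outside the CDS

print(gene_length, coding_length, flanking_length)  # 3100 3000 100
```

Under these assumptions the 3100 bp gene length matches the coordinates, and 100 bp of the listed gene sequence lies outside the 3000 bp coding region.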


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.094593
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AAAGAACCAA TCTTCTTCTC TATATCAGCA ACGATGTCCA GTACTCGTCC CGATCTCAGA 
TTCACCGATA CGGTAGACGA ACGGCTGTTC TACCGTAAAT ATGCTGGCCT TTCTACCAAA
GATGCCTCTA CTATCAGATT CATTGACCAC AACAACAAGG ACTATTTCAC TGCCCTCGAC
GAAGACGCTG ATCTTGTAGC AGAGAACATT TACAAAACAC AATCTGTACT CAAATATAAC
AACAGCAACA AAAACAGATA CGTCACAATC CTGCCTCAGG TATTCTTGAA CAATGTGTTG
AAGTTCTGTA TCATCGACAG ACATATGAAG GTGGAGATAT ATCACAACAA AACGTTCCAA
TTGCTCAGCA CAGCAACGCC GGGTAACTTG GAAGCTCTTG CCAACGAGTA TGGTGTCAAT
TTGGAAGGCA TGTTTCAGGA CTGTTCGACT CCCATGGTGG CTAGTATTAA GTTTCAACAA
ACAGGAAGCG CTAGAAAAGT TGGGGTTTGC GTCATAGACA CACTGAACTC GACTATTCAA
GTATCTGAAT TCGAAGACAA CGACCTCTTC TCTAACCTTG AAAGTTTATT GTTACAATTG
GGTGTAAAAG AAGTCGTGTT GCCTTCGAAC TACAGTGCCA AAGATGAGAA TACCGAATCA
ATAAAGCTCT TTCAAGTCTT GGATAAAATC GGTTACCTTG TCGTCAGCTC CGTTAAATCG
TCATTCTTTA CAACTAAAGA TATTGAGCAG GATTTGCGCA AGTTGGTGCT GTCTGAAAAC
CAAAAAGATG ATGATGATGT CAACGTCGAT TTGTTGTTGG CTTCCAAAGG AATCAATACG
GCCGACTTCG CCCACTCGCT TGCTTGTTGC AACGCATTAA TTGCATACCT CCAGTTGTTG
CTGGACGATG TACAGAATTC TTTCACTATA GAGCAGTACA ATTTGAGTTC ATACATGAAA
TTGGACTCTT CTACAATGAA GGCTTTGAAT ATTTTCCCAT CTTCGAATTC TGGAGTTTCC
AATGCTCTTG TTAAATCTTC CAATATCAGC TCGATCTTTG AGTTATTGAA CAAGTGTAGA
ACTGCGGCAG GTTCTAGATT ACTTTCTCAA TGGTTGAAAC AGCCCCTCAC TAGCTTGTCC
ATGATCGAGG AAAGGCTAGA TTTGGTGAAC TATCTCGTGG ACGGTACCAA CTTCAGAGTA
TATGCCAACC AAGAGTTCTT ATCGCAGGTA CCTGATATAA GAAGACTTTT GAAAAAGATT
AGCAATGGTT TGTCAAAGTC TACTGGCAAC GAAAATAAGA AGTTAGAGGA TATTGTCGTA
TTGTATCAGT TAGTATTGGC TTTGCCTGCA TTCATTGACA TGAGTAAAAT GGTGATTGCT
GATATCGAGG AAAAGGATTC ATTGCCGGTA GCAAATTTGA TAAAGAAGCA TTGGCTCGAG
CCAGTCGAGA AGAGTCTTGA ATCCTTATCA AAATTCCAAG AGATGATCGA GACAACCATT
GACTTATCGC CATTAGAATC AAGTTCTGCT TATGACCAAT TGCATTCTGA TTTCAATGTG
AGACCAGAAT TTGACGAGTC CCTTATTGAA ATTAATGATA AATTACAAGC CAGTCTTGCT
GAAATAAAAC AATTACATAT TGAAGTTGCT GACGACTTGA ATATGGAATT GGACAAGAAA
TTGAAGTTAG AGAAACATAT ACAACACGGT TGGTGTTTTA GAGTTACCAG AAATGATTCT
ACCGTCTTGA GAAATACCGG TAACAAATAT TCTCAGCTCC AGACTGTTAA GGCTGGTGTC
TTCTTTACCA CCAAGAGATT AACTTTGCTA TCCCAGGAAT ATGCAGAGGC TCTTCAAGAA
TACAATACCA AACAGCGCGA GTTAATTAAG GAGATATTGT CCATTTCTTT GTCATATCAA
TCGGTTTTTA TGAACTTGTC ATTGACGCTT GCACATTTGG ATGTATTAGT CAGCTTTGCT
AATGTGGCAA TAGTGGCACC AACCGTATTT GCAAGACCGA AGTTGCATCC ATTGAGCAAT
GATATTGATT CGGACCAATT CAAGAATAGA AAAATCAAGC TAAGAGAAGC CAGACATCCT
GTATTGGAAG TACAAGATGA CATTAATTTC ATTGCCAATG ATGTCTTTTT ATCAAACGAT
GCATGTGACA AAGGGAAGCC TTTTGTTATC ATAACTGGTC CAAATATGGG TGGTAAGTCA
ACATACATAA GACAGATTGG TGTTATTGCC TTGATGGCGC AAATTGGATC ATTCATCCCT
GCTAATGAAG ACGATTTTCC AGAATTGCCC ATCTTTGATG CTATCTTATC AAGAGTGGGA
GCTGGAGACT CCCAGCTTAA GGGTTTATCT ACTTTCATGA TCGAGATGTT GGAGACTTCG
TCCATTTTGG CCACAGCAAC ACAAAACTCG TTGATTATCA TCGATGAGTT GGGAAGAGGT
ACTTCTACTT ACGATGGTTT TGGATTAGCT TGGTCAATTC TGGAACACCT CATTAAAGAA
AAAAGCTGTT TCACGTTATT TGCAACCCAT TTTCACGAAT TGACTCAATT ATCATCCAAA
TATGAGGACA AAGTTGACAA CTTACATGTT GTTGCCCATG TAGAAAACAA AGATGAAAAT
GACGATGACA TCACTTTGAT GTACCGTGTT GAACCAGGAG TATCCGACAA ATCGTTTGGT
ATTCATGTTG CTGAATTGGT TAAGTTTCCA TCGAAGATTA TCAACATGGC GAAGAGAAAA
GCTTCAGAGT TGCAAGATAT GAATGTTACA GAGGAAGACA AGTTTATCCA GAACAAAAAA
ACGAAGTGTT CTGCCGAAGA GATTGACCGT GGAGTTGACA CCTTGAAGAC GATCTTGAAG
AAGTGGAAAG ATGAATGCTA TGATCCTGAG ACATCCAAGA GTCGCTTTGA AAGTGGCGAG
GCAGTCAACA AGCTCAAGCA ATTGGTCGAA GGTGAGTTTT CAGGGGTGGT TGCGAACGAT
AAGTTCATAA ATGAAGTTCT TACGGCATTG TGAAGTGGTA TTGTGATATC TAGTGTAGAA
ATTGAACTAT TATTATCATA TAAATAAACG AATGAAATGT
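
The GC content reported for this gene can be recomputed from the gene sequence above. This is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases, with the spaces and line breaks of the 10-base display blocks ignored. The function name `gc_content` is hypothetical, and the record does not state how the database computes or rounds its own figure.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G/C bases in a nucleotide sequence.

    Whitespace (the 10-base block separators and line breaks used in
    the record) and any non-ACGT characters are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, pasted exactly as displayed.
first_line = (
    "AAAGAACCAA TCTTCTTCTC TATATCAGCA ACGATGTCCA GTACTCGTCC CGATCTCAGA"
)
print(f"GC content of first 60 bases: {gc_content(first_line):.1f}%")
```

Pasting the full gene sequence into the same function gives the value to compare against the reported 40%.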
 
Protein sequence
MSSTRPDLRF TDTVDERSFY RKYAGLSTKD ASTIRFIDHN NKDYFTALDE DADLVAENIY 
KTQSVLKYNN SNKNRYVTIS PQVFLNNVLK FCIIDRHMKV EIYHNKTFQL LSTATPGNLE
ALANEYGVNL EGMFQDCSTP MVASIKFQQT GSARKVGVCV IDTSNSTIQV SEFEDNDLFS
NLESLLLQLG VKEVVLPSNY SAKDENTESI KLFQVLDKIG YLVVSSVKSS FFTTKDIEQD
LRKLVSSENQ KDDDDVNVDL LLASKGINTA DFAHSLACCN ALIAYLQLLS DDVQNSFTIE
QYNLSSYMKL DSSTMKALNI FPSSNSGVSN ALVKSSNISS IFELLNKCRT AAGSRLLSQW
LKQPLTSLSM IEERLDLVNY LVDGTNFRVY ANQEFLSQVP DIRRLLKKIS NGLSKSTGNE
NKKLEDIVVL YQLVLALPAF IDMSKMVIAD IEEKDSLPVA NLIKKHWLEP VEKSLESLSK
FQEMIETTID LSPLESSSAY DQLHSDFNVR PEFDESLIEI NDKLQASLAE IKQLHIEVAD
DLNMELDKKL KLEKHIQHGW CFRVTRNDST VLRNTGNKYS QLQTVKAGVF FTTKRLTLLS
QEYAEALQEY NTKQRELIKE ILSISLSYQS VFMNLSLTLA HLDVLVSFAN VAIVAPTVFA
RPKLHPLSND IDSDQFKNRK IKLREARHPV LEVQDDINFI ANDVFLSNDA CDKGKPFVII
TGPNMGGKST YIRQIGVIAL MAQIGSFIPA NEDDFPELPI FDAILSRVGA GDSQLKGLST
FMIEMLETSS ILATATQNSL IIIDELGRGT STYDGFGLAW SISEHLIKEK SCFTLFATHF
HELTQLSSKY EDKVDNLHVV AHVENKDEND DDITLMYRVE PGVSDKSFGI HVAELVKFPS
KIINMAKRKA SELQDMNVTE EDKFIQNKKT KCSAEEIDRG VDTLKTILKK WKDECYDPET
SKSRFESGEA VNKLKQLVEG EFSGVVANDK FINEVLTAL
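
The protein sequence above follows translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this on the first 60 coding bases, which in the gene sequence listing begin at the ATG at base 34. It assumes the usual TCAG codon ordering of the NCBI genetic code tables; the names `build_table` and `translate` are hypothetical helpers, not a database or library API.

```python
BASES = "TCAG"

# Amino acids of the standard code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
# Table 12 differs from the standard code only at CTG (index 19): Leu -> Ser.
TABLE_12_AAS = STANDARD_AAS[:19] + "S" + STANDARD_AAS[20:]


def build_table(amino_acids: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))


def translate(sequence: str, table: dict) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Bases 34-93 of the gene sequence listing (first 20 codons of the CDS).
cds_start = "ATGTCCAGTACTCGTCCCGATCTCAGATTCACCGATACGGTAGACGAACGGCTGTTCTAC"

print(translate(cds_start, build_table(TABLE_12_AAS)))  # MSSTRPDLRFTDTVDERSFY
print(translate(cds_start, build_table(STANDARD_AAS)))  # MSSTRPDLRFTDTVDERLFY
```

The table 12 result matches the first 20 residues of the protein sequence listed above. The standard code differs at position 18, where it reads L instead of S.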