Gene PICST_58454 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_58454 
Symbol 
ID4838521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1071461 
End bp1073890 
Gene Length2430 bp 
Protein Length809 aa 
Translation table12 
GC content37% 
IMG OID640389836 
Productpredicted protein 
Protein accessionXP_001384508 
Protein GI150865337 
COG category[L] Replication, recombination and repair 
COG ID[COG0323] DNA mismatch repair enzyme (predicted ATPase) 
TIGRFAM ID[TIGR00585] DNA mismatch repair protein MutL 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0929212 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATCA AAAGTATTGA TCAGAAGGAC GTTTCAAAGA TAACCTCTGG TCAGGTTATA 
ATTGATTTAA AGTCAATAGT TAAAGAATTG GTCGAAAATT CCATTGACGC AAACAGTACT
AAGATAGAGA TCAATTTCCA GAACTATGGA ATCGATTCCA TTTCTGTCAC TGACAACGGC
AAGGGAATCA AAAAAGAAGA CTTTGAATTC GTTTGTCTTA GGAGTCATAC CTCCAAGATA
TCTGAGTTCG AAGATTTAGA TAAACTATCT ACTTTAGGCT TCAGAGGAGA GGCACTAAAT
TCGATATGTT CTGTATCATC CAAAGTCAAA ATTGTTACAT GTACTGACTA CCCGAAAAAT
CACGAATTGG ATTATGACAA AGCTGGAAAA TTAAGCAAAT CTGTGTCGAA AATCGGAGGA
GGTTTTTCAA AACAGACAGG AACCTCTGTA AGTATCGAGA AGATATTCTT TGACTTACCA
GTTAGACTTA AGAATTTTGT GAAGAATTCC AAGAGGGAAT TTCATAAAGC AATAAATTTC
CTTACCCAAT ACCTATTAAT TTACCCAGAA ATCAAATTCA GTGTTTTCAA TGTTGTGAAC
TCAAAGAAAA GTTTGGCCTT GAGTAGCAGA GGTGGTTCTA AGTCCACGAT TTTGGACAAC
ATGATCACTG TTTTTGGTGC AAATGGTGCA AAGGGGTTGG TTGAATTGAA ATTGTTGATT
AGTGAGGATA TCCAAATTGA AGGGTACATA TCAAGCTATT CATTTGGACT TGGCCGCCTG
GTGACTGATC GACAATTTAT GTTCATAAAT AAACGACCTG TTATGTTGAA AAAGTTGTCC
AAAATTGTCA ATGATGTGTA TAAATCTTTC AACCATATGC AATATCCAAT ATTCATATTG
AATATAGCTA TCGATCCACT GGCCCTAGAT GTTAATTTGA CTCCAGACAA AGGGATGATA
ATGATCCACC TGGAGCAAGA TTTGTTTGAA AAGATTCGGT TAGACTTAAT TGGATTCTTT
GAGAGTCAAG ACAATGTGAT TCCTAGGAAT TTAGAGCGGC ATGAGTTAAA CAAATCCACT
TTCACTAAAA GAGAAAAGAA GCCATCAGTT CAGAACATAG AGCCAGATTC TGACATTGAA
GATATCAAAT TACCTCTTGC TAAGGAAATG GGTGAAAGTA GTATCGCTAC AGAGTCTGAT
ACAATTGTAC AAGAGAGCTG TGCTTCCGGA CTTGATAACA TCAAGGAGCA TTACCAAGAA
TCTAATGTAG TGGAAGATGG ACAGGAACAC TTACTTCCAG AGCATGAGGA AAGCTTGCAA
GTAGTTACAG TCGTTACAGA TCCAGAAACC AGCAATATAG ATCAAGAACA GAGGAATACT
TTCGAATTAA TCAACAAAGA AAGAGAACAA GATATCGGCA GCCCAGACGA TGACGAGGAT
CAACCAATTG AAGGTAGGGA AGATGTTGAT ACTAAAGAAG AGGGAAATGA TCAGATTATA
GAAGACGATA GCAAGAATTT AGCATCTTCC AAATATTCGC CGATCAACAC TCTTGCCCCA
GAATATGATG ATCATAGAAA AAACTATCAA ATTACACAAC AAAAAAAATG TTTGATTAGA
ATCAACGAAC AGGAGTTCGA AGAAAGCCAC AACAAGAGAT TAAAGATTGA TATGTTACAC
AACAGAGTAG AGTCGGCTAA TAGCCATGCC ATTGTATCTT CGATAAAGGA AGAGATGTGT
CATACTTCCC TCAATAATAA CGGGAATCGT AGTTTAAAAC TACTAGAGAT ACAGGATGAC
CAGAGTTTAT CATACACAAT ACTGAGAAAA GATTTTCTAA AGATGAAGCT CATTGGACAA
TTCAACCTTG GATTCATCCT AGTAACACTT GATGACAATA ACTTGTTTAT TATTGATCAG
CATGCAAGCG ATGAAAAATA TAACTTCGAA AGATTAAACC AGGAGCTTTC GATCAAGATT
CAAAGGTTGA TTGTCCCTCA GACAATAGAG TTAAGTATTA TAGATGAGTT ACTTGTTATT
GAACACGAGC AAATATTTAT GTCCAATGGG TATCAATTCA CTGTGGTCCT TGAAGCAAAA
CCAGGATCCA GAATAAGGTT GAACACTATG CCAAGTTCGC GTGGCGTCGT ATTTGACTTG
AACGATTTCC AGGAGCTTAT CAACCTCGTC AACACTAATC CACGAAACAA GAACTTGAAA
TGCTCTAAGA TCCGGAACTT ACTAGCAATG AGAGCCTGTC GCTCAAGTAT TATGATTGGC
CAGCCGTTAA CCCGCGGGAG AATGACTAAA GTTGTTCAGA ATTTAAGTCA ACTAGATAAA
CCTTGGAATT GTCCTCATGG CCGGCCGACA ATGAGACATC TAGTAGAGTT CGACCATTGG
AGAGACAATC GTGTAGACTA TGAGATATAA
 
Protein sequence
MAIKSIDQKD VSKITSGQVI IDLKSIVKEL VENSIDANST KIEINFQNYG IDSISVTDNG 
KGIKKEDFEF VCLRSHTSKI SEFEDLDKLS TLGFRGEALN SICSVSSKVK IVTCTDYPKN
HELDYDKAGK LSKSVSKIGG GFSKQTGTSV SIEKIFFDLP VRLKNFVKNS KREFHKAINF
LTQYLLIYPE IKFSVFNVVN SKKSLALSSR GGSKSTILDN MITVFGANGA KGLVELKLLI
SEDIQIEGYI SSYSFGLGRS VTDRQFMFIN KRPVMLKKLS KIVNDVYKSF NHMQYPIFIL
NIAIDPSALD VNLTPDKGMI MIHSEQDLFE KIRLDLIGFF ESQDNVIPRN LERHELNKST
FTKREKKPSV QNIEPDSDIE DIKLPLAKEM GESSIATESD TIVQESCASG LDNIKEHYQE
SNVVEDGQEH LLPEHEESLQ VVTVVTDPET SNIDQEQRNT FELINKEREQ DIGSPDDDED
QPIEGREDVD TKEEGNDQII EDDSKNLASS KYSPINTLAP EYDDHRKNYQ ITQQKKCLIR
INEQEFEESH NKRLKIDMLH NRVESANSHA IVSSIKEEMC HTSLNNNGNR SLKLLEIQDD
QSLSYTISRK DFLKMKLIGQ FNLGFILVTL DDNNLFIIDQ HASDEKYNFE RLNQELSIKI
QRLIVPQTIE LSIIDELLVI EHEQIFMSNG YQFTVVLEAK PGSRIRLNTM PSSRGVVFDL
NDFQELINLV NTNPRNKNLK CSKIRNLLAM RACRSSIMIG QPLTRGRMTK VVQNLSQLDK
PWNCPHGRPT MRHLVEFDHW RDNRVDYEI