Gene PICST_60166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_60166 
Symbol 
ID4839443 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1358978 
End bp1360672 
Gene Length1695 bp 
Protein Length564 aa 
Translation table12 
GC content40% 
IMG OID640390758 
Productpredicted protein 
Protein accessionXP_001385266 
Protein GI150865875 
COG category[L] Replication, recombination and repair 
COG ID[COG1948] ERCC4-type nuclease 
TIGRFAM ID[TIGR00596] DNA repair protein (rad1) 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000965923 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACACA GCGTAGGCTC AGATTCTGAA GAGTATCTTC TTGAAGAATT GCCTAAGTGG 
CACGAACTTG GAAACATGTT GGATGACATA TTCCACGAGA AGTCACTATC ATCAGAGAAT
CTGGGTCCTA TACTCATTAT GTGCTCGGAT ACTCGTACAG CTCGTCAGTT GTACCAGGTG
ATAGAATCAA TGAAAGAGAT TAAAGTAAGC GGTAAGAAAT ACTTCAGTTC CAGAAGATTT
ATGATTCTGA AGTTGCATGA GTATCTCCAA TGGAAGGAGA TCAACAACTT GTCTAAACAG
CTTAACGAAG ATTTGGAGAA ATCTGAAGAC CAGACCGAAG AACAGATTAT TACGTCTAAA
TCATTTACCA GAAATGGACA ACCAGCTAGC AAAAGACGAA GAACTCGTGG AGCTTCTTCC
ACAGCAAGAG TTGCCAAACT ATACTCCGGG GAAAACAGAG GGGCTGTCGA TATTGATGAA
AATGTATTAG GTCAAATGGA CCAAGAAATC GTAGAGTCAG AGGAAGACGA TGTCGTGGAA
ACAGGACCTA CTGGTTTATT TGTCGAAACT GAAGACATCA TCGTGCCTTC GTTAAGTCAT
ATCAATATGG GTGATCAGGT AATAATCCAA GTCTACGACG AAGGCAGAAA CGATGCTTTA
CTTCAGGAAA TTTCGCCTTC ATACATAATA ATGTATGAAC CAAACTTGTC ATTCATACGG
CGTACAGAAA TCTTTCAAGC CATAAACAGG GACCAGCCTG CGAAGGTCTT TGTAATGTTT
TACAGCAACT CCACAGAAGA ACAAAAGTAT TTGCTTCGAT TGAAGAAGGA GAAAGATGCA
TTTACTAAGT TAATCAGAGA AAAGGCATCG TTGAGTAAAC ATTTCGAGAC GAGTGAAGAT
AACTATAAAT TCCAAATTCA GAGAAATCAA ACGATGAACA CTAGGATAGC AGGAGGGGCT
TCGTTCAGAA CGACCGATGA AATGAGAATC GTCGTGGACT CAAGAGAATT TGGTGCTCTG
CTACCAAATT TGTTGTACAG AATTGGAATC AAAGTTGTTC CATGTATGCT TACAGTTGGC
GATTATGTCA TTTCTCCTAA GATTTGCGTA GAGAGAAAGG CAATTCCAGA TTTGGTTTCT
AGTTTTAAAT CTGGAAGATT ATTTACTCAG TGTTCTCAGA TGTTCAAGCA TTATGAGACA
CCTACGTTGC TAATCGAATT CGACGAGAAC AAGTCATTTT CTTTGCAGCA GTACGCTGAT
TCCCGGTTTT TAAAAGGAAG AGCAGAAACA GCCAACGATT CGCCCATCAA CCAACTGTTG
CAGTCTAAGA TTATGGAGTT GTTGGTTGCA TATCCCAAGT TGAAAATCAT ATGGTCGTCT
TCTCCGTATG AGACAGCACA GATATTCATG TCGTTGAAAG CCAATCAGGA GGAGCCAGAT
GTAGAATCAG CTTTGAATAA AGGTGTCAGC AAAGAAGTCA TAACTGAAGA TGGAGGGCCA
CCAAACTTTA ATGACGACCC GATCGACTTC ATACAAAACA TCCCAGGCAT AAACGATATG
AATTACTATA AGATTATCCA AAATGTCAGG AATTTAGAAG AGTTGGTTCA GCTCTCAAAG
GAGCAGTTTG TGAAGTTGCT TGGAAAAGAA AACGGAAAGA AGGCTTATAA CTTTATCAAC
CATAGAATTA AGTAG
 
Protein sequence
MKHSVGSDSE EYLLEELPKW HELGNMLDDI FHEKSLSSEN SGPILIMCSD TRTARQLYQV 
IESMKEIKVS GKKYFSSRRF MISKLHEYLQ WKEINNLSKQ LNEDLEKSED QTEEQIITSK
SFTRNGQPAS KRRRTRGASS TARVAKLYSG ENRGAVDIDE NVLGQMDQEI VESEEDDVVE
TGPTGLFVET EDIIVPSLSH INMGDQVIIQ VYDEGRNDAL LQEISPSYII MYEPNLSFIR
RTEIFQAINR DQPAKVFVMF YSNSTEEQKY LLRLKKEKDA FTKLIREKAS LSKHFETSED
NYKFQIQRNQ TMNTRIAGGA SFRTTDEMRI VVDSREFGAS LPNLLYRIGI KVVPCMLTVG
DYVISPKICV ERKAIPDLVS SFKSGRLFTQ CSQMFKHYET PTLLIEFDEN KSFSLQQYAD
SRFLKGRAET ANDSPINQSL QSKIMELLVA YPKLKIIWSS SPYETAQIFM SLKANQEEPD
VESALNKGVS KEVITEDGGP PNFNDDPIDF IQNIPGINDM NYYKIIQNVR NLEELVQLSK
EQFVKLLGKE NGKKAYNFIN HRIK