Gene PICST_35911 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_35911 
Symbol 
ID4838979 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp850663 
End bp853641 
Gene Length2979 bp 
Protein Length992 aa 
Translation table12 
GC content40% 
IMG OID640390294 
Productpredicted protein 
Protein accessionXP_001384465 
Protein GI150865308 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR00600] DNA excision repair protein (rad2) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.3407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.878994 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTGTCA ACTCCTTGTG GGATATAGTT GGGCCCACGG CTAGGCCGGT CCGGTTGGAG 
GCTCTTTCCC GCAAGAAGCT AGCTGTTGAT GCCTCTATCT GGATATATCA GTTTCTCAAA
GCAGTCAGAG ATAGCGAAGG GAATTCCCTT CCACAATCCC ATATAGTTGG GTTTTTCAGA
AGGATATGCA AGCTTTTATA TTTTGGAATC TTTCCCATTT TTGTATTTGA CGGAGGTGCT
CCTGCTCTTA AGCGAGAGAC AATAAACCAG AGAAGAGAAA GAAGACAAGG ACAGGCAGAA
ACCACGCGGC AAACGGCTCA GAAGTTGTTG GCAATTCAAA TCCAGAGAGA GGCAGAAAAG
CTGAGAAACG TGGCGAATGG AAAACACGTC AATTACGATG CCGAAGATGA TGTGATATAT
TTGGAAGACT TGCCCATCAA CCATCCACAG CATGGTGCGA CTGTTGGAAA TCAGCCAGTC
AAAAGTAGAG AGGGAACTCC TAGTTCTCAG ACAAATGTAT TTAGAAAGGC TGATGAGTAT
CATTTACCAG AATTAAAGCA GTTCAAGGTA TCCAGAGATG ATGGCAGAAT TATGCCAGAA
GAAGAATTTC TCGAAGAAAA TATTGACCAT ATTGATGGTG TAGATATCAA CACGGTGGAT
CCCAATTCAA AGGAATTTCT TTTATTACCC AAAGCTACAC AGTATATGAT TTTGTCACAT
CTCAGATTAC GATCCAGATT GAGAATGGGG TACAGAAAGG AACAACTTGA AGAGCTATTT
CCCAATAGCA TGGATTTCTC GAAGTTCCAG ATCAAGCAGG TACAAAAGAG AAACTTTTAT
ACTCAGAAAC TAATGAAAAT TGCTGGTATG GGAGAAGATG GAAACATGAC AAAGAGAATC
GCAGGTGACA AGGATAGAAA ATATGCCTTA GTCAGAAACG AAGATGGCTG GACTCTCGCT
TTACAAGGTG AGGAATCTAC TGCTTCAAAT CCAATCGTTC TAGACGAAAA TGTAGAGGAT
ATTGCTTCCA CAATCAGGGA CGAACCGAAG CCTGAAATTG CTGTTATAGA AGATGAAGTA
GTAAAAGTGG AAGAACAAAG CGATTCAGAG TTTGAAGATG TTCCATTTGA AAGCGAAGAA
GTAAAAACTG AAAATATAGA TAGTGAAGAC GATGATTTAC AGAATGCGTT GATTGAATCG
ATTTATGAGA AGTTTGAAAG TCGCCCTTCC GAAATCAAGA GCATAACTTC TGATGGTTTT
GATGAAGAAG AACTTTCGAA AGCAATTGAG TTCTCTAGAA AAGATCTTCA ACAATTACAG
CAACAGGAAA AGGAAATAGA AAGAGGTTCA GCTAATACTT CATCTACCCA CACCTTTGCA
ACTACACAGG ATAGACAACT AATGCCATCT GAATCTGAAT TTTCTTTTGG AGAATCGTTG
TTATTTGTAA ATTCAGAATC ATCTAAAATC AATGAGCAAA ATTTGGAACT TGATTTGGGC
AGTTCGATTC TCTTTACAGG GTCTGACAAA AGAGAAACAC CAATTGCCAA TGTACATACA
ACAGAGGCAA AGAGCCAAAA CGAAGCTAAA GAAAGGGAAA AAGAAAAAGT AGACTCTACT
GAAGCTGCAA CTGACAAGAA AAGTGATAAA ATAGTTGATG AACCAAGACA ACTCCCCAGC
TGGTTTAGTG CTGAATCTTC TTCCAATCCT CATAGTCAGC CTTTCATGAC TTATAGTAGC
CAAAACACCT CTAAGAAATA CAAGGATGAT GAAGAAGCAG GGCTAATCAG TTGGAACGAA
GCAAAGACTT ATCTCGATGC AGAAGAAATT AACGATGTCC TGGAAGAGTC AGAAGGCGAT
ATCCAAGAAA TCAGTGCTCC AGCGACTGAA ATTCCGGAGT CAATTACTCA GCAAACGGTA
TTGGAAGTAC CCAGTAATAA AGAAAATGTT AACGAGGAAC GGAGAGTTGA AGTATTAGAC
TATGATTTTG AACAGGAAGA AGAGGAAGAA CTAGTGGAAC AACTCCAGAA AGAAGAGGCT
GCCCATGAGA GTTTCCGTGA AAACATTAAG AAGATACACC AGATCCCAGT TGGATCAACA
AGCACGAGCA TTAGTGAAGA ACAGCTATTA CAAGAGCAAT TACAGAAATC AAAGAGAGAT
TCGGATGAAG TAACACAGAA CATGATCACA GATGTCCAAG AGCTTCTTCG TCGATTTGGT
ATTCCTTATA TTACGGCTCC AATGGAAGCA GAAGCACAGT GTGCTGAACT AGTGAAAATT
GGATTGGTGG ATGGAATCAT CACAGATGAC AGTGATTGCT TTTTATTTGG AGGAACCAAA
GTGTACAAGA ACATGTTCAA TCAGAAGCAG TATGTGGAAT GCTACTCACA GGATGACGTT
GTTGATAAGA TTGGATTAAC CAGAAAAAAT CTCATCGAGC TAGCACTTTT ATTAGGTAGT
GACTATACAG AAGGTATTAA GGGAATTGGT CCAGTTCTTG CGATGGAGAT ATTGGCAGAG
TTTGGGTCGT TGAAGAATTT TAAAAAGTGG TTTGACGAGA AGACAAAGAC AGTCAAGTCT
GACAAGAAAG ATCAGACGGC ATTGGAAAAG AACTTGCTAG GAAGAATAAG AAACGGAAAG
CTTTTCTTGC CCGAAAGATT TCCTGATAGT GTGGTCTTTG ACGCTTATGA ACATCCTGAA
GTTGATCATG ATCGTAGCGA GTTCAAATGG GGAGTGCCCA ATCTTGACCA AATCCGCTCT
TTCTTAATGT ATAACTTACG ATGGACTCAG GATAAGGTGG ATGAAGTTAT GATTCCATTG
ATCAGAGACA TGAATAGGAA GAGAACCGAA GGCACACAAT CCACGATTGG GGAGTTCTTC
CCGCAGGAAT ATATCCAATC ACGGAAGGAG GTGAAATTGG GAAAGAGAAT GAAGTCGGCA
GCTAATAAGT TGAATAAGAG ACAGAAGAAA GGTACTTAG
 
Protein sequence
MGVNSLWDIV GPTARPVRLE ALSRKKLAVD ASIWIYQFLK AVRDSEGNSL PQSHIVGFFR 
RICKLLYFGI FPIFVFDGGA PALKRETINQ RRERRQGQAE TTRQTAQKLL AIQIQREAEK
SRNVANGKHV NYDAEDDVIY LEDLPINHPQ HGATVGNQPV KSREGTPSSQ TNVFRKADEY
HLPELKQFKV SRDDGRIMPE EEFLEENIDH IDGVDINTVD PNSKEFLLLP KATQYMILSH
LRLRSRLRMG YRKEQLEELF PNSMDFSKFQ IKQVQKRNFY TQKLMKIAGM GEDGNMTKRI
AGDKDRKYAL VRNEDGWTLA LQGEESTASN PIVLDENVED IASTIRDEPK PEIAVIEDEV
VKVEEQSDSE FEDVPFESEE VKTENIDSED DDLQNALIES IYEKFESRPS EIKSITSDGF
DEEELSKAIE FSRKDLQQLQ QQEKEIERGS ANTSSTHTFA TTQDRQLMPS ESEFSFGESL
LFVNSESSKI NEQNLELDLG SSILFTGSDK RETPIANVHT TEAKSQNEAK EREKEKVDST
EAATDKKSDK IVDEPRQLPS WFSAESSSNP HSQPFMTYSS QNTSKKYKDD EEAGLISWNE
AKTYLDAEEI NDVSEESEGD IQEISAPATE IPESITQQTV LEVPSNKENV NEERRVEVLD
YDFEQEEEEE LVEQLQKEEA AHESFRENIK KIHQIPVGST STSISEEQLL QEQLQKSKRD
SDEVTQNMIT DVQELLRRFG IPYITAPMEA EAQCAELVKI GLVDGIITDD SDCFLFGGTK
VYKNMFNQKQ YVECYSQDDV VDKIGLTRKN LIELALLLGS DYTEGIKGIG PVLAMEILAE
FGSLKNFKKW FDEKTKTVKS DKKDQTALEK NLLGRIRNGK LFLPERFPDS VVFDAYEHPE
VDHDRSEFKW GVPNLDQIRS FLMYNLRWTQ DKVDEVMIPL IRDMNRKRTE GTQSTIGEFF
PQEYIQSRKE VKLGKRMKSA ANKLNKRQKK GT