Gene Information |
Locus tag | PICST_35911 |
Symbol | |
ID | 4838979 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | - |
Start bp | 850663 |
End bp | 853641 |
Gene Length | 2979 bp |
Protein Length | 992 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390294 |
Product | predicted protein |
Protein accession | XP_001384465 |
Protein GI | 150865308 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR00600] DNA excision repair protein (rad2) |
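The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the protein length excludes the terminal stop codon. The function names are hypothetical and not part of any IMG or NCBI tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(850663, 853641)
print(length_bp)                  # 2979
print(protein_length(length_bp))  # 992
```

Both results agree with the Gene Length and Protein Length rows of this record.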
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.3407 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.878994 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTGTCA ACTCCTTGTG GGATATAGTT GGGCCCACGG CTAGGCCGGT CCGGTTGGAG GCTCTTTCCC GCAAGAAGCT AGCTGTTGAT GCCTCTATCT GGATATATCA GTTTCTCAAA GCAGTCAGAG ATAGCGAAGG GAATTCCCTT CCACAATCCC ATATAGTTGG GTTTTTCAGA AGGATATGCA AGCTTTTATA TTTTGGAATC TTTCCCATTT TTGTATTTGA CGGAGGTGCT CCTGCTCTTA AGCGAGAGAC AATAAACCAG AGAAGAGAAA GAAGACAAGG ACAGGCAGAA ACCACGCGGC AAACGGCTCA GAAGTTGTTG GCAATTCAAA TCCAGAGAGA GGCAGAAAAG CTGAGAAACG TGGCGAATGG AAAACACGTC AATTACGATG CCGAAGATGA TGTGATATAT TTGGAAGACT TGCCCATCAA CCATCCACAG CATGGTGCGA CTGTTGGAAA TCAGCCAGTC AAAAGTAGAG AGGGAACTCC TAGTTCTCAG ACAAATGTAT TTAGAAAGGC TGATGAGTAT CATTTACCAG AATTAAAGCA GTTCAAGGTA TCCAGAGATG ATGGCAGAAT TATGCCAGAA GAAGAATTTC TCGAAGAAAA TATTGACCAT ATTGATGGTG TAGATATCAA CACGGTGGAT CCCAATTCAA AGGAATTTCT TTTATTACCC AAAGCTACAC AGTATATGAT TTTGTCACAT CTCAGATTAC GATCCAGATT GAGAATGGGG TACAGAAAGG AACAACTTGA AGAGCTATTT CCCAATAGCA TGGATTTCTC GAAGTTCCAG ATCAAGCAGG TACAAAAGAG AAACTTTTAT ACTCAGAAAC TAATGAAAAT TGCTGGTATG GGAGAAGATG GAAACATGAC AAAGAGAATC GCAGGTGACA AGGATAGAAA ATATGCCTTA GTCAGAAACG AAGATGGCTG GACTCTCGCT TTACAAGGTG AGGAATCTAC TGCTTCAAAT CCAATCGTTC TAGACGAAAA TGTAGAGGAT ATTGCTTCCA CAATCAGGGA CGAACCGAAG CCTGAAATTG CTGTTATAGA AGATGAAGTA GTAAAAGTGG AAGAACAAAG CGATTCAGAG TTTGAAGATG TTCCATTTGA AAGCGAAGAA GTAAAAACTG AAAATATAGA TAGTGAAGAC GATGATTTAC AGAATGCGTT GATTGAATCG ATTTATGAGA AGTTTGAAAG TCGCCCTTCC GAAATCAAGA GCATAACTTC TGATGGTTTT GATGAAGAAG AACTTTCGAA AGCAATTGAG TTCTCTAGAA AAGATCTTCA ACAATTACAG CAACAGGAAA AGGAAATAGA AAGAGGTTCA GCTAATACTT CATCTACCCA CACCTTTGCA ACTACACAGG ATAGACAACT AATGCCATCT GAATCTGAAT TTTCTTTTGG AGAATCGTTG TTATTTGTAA ATTCAGAATC ATCTAAAATC AATGAGCAAA ATTTGGAACT TGATTTGGGC AGTTCGATTC TCTTTACAGG GTCTGACAAA AGAGAAACAC CAATTGCCAA TGTACATACA ACAGAGGCAA AGAGCCAAAA CGAAGCTAAA GAAAGGGAAA AAGAAAAAGT AGACTCTACT GAAGCTGCAA CTGACAAGAA AAGTGATAAA ATAGTTGATG AACCAAGACA ACTCCCCAGC TGGTTTAGTG CTGAATCTTC TTCCAATCCT CATAGTCAGC CTTTCATGAC TTATAGTAGC CAAAACACCT CTAAGAAATA CAAGGATGAT GAAGAAGCAG GGCTAATCAG TTGGAACGAA 
GCAAAGACTT ATCTCGATGC AGAAGAAATT AACGATGTCC TGGAAGAGTC AGAAGGCGAT ATCCAAGAAA TCAGTGCTCC AGCGACTGAA ATTCCGGAGT CAATTACTCA GCAAACGGTA TTGGAAGTAC CCAGTAATAA AGAAAATGTT AACGAGGAAC GGAGAGTTGA AGTATTAGAC TATGATTTTG AACAGGAAGA AGAGGAAGAA CTAGTGGAAC AACTCCAGAA AGAAGAGGCT GCCCATGAGA GTTTCCGTGA AAACATTAAG AAGATACACC AGATCCCAGT TGGATCAACA AGCACGAGCA TTAGTGAAGA ACAGCTATTA CAAGAGCAAT TACAGAAATC AAAGAGAGAT TCGGATGAAG TAACACAGAA CATGATCACA GATGTCCAAG AGCTTCTTCG TCGATTTGGT ATTCCTTATA TTACGGCTCC AATGGAAGCA GAAGCACAGT GTGCTGAACT AGTGAAAATT GGATTGGTGG ATGGAATCAT CACAGATGAC AGTGATTGCT TTTTATTTGG AGGAACCAAA GTGTACAAGA ACATGTTCAA TCAGAAGCAG TATGTGGAAT GCTACTCACA GGATGACGTT GTTGATAAGA TTGGATTAAC CAGAAAAAAT CTCATCGAGC TAGCACTTTT ATTAGGTAGT GACTATACAG AAGGTATTAA GGGAATTGGT CCAGTTCTTG CGATGGAGAT ATTGGCAGAG TTTGGGTCGT TGAAGAATTT TAAAAAGTGG TTTGACGAGA AGACAAAGAC AGTCAAGTCT GACAAGAAAG ATCAGACGGC ATTGGAAAAG AACTTGCTAG GAAGAATAAG AAACGGAAAG CTTTTCTTGC CCGAAAGATT TCCTGATAGT GTGGTCTTTG ACGCTTATGA ACATCCTGAA GTTGATCATG ATCGTAGCGA GTTCAAATGG GGAGTGCCCA ATCTTGACCA AATCCGCTCT TTCTTAATGT ATAACTTACG ATGGACTCAG GATAAGGTGG ATGAAGTTAT GATTCCATTG ATCAGAGACA TGAATAGGAA GAGAACCGAA GGCACACAAT CCACGATTGG GGAGTTCTTC CCGCAGGAAT ATATCCAATC ACGGAAGGAG GTGAAATTGG GAAAGAGAAT GAAGTCGGCA GCTAATAAGT TGAATAAGAG ACAGAAGAAA GGTACTTAG
|
Protein sequence | MGVNSLWDIV GPTARPVRLE ALSRKKLAVD ASIWIYQFLK AVRDSEGNSL PQSHIVGFFR RICKLLYFGI FPIFVFDGGA PALKRETINQ RRERRQGQAE TTRQTAQKLL AIQIQREAEK SRNVANGKHV NYDAEDDVIY LEDLPINHPQ HGATVGNQPV KSREGTPSSQ TNVFRKADEY HLPELKQFKV SRDDGRIMPE EEFLEENIDH IDGVDINTVD PNSKEFLLLP KATQYMILSH LRLRSRLRMG YRKEQLEELF PNSMDFSKFQ IKQVQKRNFY TQKLMKIAGM GEDGNMTKRI AGDKDRKYAL VRNEDGWTLA LQGEESTASN PIVLDENVED IASTIRDEPK PEIAVIEDEV VKVEEQSDSE FEDVPFESEE VKTENIDSED DDLQNALIES IYEKFESRPS EIKSITSDGF DEEELSKAIE FSRKDLQQLQ QQEKEIERGS ANTSSTHTFA TTQDRQLMPS ESEFSFGESL LFVNSESSKI NEQNLELDLG SSILFTGSDK RETPIANVHT TEAKSQNEAK EREKEKVDST EAATDKKSDK IVDEPRQLPS WFSAESSSNP HSQPFMTYSS QNTSKKYKDD EEAGLISWNE AKTYLDAEEI NDVSEESEGD IQEISAPATE IPESITQQTV LEVPSNKENV NEERRVEVLD YDFEQEEEEE LVEQLQKEEA AHESFRENIK KIHQIPVGST STSISEEQLL QEQLQKSKRD SDEVTQNMIT DVQELLRRFG IPYITAPMEA EAQCAELVKI GLVDGIITDD SDCFLFGGTK VYKNMFNQKQ YVECYSQDDV VDKIGLTRKN LIELALLLGS DYTEGIKGIG PVLAMEILAE FGSLKNFKKW FDEKTKTVKS DKKDQTALEK NLLGRIRNGK LFLPERFPDS VVFDAYEHPE VDHDRSEFKW GVPNLDQIRS FLMYNLRWTQ DKVDEVMIPL IRDMNRKRTE GTQSTIGEFF PQEYIQSRKE VKLGKRMKSA ANKLNKRQKK GT
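Translation table 12 (the alternative yeast nuclear code) differs from the standard code in reading CTG as serine rather than leucine. The sketch below is a minimal illustration, not the pipeline that produced this record: it builds the standard table in NCBI base order (T, C, A, G) and overrides that single codon. The names `translate`, `CODON_TABLE_1`, and `CODON_TABLE_12` are hypothetical. Both test fragments are copied from the gene sequence above.

```python
BASES = "TCAG"  # NCBI ordering of bases within the codon table
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE_1 = dict(zip(CODONS, STANDARD_AA))    # standard code
CODON_TABLE_12 = {**CODON_TABLE_1, "CTG": "S"}    # CTG read as Ser instead of Leu


def translate(dna: str, table: dict = CODON_TABLE_12) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in this record can be
    pasted directly. Any symbol other than A, C, G, T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence
print(translate("ATGGGTGTCA ACTCCTTGTG GGATATAGTT"))       # MGVNSLWDIV

# A fragment containing a CTG codon
print(translate("GCAGAAAAG CTGAGAAAC"))                    # AEKSRN
print(translate("GCAGAAAAG CTGAGAAAC", CODON_TABLE_1))     # AEKLRN
```

The table 12 output for the second fragment matches the protein sequence listed above (…REAEK SRNVANG…), whereas the standard code would place a leucine at that position.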