Gene Information |
Locus tag | PICST_60166 |
Symbol | |
ID | 4839443 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 1358978 |
End bp | 1360672 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390758 |
Product | predicted protein |
Protein accession | XP_001385266 |
Protein GI | 150865875 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1948] ERCC4-type nuclease |
TIGRFAM ID | [TIGR00596] DNA repair protein (rad1) |
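
The length fields in the table above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; neither convention is stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1358978
end_bp = 1360672
gene_length_bp = 1695
protein_length_aa = 564

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the unspliced CDS is a whole number of codons and the
# final codon is the stop codon, which is not counted in the protein.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(span_bp == gene_length_bp)
print(remainder == 0 and coding_codons == protein_length_aa)
```

Under these assumptions both checks print `True`: the coordinate span equals the stated gene length, and 565 codons minus the stop codon give the stated 564 aa.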
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000965923 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAACACA GCGTAGGCTC AGATTCTGAA GAGTATCTTC TTGAAGAATT GCCTAAGTGG CACGAACTTG GAAACATGTT GGATGACATA TTCCACGAGA AGTCACTATC ATCAGAGAAT CTGGGTCCTA TACTCATTAT GTGCTCGGAT ACTCGTACAG CTCGTCAGTT GTACCAGGTG ATAGAATCAA TGAAAGAGAT TAAAGTAAGC GGTAAGAAAT ACTTCAGTTC CAGAAGATTT ATGATTCTGA AGTTGCATGA GTATCTCCAA TGGAAGGAGA TCAACAACTT GTCTAAACAG CTTAACGAAG ATTTGGAGAA ATCTGAAGAC CAGACCGAAG AACAGATTAT TACGTCTAAA TCATTTACCA GAAATGGACA ACCAGCTAGC AAAAGACGAA GAACTCGTGG AGCTTCTTCC ACAGCAAGAG TTGCCAAACT ATACTCCGGG GAAAACAGAG GGGCTGTCGA TATTGATGAA AATGTATTAG GTCAAATGGA CCAAGAAATC GTAGAGTCAG AGGAAGACGA TGTCGTGGAA ACAGGACCTA CTGGTTTATT TGTCGAAACT GAAGACATCA TCGTGCCTTC GTTAAGTCAT ATCAATATGG GTGATCAGGT AATAATCCAA GTCTACGACG AAGGCAGAAA CGATGCTTTA CTTCAGGAAA TTTCGCCTTC ATACATAATA ATGTATGAAC CAAACTTGTC ATTCATACGG CGTACAGAAA TCTTTCAAGC CATAAACAGG GACCAGCCTG CGAAGGTCTT TGTAATGTTT TACAGCAACT CCACAGAAGA ACAAAAGTAT TTGCTTCGAT TGAAGAAGGA GAAAGATGCA TTTACTAAGT TAATCAGAGA AAAGGCATCG TTGAGTAAAC ATTTCGAGAC GAGTGAAGAT AACTATAAAT TCCAAATTCA GAGAAATCAA ACGATGAACA CTAGGATAGC AGGAGGGGCT TCGTTCAGAA CGACCGATGA AATGAGAATC GTCGTGGACT CAAGAGAATT TGGTGCTCTG CTACCAAATT TGTTGTACAG AATTGGAATC AAAGTTGTTC CATGTATGCT TACAGTTGGC GATTATGTCA TTTCTCCTAA GATTTGCGTA GAGAGAAAGG CAATTCCAGA TTTGGTTTCT AGTTTTAAAT CTGGAAGATT ATTTACTCAG TGTTCTCAGA TGTTCAAGCA TTATGAGACA CCTACGTTGC TAATCGAATT CGACGAGAAC AAGTCATTTT CTTTGCAGCA GTACGCTGAT TCCCGGTTTT TAAAAGGAAG AGCAGAAACA GCCAACGATT CGCCCATCAA CCAACTGTTG CAGTCTAAGA TTATGGAGTT GTTGGTTGCA TATCCCAAGT TGAAAATCAT ATGGTCGTCT TCTCCGTATG AGACAGCACA GATATTCATG TCGTTGAAAG CCAATCAGGA GGAGCCAGAT GTAGAATCAG CTTTGAATAA AGGTGTCAGC AAAGAAGTCA TAACTGAAGA TGGAGGGCCA CCAAACTTTA ATGACGACCC GATCGACTTC ATACAAAACA TCCCAGGCAT AAACGATATG AATTACTATA AGATTATCCA AAATGTCAGG AATTTAGAAG AGTTGGTTCA GCTCTCAAAG GAGCAGTTTG TGAAGTTGCT TGGAAAAGAA AACGGAAAGA AGGCTTATAA CTTTATCAAC CATAGAATTA AGTAG
|
Protein sequence | MKHSVGSDSE EYLLEELPKW HELGNMLDDI FHEKSLSSEN SGPILIMCSD TRTARQLYQV IESMKEIKVS GKKYFSSRRF MISKLHEYLQ WKEINNLSKQ LNEDLEKSED QTEEQIITSK SFTRNGQPAS KRRRTRGASS TARVAKLYSG ENRGAVDIDE NVLGQMDQEI VESEEDDVVE TGPTGLFVET EDIIVPSLSH INMGDQVIIQ VYDEGRNDAL LQEISPSYII MYEPNLSFIR RTEIFQAINR DQPAKVFVMF YSNSTEEQKY LLRLKKEKDA FTKLIREKAS LSKHFETSED NYKFQIQRNQ TMNTRIAGGA SFRTTDEMRI VVDSREFGAS LPNLLYRIGI KVVPCMLTVG DYVISPKICV ERKAIPDLVS SFKSGRLFTQ CSQMFKHYET PTLLIEFDEN KSFSLQQYAD SRFLKGRAET ANDSPINQSL QSKIMELLVA YPKLKIIWSS SPYETAQIFM SLKANQEEPD VESALNKGVS KEVITEDGGP PNFNDDPIDF IQNIPGINDM NYYKIIQNVR NLEELVQLSK EQFVKLLGKE NGKKAYNFIN HRIK
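
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence relates to the gene sequence under that table. It uses only the first 150 bases and the first 50 residues shown above, and it assumes the gene sequence is displayed in coding orientation (it begins with ATG even though the gene lies on the minus strand). The function names are hypothetical and are not part of any database tooling.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def standard_table():
    """Map each codon to its amino acid under the standard code."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, STANDARD_AA))


def table_12():
    """Standard code with the single table-12 change: CTG encodes Ser."""
    table = standard_table()
    table["CTG"] = "S"
    return table


def translate(dna, table):
    """Translate complete codons in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 150 bases of the gene sequence and first 50 residues of the protein.
gene_prefix = (
    "ATGAAACACA GCGTAGGCTC AGATTCTGAA GAGTATCTTC TTGAAGAATT "
    "GCCTAAGTGG CACGAACTTG GAAACATGTT GGATGACATA TTCCACGAGA "
    "AGTCACTATC ATCAGAGAAT CTGGGTCCTA TACTCATTAT GTGCTCGGAT"
)
protein_prefix = "MKHSVGSDSE EYLLEELPKW HELGNMLDDI FHEKSLSSEN SGPILIMCSD".replace(" ", "")

with_table_12 = translate(gene_prefix, table_12())
with_standard = translate(gene_prefix, standard_table())

print(with_table_12 == protein_prefix)
print(with_standard == protein_prefix)
```

The first comparison prints `True` and the second prints `False`: codon 41 is CTG, which the listed protein sequence shows as S, so translating with the standard code would introduce a leucine at that position.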