Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32483 |
Symbol | |
ID | 4839265 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 1621846 |
End bp | 1623813 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640390580 |
Product | predicted protein |
Protein accession | XP_001385007 |
Protein GI | 150865684 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00583] DNA repair protein (mre11) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0241733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.371537 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTCCTG TTGACAAGAT ACCCGAGGGA GAGGACACTA TACGAATCCT TCTCACCACA GACAACCATG TTGGCTACAA CGAGACAGAT CCAGTGCGAG GAGACGATGG CTGGAAGACC TTTCATGAGA TCACCAGACT CGCAAAGCAA TTAGATGTAG ACATGATAGT CCAGGGTGGC GACTTGTTCC ATATCAACAA ACCGCTGAAA AAATCTCTTT TCCACGTCAT GAAGTCCCTT CGTTTGAATT GTATGGGAGA CCGTCCCTGT GAATTGGAGC TTCTCAGTGA TCCTACACAA GCTTTGGATT CTGGCTTTGG CACCGTCAAC TATGAAGATC CAAACTTGAA CATCTCTATT CCCGTCTTTG CTATCAGTGG TAACCATGAC GATGCTACGG GCGAAGGGTT GTTGCTGCCT CTTGACATTC TTCTGGTTTC TGGACTAGTG AACCATTTCG GTAAGATTCC AGATAGCGAG AACATCACCG TTTCGCCGTT GCTTTTCCAG AAGGGCCGAA CAAAACTAGC TCTTTACGGT ATGGCAAATG TTAGAGACGA AAGGCTCCAT CGAGCTTTTC GTGATGGACA TGTGAAATTC CAGAGACCCA ACATCCAGAC AGACCAATGG TTCAATTTGT TTTGTATTCA CCAGAACCAC GCCCAACATC TGATAACATC ATCAATTCCC GAAATGTATT TGCCCAACTT CTTAGACTTT GTACTTTGGG GCCACGAGCA CGAATGTATA GCTTACCCTG TTCATAATCC AGAGACTGGC TTTGATGTTC TTCAGGCTGG ATCCTCGGTA GCGACTTCGT TATCCGAAGG CGAAGTAGCT GATAAACACA CGTTTTTGTT GAGTATCCGC GACCAGAGAT ATTCTATAGA ACCAATAAAG TTGAATACAG TGAGACCATT TGTGTTAAAG GAAATTGTCT TGCTGCAGAC AGATCTCATA CTGGGAGCTG CCTCGAAGTC AGACGTAATA GCACTTCTAC TGCAAGAAGT GGAAAGCTCC ATAGTGAAAG CCAATGAAAA CTTCAAACAA AATAACGCCG AACTCTTCGA TGAAGACGAT ACAGAAGAAG ATGTAGCAAA AAAGATCCCC TTGCCTTTGA TAAGAATCAG AGTAGAATAT TCAGGAGGTT ACGAAATTGA GAACACAAGA AGGTTTTCCA ACCGGTTTGT AGGAAAAGTT GCTAATCCCA ATGATATCAT CCAATTCTAC AAAAAGAGAA CATCAGAAAC AGGACCCAAA AAAACGAAGT TTCTGGATAA GGATCTTCTT GAAGAAGGGG AGTCCAACAA GAAGTCTACT GAGATACAAC TACAAGATCT AATAGAGAAA TTCATCAGTG TGGCAGATTT GTCACTCTTG CCGGAGGCTG GGATGAACTA CGCTGTCAAG AGATATATCG ATAATGAAGA CAAGCATGTC CTACAAAATT ATATTGAAAA CGAAATCAAG AAAGAGACAG AGATGTTAAT GAAGATAGAC ATCGAGGACA CAAGCGTCTA TGATAAGGAT AACGACTATT CCAAAAAGAT ATTCAGACAG CTTTTGTCTC AAATCAAATT AGAGAATAAG AAGATTGACC ACGAGTCCAT GGACTTTGAC ATGGAGCCTA GCCTTTCGAC AAAGAAAGCA GCACCGAAGA GGAACACAAA AAAGACAAAA CGAAGCGAGG AAATCGTAGT TTCTGAAGAT GAAGATGACG ATTTTCAAGA AGAAATTGAA GAGAAACCTG CTGCACGAAA AAAAAGCGGA AGGAATACAA AAGCAAACAA GAAAGAAGAG ATTATCAGCG ACAACGACGA CGATATAATA GATCTGGATT TAGATGAAGG GCAACAACAA ACAACAAGAA GAGGTGGACT GGCGCCGCGT GCTAAGAGAG TATCTACCAG CAGTCGTGGC AGAGGAAGAA AGACATCTTC ATTGAAAGAC AGTGTGATGA ATTTGTAG
|
Protein sequence | MPPVDKIPEG EDTIRILLTT DNHVGYNETD PVRGDDGWKT FHEITRLAKQ LDVDMIVQGG DLFHINKPSK KSLFHVMKSL RLNCMGDRPC ELELLSDPTQ ALDSGFGTVN YEDPNLNISI PVFAISGNHD DATGEGLLSP LDILSVSGLV NHFGKIPDSE NITVSPLLFQ KGRTKLALYG MANVRDERLH RAFRDGHVKF QRPNIQTDQW FNLFCIHQNH AQHSITSSIP EMYLPNFLDF VLWGHEHECI AYPVHNPETG FDVLQAGSSV ATSLSEGEVA DKHTFLLSIR DQRYSIEPIK LNTVRPFVLK EIVLSQTDLI SGAASKSDVI ALLSQEVESS IVKANENFKQ NNAELFDEDD TEEDVAKKIP LPLIRIRVEY SGGYEIENTR RFSNRFVGKV ANPNDIIQFY KKRTSETGPK KTKFSDKDLL EEGESNKKST EIQLQDLIEK FISVADLSLL PEAGMNYAVK RYIDNEDKHV LQNYIENEIK KETEMLMKID IEDTSVYDKD NDYSKKIFRQ LLSQIKLENK KIDHESMDFD MEPSLSTKKA APKRNTKKTK RSEEIVVSED EDDDFQEEIE EKPAARKKSG RNTKANKKEE IISDNDDDII DSDLDEGQQQ TTRRGGSAPR AKRVSTSSRG RGRKTSSLKD SVMNL
|
| |