Gene Information |
Locus tag | PICST_68406 |
Symbol | |
ID | 4840512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 1078049 |
End bp | 1081148 |
Gene Length | 3100 bp |
Protein Length | 999 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391827 |
Product | predicted protein |
Protein accession | XP_001386399 |
Protein GI | 150866717 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0249] Mismatch repair ATPase (MutS family) |
TIGRFAM ID | |
|
|
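The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based inclusive coordinates (an assumption, though it reproduces the listed 3100 bp) and assumes the coding region is three nucleotides per residue plus one stop codon. The function names `span_length` and `coding_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1078049
END_BP = 1081148
GENE_LENGTH_BP = 3100
PROTEIN_LENGTH_AA = 999


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


length = span_length(START_BP, END_BP)
cds = coding_length(PROTEIN_LENGTH_AA)
flanking = length - cds  # nucleotides in the record outside the coding region

print(length, cds, flanking)  # 3100 3000 100
```

The 100 nt remainder agrees with the Sequence section, where the ATG start codon appears after 33 nucleotides and the TGA stop codon is followed by 67. The displayed sequence also reads in the coding direction even though the gene lies on the minus strand, which suggests it has already been reverse-complemented relative to the replicon.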
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.094593 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAAGAACCAA TCTTCTTCTC TATATCAGCA ACGATGTCCA GTACTCGTCC CGATCTCAGA TTCACCGATA CGGTAGACGA ACGGCTGTTC TACCGTAAAT ATGCTGGCCT TTCTACCAAA GATGCCTCTA CTATCAGATT CATTGACCAC AACAACAAGG ACTATTTCAC TGCCCTCGAC GAAGACGCTG ATCTTGTAGC AGAGAACATT TACAAAACAC AATCTGTACT CAAATATAAC AACAGCAACA AAAACAGATA CGTCACAATC CTGCCTCAGG TATTCTTGAA CAATGTGTTG AAGTTCTGTA TCATCGACAG ACATATGAAG GTGGAGATAT ATCACAACAA AACGTTCCAA TTGCTCAGCA CAGCAACGCC GGGTAACTTG GAAGCTCTTG CCAACGAGTA TGGTGTCAAT TTGGAAGGCA TGTTTCAGGA CTGTTCGACT CCCATGGTGG CTAGTATTAA GTTTCAACAA ACAGGAAGCG CTAGAAAAGT TGGGGTTTGC GTCATAGACA CACTGAACTC GACTATTCAA GTATCTGAAT TCGAAGACAA CGACCTCTTC TCTAACCTTG AAAGTTTATT GTTACAATTG GGTGTAAAAG AAGTCGTGTT GCCTTCGAAC TACAGTGCCA AAGATGAGAA TACCGAATCA ATAAAGCTCT TTCAAGTCTT GGATAAAATC GGTTACCTTG TCGTCAGCTC CGTTAAATCG TCATTCTTTA CAACTAAAGA TATTGAGCAG GATTTGCGCA AGTTGGTGCT GTCTGAAAAC CAAAAAGATG ATGATGATGT CAACGTCGAT TTGTTGTTGG CTTCCAAAGG AATCAATACG GCCGACTTCG CCCACTCGCT TGCTTGTTGC AACGCATTAA TTGCATACCT CCAGTTGTTG CTGGACGATG TACAGAATTC TTTCACTATA GAGCAGTACA ATTTGAGTTC ATACATGAAA TTGGACTCTT CTACAATGAA GGCTTTGAAT ATTTTCCCAT CTTCGAATTC TGGAGTTTCC AATGCTCTTG TTAAATCTTC CAATATCAGC TCGATCTTTG AGTTATTGAA CAAGTGTAGA ACTGCGGCAG GTTCTAGATT ACTTTCTCAA TGGTTGAAAC AGCCCCTCAC TAGCTTGTCC ATGATCGAGG AAAGGCTAGA TTTGGTGAAC TATCTCGTGG ACGGTACCAA CTTCAGAGTA TATGCCAACC AAGAGTTCTT ATCGCAGGTA CCTGATATAA GAAGACTTTT GAAAAAGATT AGCAATGGTT TGTCAAAGTC TACTGGCAAC GAAAATAAGA AGTTAGAGGA TATTGTCGTA TTGTATCAGT TAGTATTGGC TTTGCCTGCA TTCATTGACA TGAGTAAAAT GGTGATTGCT GATATCGAGG AAAAGGATTC ATTGCCGGTA GCAAATTTGA TAAAGAAGCA TTGGCTCGAG CCAGTCGAGA AGAGTCTTGA ATCCTTATCA AAATTCCAAG AGATGATCGA GACAACCATT GACTTATCGC CATTAGAATC AAGTTCTGCT TATGACCAAT TGCATTCTGA TTTCAATGTG AGACCAGAAT TTGACGAGTC CCTTATTGAA ATTAATGATA AATTACAAGC CAGTCTTGCT GAAATAAAAC AATTACATAT TGAAGTTGCT GACGACTTGA ATATGGAATT GGACAAGAAA TTGAAGTTAG AGAAACATAT ACAACACGGT TGGTGTTTTA GAGTTACCAG AAATGATTCT ACCGTCTTGA GAAATACCGG TAACAAATAT TCTCAGCTCC AGACTGTTAA GGCTGGTGTC 
TTCTTTACCA CCAAGAGATT AACTTTGCTA TCCCAGGAAT ATGCAGAGGC TCTTCAAGAA TACAATACCA AACAGCGCGA GTTAATTAAG GAGATATTGT CCATTTCTTT GTCATATCAA TCGGTTTTTA TGAACTTGTC ATTGACGCTT GCACATTTGG ATGTATTAGT CAGCTTTGCT AATGTGGCAA TAGTGGCACC AACCGTATTT GCAAGACCGA AGTTGCATCC ATTGAGCAAT GATATTGATT CGGACCAATT CAAGAATAGA AAAATCAAGC TAAGAGAAGC CAGACATCCT GTATTGGAAG TACAAGATGA CATTAATTTC ATTGCCAATG ATGTCTTTTT ATCAAACGAT GCATGTGACA AAGGGAAGCC TTTTGTTATC ATAACTGGTC CAAATATGGG TGGTAAGTCA ACATACATAA GACAGATTGG TGTTATTGCC TTGATGGCGC AAATTGGATC ATTCATCCCT GCTAATGAAG ACGATTTTCC AGAATTGCCC ATCTTTGATG CTATCTTATC AAGAGTGGGA GCTGGAGACT CCCAGCTTAA GGGTTTATCT ACTTTCATGA TCGAGATGTT GGAGACTTCG TCCATTTTGG CCACAGCAAC ACAAAACTCG TTGATTATCA TCGATGAGTT GGGAAGAGGT ACTTCTACTT ACGATGGTTT TGGATTAGCT TGGTCAATTC TGGAACACCT CATTAAAGAA AAAAGCTGTT TCACGTTATT TGCAACCCAT TTTCACGAAT TGACTCAATT ATCATCCAAA TATGAGGACA AAGTTGACAA CTTACATGTT GTTGCCCATG TAGAAAACAA AGATGAAAAT GACGATGACA TCACTTTGAT GTACCGTGTT GAACCAGGAG TATCCGACAA ATCGTTTGGT ATTCATGTTG CTGAATTGGT TAAGTTTCCA TCGAAGATTA TCAACATGGC GAAGAGAAAA GCTTCAGAGT TGCAAGATAT GAATGTTACA GAGGAAGACA AGTTTATCCA GAACAAAAAA ACGAAGTGTT CTGCCGAAGA GATTGACCGT GGAGTTGACA CCTTGAAGAC GATCTTGAAG AAGTGGAAAG ATGAATGCTA TGATCCTGAG ACATCCAAGA GTCGCTTTGA AAGTGGCGAG GCAGTCAACA AGCTCAAGCA ATTGGTCGAA GGTGAGTTTT CAGGGGTGGT TGCGAACGAT AAGTTCATAA ATGAAGTTCT TACGGCATTG TGAAGTGGTA TTGTGATATC TAGTGTAGAA ATTGAACTAT TATTATCATA TAAATAAACG AATGAAATGT
|
Protein sequence | MSSTRPDLRF TDTVDERSFY RKYAGLSTKD ASTIRFIDHN NKDYFTALDE DADLVAENIY KTQSVLKYNN SNKNRYVTIS PQVFLNNVLK FCIIDRHMKV EIYHNKTFQL LSTATPGNLE ALANEYGVNL EGMFQDCSTP MVASIKFQQT GSARKVGVCV IDTSNSTIQV SEFEDNDLFS NLESLLLQLG VKEVVLPSNY SAKDENTESI KLFQVLDKIG YLVVSSVKSS FFTTKDIEQD LRKLVSSENQ KDDDDVNVDL LLASKGINTA DFAHSLACCN ALIAYLQLLS DDVQNSFTIE QYNLSSYMKL DSSTMKALNI FPSSNSGVSN ALVKSSNISS IFELLNKCRT AAGSRLLSQW LKQPLTSLSM IEERLDLVNY LVDGTNFRVY ANQEFLSQVP DIRRLLKKIS NGLSKSTGNE NKKLEDIVVL YQLVLALPAF IDMSKMVIAD IEEKDSLPVA NLIKKHWLEP VEKSLESLSK FQEMIETTID LSPLESSSAY DQLHSDFNVR PEFDESLIEI NDKLQASLAE IKQLHIEVAD DLNMELDKKL KLEKHIQHGW CFRVTRNDST VLRNTGNKYS QLQTVKAGVF FTTKRLTLLS QEYAEALQEY NTKQRELIKE ILSISLSYQS VFMNLSLTLA HLDVLVSFAN VAIVAPTVFA RPKLHPLSND IDSDQFKNRK IKLREARHPV LEVQDDINFI ANDVFLSNDA CDKGKPFVII TGPNMGGKST YIRQIGVIAL MAQIGSFIPA NEDDFPELPI FDAILSRVGA GDSQLKGLST FMIEMLETSS ILATATQNSL IIIDELGRGT STYDGFGLAW SISEHLIKEK SCFTLFATHF HELTQLSSKY EDKVDNLHVV AHVENKDEND DDITLMYRVE PGVSDKSFGI HVAELVKFPS KIINMAKRKA SELQDMNVTE EDKFIQNKKT KCSAEEIDRG VDTLKTILKK WKDECYDPET SKSRFESGEA VNKLKQLVEG EFSGVVANDK FINEVLTAL
|
| |
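Translation table 12 (the alternative yeast nuclear code) differs from the standard code in reading CTG as serine rather than leucine, and the effect is visible in this record. The sketch below translates the first 20 codons of the gene sequence, starting at the ATG, under both tables. The helper names `build_table` and `translate` are illustrative, and the codon tables are built by hand here rather than taken from a library.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), with codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas: str) -> dict:
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 is the standard code with CTG reassigned to serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 60 coding nucleotides of the gene sequence, beginning at the ATG.
cds_start = "ATGTCCAGTACTCGTCCCGATCTCAGATTCACCGATACGGTAGACGAACGGCTGTTCTAC"

print(translate(cds_start, TABLE_12))  # MSSTRPDLRFTDTVDERSFY
print(translate(cds_start, TABLE_1))   # MSSTRPDLRFTDTVDERLFY
```

Only the table 12 result matches the start of the listed protein sequence (`MSSTRPDLRF TDTVDERSFY`). Codon 18 is CTG, which the standard code would read as leucine.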