Gene Information |
Locus tag | PICST_68450 |
Symbol | MUC1.10 |
ID | 4840882 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 259464 |
End bp | 260843 |
Gene Length | 1380 bp |
Protein Length | 222 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640392197 |
Product | repeated sequence with similarity to MUC1 |
Protein accession | XP_001386448 |
Protein GI | 150866751 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
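The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal sketch of that check, using only values copied from this record. The function names are hypothetical, and the assumption that Start bp and End bp are 1-based, inclusive coordinates is ours, not stated by the record.

```python
# Values copied from the Gene Information table above.
START_BP = 259464
END_BP = 260843
GENE_LENGTH_BP = 1380
PROTEIN_LENGTH_AA = 222


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_length_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length_aa + 1) * 3


print(span_length(START_BP, END_BP))     # 1380, equal to the listed Gene Length
print(coding_length(PROTEIN_LENGTH_AA))  # 669
```

Under that assumption the span reproduces the listed gene length of 1380 bp. A 222 aa protein with a stop codon needs 669 nucleotides, so the translated region accounts for only part of the listed gene sequence.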
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.0000016025 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 2 |
Fosmid unclonability p-value | 0.00000996507 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | AAACTGGCTC AAGCCAAACC AGTCAGTAGT TCTGCAACCT TGAAAGAGAA CTTCCAATTA ACAGACGATG ACTATCAGCA GTTTTGGAAA GCTATAGCTA AAGTACAACG TAGGTATCCA AAAGGGGTGG AGGAAATTCA CAGGGTTCAC CTCGGTCACA GGGACCGACA TGGGTCCTCC GAGGCTCTGC CAGAATGCCT GATGTGTTTC AATGAAATAC CAAGACCTCA GTTTTTTCAC CATATTTACC AAGATTGTTC AATCAGCCGT ACATTATGGG ACATACTTCG TCCTACAGAA ATCACATTAA ATTTAAAGAA TCTCATTTGC AATTCTTCAC TTTCACAGGC TGCATATCTC TCGTGGAATC AATACCTTTT AGCTGTTCAT CTTTTTAGAT GCCGACGGCG AAATCCAATG GACAGGGACA TTTTCACCTG TTCTATGCTT TCATCTTGGG TATCGATTTT GCAGTCTAGA CGTCTTATTT AAACATTTTA CAAGTCCGGT AGCTTATACC GTTGTCAAGT TTTTTACAGT TTTCTAAGGA TATCACCTAG TTTTTAGGTC ATATCACCCG GAGGTTTTTA TCCTAGGATT TCCTTCACCA TTCGGTATGG ATTAGAGGAA GATTTATATA CTTTGTAGTT GATAGGTTGG ACTTTAGTCG TAATACATGC CATTTCGTAA AAAAAAAAAA GGGCCTGTCC AATGTCAGAA CTACGACCAA TCAATCAGCC AGAGGGACCT CTAGGGGTTT TCTCTGGTGA AAGCTCTTCA AATCAAGCCT TAAACGAGCC AAAACCACCG GATATTGACC GAAATCCACA CGACCATGTC GCACCCCATG ATCTGATGGA CCTCGATGGC GAGTCCACAG ATGCCGATGA AAACGGCAAT TTAGAGCCAT CCTTCGTGAC TGCAGTGTCA TCTACCTCTG AGGAACCGGC CCAACTACGG GATAATCCCA CGGACGAGCT GGTTATTCAG AACCAGACCC CATTAAGTCC TACTATGGAA AGTTTTGAAA CCTCAGTTTC GGAAATTTTT CAAAGTCACG TGCTTGACGA GGTACACGAC CAGGAAATGG CTGCAAACGA TGATAATGAA ATGGATCACA TTTCTTTAAA TTCTACAACC AGCCTCTCGA ACTTGCCCGA ACAAGAAGAT TCTCTCCTTG TACACCAACT TAAAGAAAAT ACAAAAACCC AAAAAAATTC TGAATCCTTA CAATACTTAA ATAAAAACCA AAAAAACTCC GAAAATACAA AACTCCCAGA AAAAAATACA CAAGACCTTG AAGCAGAGGA ATACCCACTC TTAGGAACCT CAAAGAGTTC TTCTAAGAAA AGGATTCATC CCTCAAACCC CAACTATTAA
|
Protein sequence | MSELRPINQP EGPLGVFSGE SSSNQALNEP KPPDIDRNPH DHVAPHDSMD LDGESTDADE NGNLEPSFVT AVSSTSEEPA QLRDNPTDES VIQNQTPLSP TMESFETSVS EIFQSHVLDE VHDQEMAAND DNEMDHISLN STTSLSNLPE QEDSLLVHQL KENTKTQKNS ESLQYLNKNQ KNSENTKLPE KNTQDLEAEE YPLLGTSKSS SKKRIHPSNP NY
|
| |
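The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows how a fragment of the gene sequence above maps onto the protein sequence under that table. It is an illustration only: the helper name `translate_table12` is hypothetical, the codon table is built by hand rather than taken from any library, and the reading frame of the fragment is assumed from its match to the listed protein.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 12, with codons ordered
# TTT, TTC, TTA, TTG, TCT, ... (first base varies slowest).
# It differs from the standard code only at CTG (S instead of L).
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

CODON_TABLE_12 = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_table12(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE_12[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Fragment copied from the gene sequence above (contains one CTG codon).
fragment = "GCACCCCATG ATCTGATGGA CCTCGATGGC"
print(translate_table12(fragment))  # APHDSMDLDG
```

The output matches the stretch `APHDSMD LDG` in the listed protein sequence. With the standard genetic code the CTG codon would give leucine at the fifth position instead of the serine the record shows.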