Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32772 |
Symbol | MUC1 |
ID | 4839777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 610125 |
End bp | 612215 |
Gene Length | 2091 bp |
Protein Length | 696 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640391092 |
Product | hypothetical protein |
Protein accession | XP_001385465 |
Protein GI | 150866009 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.635098 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCATCTAT TCAGCGGAAA CAGACACAAA CAGCTTTCGC CGGGTTCGCA GACTCCAGAG ACGGCAGTCT CGGGCTCTCC TTCAGTTACT AATCCAAACT CAACTATGGG CAGAGGAAGT CCTTCTTCAG TACACCAGCC GGCTTTCACG CCAGTGACTC GGGTTTATAG CAGCTCGAGA ATCACTGTGC CCAGCACGGC CAATCAGAGT TCGTCAGGGT CTAGCACTCC TCCTTCAATC AGAATCTCGG ATGAACTGGC ATCACCAGTA GCACGCATTG AAAGTCCACC AAGGAAACCA CAGAAGAGTG TTAAGAGCAA GAAAGTTTCA TCCTCTTCAT CTCCGTCTCC CTCTTCACCT TCAAAAACTC CAAAGCGCAA AAATCAAAAA GCACGAACTT CTTCTTCGTC ATCCCCTACA AAGAACAAGA AGGCGCCATC GCAGCCGGTT CAGACTGCGG AAACCCAAGA GGAATCGTCT TCTTCACCAG CCAGACGTAG ACGACCTCCT CCACTTCCGC CCATAGATCC GGTCGAAGCC CTGCACTCGG TGTTAAACGA TAACATAGCC GAACGTTGCC GCAATTCAAT CGATGAGCAA CTCGAAAGAA GCACAATCTT AGCAGCAGAT ATCAGTCGTA ACCGTAGAAA GTCGAGAATC ATCCGTCTGG AACAGGATGA AACTCCATCG TCTAGTCGGG TCACACCAGA AGTTGAAACA GTTCCGGTTC ATGAAGTGTT ATCCCCTATA CCAAATGCTA GCATTCCTGT AGAAAGTGTT GCAGGACCAG CCAGAATGGA ACAGCTTCTC CCATCTCCAG TCGACAGTTC TGTAACTTCT GAGTATAGAA CTCTAGACCC TGCTGTTTTT CCTTCTTTGT CGACAACTCC TGTAGATTTA CCTCCTACTG TTACTGAGTC ATCTACTCCG GAAGTACAGG TGACTGAGAT AACGGCTCCG ATGTACATCC ATGCTGAAAC AGCAACAACT ACCACGACTG AACTGTTGGA AAACTTGCCA TCTACGCAAG AGTCGAGCTC TAGAACCGAC GGCGATCGTA GAACAAGAAG AAATGACAGT GATCAAAATA GACGAGAAAG AAGAGTCAGG ACAGATAGAA GTCGTCAGAC TAGTACTACA GCTAATATGC CAGTAACTAC TAGTCAACGA ACTGGCTCTA ATCAAAGGGC TTCCAATCAA AGAACTCGTG AAAGAAGACT TAGGAGAAGA AGAAGAAGAA CCTCTAGTGA TGTGGAAAGC ATCCCTTCAT TACCTCCTCC TCAGGATGAG TACACAGATA TACATCCATA TTTACATGAA GACTCACCTC CATATCGTGA AAAGCTTCAC AAGTATGTCG ACTTGATCTC ATTCGATCCC AGATATACAA TTCCTGTGTT GCGTTTCAAC AAGGCATTGA TCAACAACGA CAAGAAGCTT CAAAAGTGTA TTAAGAAATT GAGCCAATTC AACCTCTTAC CTCTGGAGAT AAATCTTTGG GATGTTGACC AAGTAGATGA CCGAGAATTC TATGCGGTTC TCCCGTCGTT GTTGCCCGAA CTTGACGGTA ATGGTGGTCC CCATGTACTT GGCCAGGTTC TTGAAATGGA GAGAAGGGAA CAGGAAAATA CCGAGCTCCA GATTGCGATG GAATTATCGT TGCATGCGGA AAATCAGGTT GTACCTCAAG CCATCCCAGC AAGCGAAGAA GATAGTGGAG ACGAATCTAC TTACTTCTAC GATGCGTTTG AAACACAGGC TTCATTCTCC AGGTCCTCAG ATTTATTCCG TGGTTTCCGT TATCAGGAAG CTACTAATTA CGACGCTGCA TCAGTCAGCG ATACCAACAG TTTCAACGAT GCACTCAGCA GTAACCTCAG CAACAGCAAT ATCAACTTAG TTAATGACCC CAATCAAGTC AATCTCAGCA ACAGTCGAGA TCAAATGGTC TCGCCTGCTG GAAGTAATAA TCCCATTGAC GCAGGCCGGC TCAATATCCG CTTGAACAGC CCCTTGTATC ACGTATTCCG GTATAATGAA CCTGGTATTT CGTCTTCCTC TAGTCATCAT GGTCGTTTAG TGGACGTCTA G
|
Protein sequence | MHLFSGNRHK QLSPGSQTPE TAVSGSPSVT NPNSTMGRGS PSSVHQPAFT PVTRVYSSSR ITVPSTANQS SSGSSTPPSI RISDESASPV ARIESPPRKP QKSVKSKKVS SSSSPSPSSP SKTPKRKNQK ARTSSSSSPT KNKKAPSQPV QTAETQEESS SSPARRRRPP PLPPIDPVEA SHSVLNDNIA ERCRNSIDEQ LERSTILAAD ISRNRRKSRI IRSEQDETPS SSRVTPEVET VPVHEVLSPI PNASIPVESV AGPARMEQLL PSPVDSSVTS EYRTLDPAVF PSLSTTPVDL PPTVTESSTP EVQVTEITAP MYIHAETATT TTTESLENLP STQESSSRTD GDRRTRRNDS DQNRRERRVR TDRSRQTSTT ANMPVTTSQR TGSNQRASNQ RTRERRLRRR RRRTSSDVES IPSLPPPQDE YTDIHPYLHE DSPPYREKLH KYVDLISFDP RYTIPVLRFN KALINNDKKL QKCIKKLSQF NLLPSEINLW DVDQVDDREF YAVLPSLLPE LDGNGGPHVL GQVLEMERRE QENTELQIAM ELSLHAENQV VPQAIPASEE DSGDESTYFY DAFETQASFS RSSDLFRGFR YQEATNYDAA SVSDTNSFND ALSSNLSNSN INLVNDPNQV NLSNSRDQMV SPAGSNNPID AGRLNIRLNS PLYHVFRYNE PGISSSSSHH GRLVDV
|
| |