Gene PICST_32772 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32772 
SymbolMUC1 
ID4839777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp610125 
End bp612215 
Gene Length2091 bp 
Protein Length696 aa 
Translation table12 
GC content46% 
IMG OID640391092 
Producthypothetical protein 
Protein accessionXP_001385465 
Protein GI150866009 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.635098 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATCTAT TCAGCGGAAA CAGACACAAA CAGCTTTCGC CGGGTTCGCA GACTCCAGAG 
ACGGCAGTCT CGGGCTCTCC TTCAGTTACT AATCCAAACT CAACTATGGG CAGAGGAAGT
CCTTCTTCAG TACACCAGCC GGCTTTCACG CCAGTGACTC GGGTTTATAG CAGCTCGAGA
ATCACTGTGC CCAGCACGGC CAATCAGAGT TCGTCAGGGT CTAGCACTCC TCCTTCAATC
AGAATCTCGG ATGAACTGGC ATCACCAGTA GCACGCATTG AAAGTCCACC AAGGAAACCA
CAGAAGAGTG TTAAGAGCAA GAAAGTTTCA TCCTCTTCAT CTCCGTCTCC CTCTTCACCT
TCAAAAACTC CAAAGCGCAA AAATCAAAAA GCACGAACTT CTTCTTCGTC ATCCCCTACA
AAGAACAAGA AGGCGCCATC GCAGCCGGTT CAGACTGCGG AAACCCAAGA GGAATCGTCT
TCTTCACCAG CCAGACGTAG ACGACCTCCT CCACTTCCGC CCATAGATCC GGTCGAAGCC
CTGCACTCGG TGTTAAACGA TAACATAGCC GAACGTTGCC GCAATTCAAT CGATGAGCAA
CTCGAAAGAA GCACAATCTT AGCAGCAGAT ATCAGTCGTA ACCGTAGAAA GTCGAGAATC
ATCCGTCTGG AACAGGATGA AACTCCATCG TCTAGTCGGG TCACACCAGA AGTTGAAACA
GTTCCGGTTC ATGAAGTGTT ATCCCCTATA CCAAATGCTA GCATTCCTGT AGAAAGTGTT
GCAGGACCAG CCAGAATGGA ACAGCTTCTC CCATCTCCAG TCGACAGTTC TGTAACTTCT
GAGTATAGAA CTCTAGACCC TGCTGTTTTT CCTTCTTTGT CGACAACTCC TGTAGATTTA
CCTCCTACTG TTACTGAGTC ATCTACTCCG GAAGTACAGG TGACTGAGAT AACGGCTCCG
ATGTACATCC ATGCTGAAAC AGCAACAACT ACCACGACTG AACTGTTGGA AAACTTGCCA
TCTACGCAAG AGTCGAGCTC TAGAACCGAC GGCGATCGTA GAACAAGAAG AAATGACAGT
GATCAAAATA GACGAGAAAG AAGAGTCAGG ACAGATAGAA GTCGTCAGAC TAGTACTACA
GCTAATATGC CAGTAACTAC TAGTCAACGA ACTGGCTCTA ATCAAAGGGC TTCCAATCAA
AGAACTCGTG AAAGAAGACT TAGGAGAAGA AGAAGAAGAA CCTCTAGTGA TGTGGAAAGC
ATCCCTTCAT TACCTCCTCC TCAGGATGAG TACACAGATA TACATCCATA TTTACATGAA
GACTCACCTC CATATCGTGA AAAGCTTCAC AAGTATGTCG ACTTGATCTC ATTCGATCCC
AGATATACAA TTCCTGTGTT GCGTTTCAAC AAGGCATTGA TCAACAACGA CAAGAAGCTT
CAAAAGTGTA TTAAGAAATT GAGCCAATTC AACCTCTTAC CTCTGGAGAT AAATCTTTGG
GATGTTGACC AAGTAGATGA CCGAGAATTC TATGCGGTTC TCCCGTCGTT GTTGCCCGAA
CTTGACGGTA ATGGTGGTCC CCATGTACTT GGCCAGGTTC TTGAAATGGA GAGAAGGGAA
CAGGAAAATA CCGAGCTCCA GATTGCGATG GAATTATCGT TGCATGCGGA AAATCAGGTT
GTACCTCAAG CCATCCCAGC AAGCGAAGAA GATAGTGGAG ACGAATCTAC TTACTTCTAC
GATGCGTTTG AAACACAGGC TTCATTCTCC AGGTCCTCAG ATTTATTCCG TGGTTTCCGT
TATCAGGAAG CTACTAATTA CGACGCTGCA TCAGTCAGCG ATACCAACAG TTTCAACGAT
GCACTCAGCA GTAACCTCAG CAACAGCAAT ATCAACTTAG TTAATGACCC CAATCAAGTC
AATCTCAGCA ACAGTCGAGA TCAAATGGTC TCGCCTGCTG GAAGTAATAA TCCCATTGAC
GCAGGCCGGC TCAATATCCG CTTGAACAGC CCCTTGTATC ACGTATTCCG GTATAATGAA
CCTGGTATTT CGTCTTCCTC TAGTCATCAT GGTCGTTTAG TGGACGTCTA G
 
Protein sequence
MHLFSGNRHK QLSPGSQTPE TAVSGSPSVT NPNSTMGRGS PSSVHQPAFT PVTRVYSSSR 
ITVPSTANQS SSGSSTPPSI RISDESASPV ARIESPPRKP QKSVKSKKVS SSSSPSPSSP
SKTPKRKNQK ARTSSSSSPT KNKKAPSQPV QTAETQEESS SSPARRRRPP PLPPIDPVEA
SHSVLNDNIA ERCRNSIDEQ LERSTILAAD ISRNRRKSRI IRSEQDETPS SSRVTPEVET
VPVHEVLSPI PNASIPVESV AGPARMEQLL PSPVDSSVTS EYRTLDPAVF PSLSTTPVDL
PPTVTESSTP EVQVTEITAP MYIHAETATT TTTESLENLP STQESSSRTD GDRRTRRNDS
DQNRRERRVR TDRSRQTSTT ANMPVTTSQR TGSNQRASNQ RTRERRLRRR RRRTSSDVES
IPSLPPPQDE YTDIHPYLHE DSPPYREKLH KYVDLISFDP RYTIPVLRFN KALINNDKKL
QKCIKKLSQF NLLPSEINLW DVDQVDDREF YAVLPSLLPE LDGNGGPHVL GQVLEMERRE
QENTELQIAM ELSLHAENQV VPQAIPASEE DSGDESTYFY DAFETQASFS RSSDLFRGFR
YQEATNYDAA SVSDTNSFND ALSSNLSNSN INLVNDPNQV NLSNSRDQMV SPAGSNNPID
AGRLNIRLNS PLYHVFRYNE PGISSSSSHH GRLVDV