Gene PICST_68450 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68450 
SymbolMUC1.10 
ID4840882 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp259464 
End bp260843 
Gene Length1380 bp 
Protein Length222 aa 
Translation table12 
GC content41% 
IMG OID640392197 
Productrepeated sequence with similarity to MUC1 
Protein accessionXP_001386448 
Protein GI150866751 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000016025 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000996507 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
AAACTGGCTC AAGCCAAACC AGTCAGTAGT TCTGCAACCT TGAAAGAGAA CTTCCAATTA 
ACAGACGATG ACTATCAGCA GTTTTGGAAA GCTATAGCTA AAGTACAACG TAGGTATCCA
AAAGGGGTGG AGGAAATTCA CAGGGTTCAC CTCGGTCACA GGGACCGACA TGGGTCCTCC
GAGGCTCTGC CAGAATGCCT GATGTGTTTC AATGAAATAC CAAGACCTCA GTTTTTTCAC
CATATTTACC AAGATTGTTC AATCAGCCGT ACATTATGGG ACATACTTCG TCCTACAGAA
ATCACATTAA ATTTAAAGAA TCTCATTTGC AATTCTTCAC TTTCACAGGC TGCATATCTC
TCGTGGAATC AATACCTTTT AGCTGTTCAT CTTTTTAGAT GCCGACGGCG AAATCCAATG
GACAGGGACA TTTTCACCTG TTCTATGCTT TCATCTTGGG TATCGATTTT GCAGTCTAGA
CGTCTTATTT AAACATTTTA CAAGTCCGGT AGCTTATACC GTTGTCAAGT TTTTTACAGT
TTTCTAAGGA TATCACCTAG TTTTTAGGTC ATATCACCCG GAGGTTTTTA TCCTAGGATT
TCCTTCACCA TTCGGTATGG ATTAGAGGAA GATTTATATA CTTTGTAGTT GATAGGTTGG
ACTTTAGTCG TAATACATGC CATTTCGTAA AAAAAAAAAA GGGCCTGTCC AATGTCAGAA
CTACGACCAA TCAATCAGCC AGAGGGACCT CTAGGGGTTT TCTCTGGTGA AAGCTCTTCA
AATCAAGCCT TAAACGAGCC AAAACCACCG GATATTGACC GAAATCCACA CGACCATGTC
GCACCCCATG ATCTGATGGA CCTCGATGGC GAGTCCACAG ATGCCGATGA AAACGGCAAT
TTAGAGCCAT CCTTCGTGAC TGCAGTGTCA TCTACCTCTG AGGAACCGGC CCAACTACGG
GATAATCCCA CGGACGAGCT GGTTATTCAG AACCAGACCC CATTAAGTCC TACTATGGAA
AGTTTTGAAA CCTCAGTTTC GGAAATTTTT CAAAGTCACG TGCTTGACGA GGTACACGAC
CAGGAAATGG CTGCAAACGA TGATAATGAA ATGGATCACA TTTCTTTAAA TTCTACAACC
AGCCTCTCGA ACTTGCCCGA ACAAGAAGAT TCTCTCCTTG TACACCAACT TAAAGAAAAT
ACAAAAACCC AAAAAAATTC TGAATCCTTA CAATACTTAA ATAAAAACCA AAAAAACTCC
GAAAATACAA AACTCCCAGA AAAAAATACA CAAGACCTTG AAGCAGAGGA ATACCCACTC
TTAGGAACCT CAAAGAGTTC TTCTAAGAAA AGGATTCATC CCTCAAACCC CAACTATTAA
 
Protein sequence
MSELRPINQP EGPLGVFSGE SSSNQALNEP KPPDIDRNPH DHVAPHDSMD LDGESTDADE 
NGNLEPSFVT AVSSTSEEPA QLRDNPTDES VIQNQTPLSP TMESFETSVS EIFQSHVLDE
VHDQEMAAND DNEMDHISLN STTSLSNLPE QEDSLLVHQL KENTKTQKNS ESLQYLNKNQ
KNSENTKLPE KNTQDLEAEE YPLLGTSKSS SKKRIHPSNP NY