Gene PICST_30195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_30195 
SymbolMUC1.3 
ID4837107 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp2202147 
End bp2204585 
Gene Length2439 bp 
Protein Length812 aa 
Translation table12 
GC content49% 
IMG OID640388422 
Producthypothetical protein, likely cell wall localized and GPI-anchored mucin like 
Protein accessionXP_001382648 
Protein GI150863986 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.322783 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGTATC TTCTTCCATT GGTCCAATTG ACTTGGATGT TGGCTGGAGC TTCTTGCTCA 
ACCTTCTATT CTAACTCCTC TTCTTCAGAA TTTCCAACTA TTTCTGTAAG TTCAGTATCT
TCATTATTGT CATCGTCTGC CGAGTTGAGT TCGTACGTAA ACTCGCTGTC CATCATCAGT
GAATCCTCCA CCGGAGAAAC TAGTATTTTG AGTTCAATTG CATCGTCTAC TGAAGAATCC
ACTATCGAGA CTTCTAACGG ACCATCTACT GAAGAGTCCA CTGTTGAAAC TTCAGTTGGA
CCAACCTCAG AAACTTCCCA GACTGGCGCT GAGTCGTCTC CAGTTGTCTC TACTTCTGAA
TCTACTATTA CCAGTTCTCC AAGTGGTACT CCTATTCCAC CATTAACTGC TGACACTCCT
TTCATTTTGA CTGCTTTGAT CGATGGTATT GCTGCTATCG TTGAAGCCCT TTTGGTTGAC
ATCGAAAATG GCACTTTGCG TAAGAGACTT GAGACCTACA TTTTGGGTTT CAACCTTGGC
GGTGCCGATG TTGTCTTCAA CCTTGACCCA GATACCAGTT TCCTTTCTTC AAACAATGGT
TTGTACGTTC AAGCTCCTGA TCCACAATTG GGTGTTTACC TTGGTGCATC CCCTGTTGCC
GGATGGGGCT ACACTCCAGA CGGACAATTG AGTTTCAATG GAGTTTCTGC TTTCTTCAGT
TGTCCATATG GTGACAACGG TGGTTCCATC CTTACTACAG TGGACACCGG AAACTGTAGT
GGTTTGCAAT TGGGTATCGT TGTTCAAGAG AATCCTACCT CTTCTTCTTC TGAAAGCGAA
ACTTCTGGAA CATCAAGTGA GGCATCTTCG GGTGCTGAGT CTTCTGGAGC TTCTACCGCT
GAGACTTCTG GTGCTTCTAG TGCTGAAACC TCTGGTGCTT CTAGTGCTGA AACCTCTGGT
GCTTCTAGTG CTGTTACTTC TGGTGCTTCT AGTGCTGTAA CCTCTGGTGC CTCCAGTGCT
GTAACCTCCG GTGCCTCCAG TGCTGTGACT TCTGGTGCCT CCAGTGCTGT AACCTCTGGT
GCTTCTAGTG CTGTAACCTC TGGTGCTTCT AGTGCAGTGA CTTCTGGTGC TTCTAGTGCT
GTGACTTCTG GTGCTTCTAG TGCTGTAACC TCTGGTGCTA GTTCTCTCCC ATCTGGTATC
ACATCTGCTG GTTCATCTGG CGTTACCTCT GGTGCTTCAT CTGCACCAAC CTCTGGTACT
CCTTCTTCTG CTGAGAGTTC AGGTGCCTCC AAGACTACTT CTTATACCTT CCAATTGACT
GCCATTGCTG CACCTCCAAA CGATTTCAGC GAATTGCTTC TCGACAATGG TGCCAGATTG
ATCTTGGATC CTACCAATGG TGCCGAATTC GAATTATCCC TTCCAGGTGG TTTCCTTAAA
GTAGGTGACT TGTTCGTCCA CGCTGACGCT TCTGGTTTCT ACCTTAGTTC TGAAACCTTC
GGTGGTTTCT CCTTCATTGG CGACCCTCTT TTGGAATTGA ACGGTCAATC TGACTTCTAC
ATTTGCCCAA CTTTGGATGT TCCATTGCAA CTTACCAAGG TTGACCCAAG CTGTATGCTC
GGCTCCTTGT CGTTAGTCTT GGAACCAGCT ACTTCCTCCA CTGGTTCCAC TGGTACTTCC
ACTGGTTCCA CTGGTGCTAC TACTGGTGCT ACTACTGGTG CTACTACTGG TGCTACTACT
GGCGCAGCTG GTACTTCCAC TGGTGGTGCT GGTGCCTCTA CTGGTGCATC ACTGTTGACT
ACCGCCACTA TCACCACCAA CACTGTTGTT ACTATCACTG AATGCCCAAG CACTGTTACT
AACTGTCCTC TCAACTACCG TAAGACTGTC ACTATTCCAA AGACTATTGT AACTACTTAC
TGTCCACTTA CTGAAGGCAC CTACGTTGTT GAATCTACCT CGTTGGAAGT TATCACTATT
ACCAAATGTC CTGCTACAGT TACTAACTGT CCTCTTAATT CTATTCAAAC ATATACTGTT
GCTCACACCA TTAAGTCTAC CGGTGTTACT CCAATTGCTC CTGGTGCCAC TGCTGCTCCA
GGCGCTCCAG GCGCTGCTGC TCCAGGTGCT GCTGCTCCAG GTGCTGCTGC TCCAGGTGCT
GCTGCTCCAG GTGCTCCAGG TGCTCCAGGT GCTCCAGGTG CTCCAGGTGC TGCTGCTACC
CCAGGTGCTC CAGGTGCTGC TGCTACCCCA GGTGCTCCAG GTGCTCCAGG TGCTGCTGTT
ACCCCAGGTG CTCCAGGTGC TCCAGGTGCT ACTGTTGCTG AACAAGCTAC TGGTGCTGCT
ACCGCTTCCT TCTCTGTGTT CACTTACGAA GCTGCTGCTG GTAAGGCCAC TGGAACTATC
GGTACCTTCA TGTTAATGTT GGTTGCCCTC CTCATGTAA
 
Protein sequence
MKYLLPLVQL TWMLAGASCS TFYSNSSSSE FPTISVSSVS SLLSSSAELS SYVNSSSIIS 
ESSTGETSIL SSIASSTEES TIETSNGPST EESTVETSVG PTSETSQTGA ESSPVVSTSE
STITSSPSGT PIPPLTADTP FILTALIDGI AAIVEALLVD IENGTLRKRL ETYILGFNLG
GADVVFNLDP DTSFLSSNNG LYVQAPDPQL GVYLGASPVA GWGYTPDGQL SFNGVSAFFS
CPYGDNGGSI LTTVDTGNCS GLQLGIVVQE NPTSSSSESE TSGTSSEASS GAESSGASTA
ETSGASSAET SGASSAETSG ASSAVTSGAS SAVTSGASSA VTSGASSAVT SGASSAVTSG
ASSAVTSGAS SAVTSGASSA VTSGASSAVT SGASSLPSGI TSAGSSGVTS GASSAPTSGT
PSSAESSGAS KTTSYTFQLT AIAAPPNDFS ELLLDNGARL ILDPTNGAEF ELSLPGGFLK
VGDLFVHADA SGFYLSSETF GGFSFIGDPL LELNGQSDFY ICPTLDVPLQ LTKVDPSCML
GSLSLVLEPA TSSTGSTGTS TGSTGATTGA TTGATTGATT GAAGTSTGGA GASTGASSLT
TATITTNTVV TITECPSTVT NCPLNYRKTV TIPKTIVTTY CPLTEGTYVV ESTSLEVITI
TKCPATVTNC PLNSIQTYTV AHTIKSTGVT PIAPGATAAP GAPGAAAPGA AAPGAAAPGA
AAPGAPGAPG APGAPGAAAT PGAPGAAATP GAPGAPGAAV TPGAPGAPGA TVAEQATGAA
TASFSVFTYE AAAGKATGTI GTFMLMLVAL LM