Gene PICST_47485 details


Gene Information

Locus tag: PICST_47485
Symbol: MSI1
ID: 4839664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1255271
End bp: 1256560
Gene Length: 1290 bp
Protein Length: 429 aa
Translation table: 12
GC content: 41%
IMG OID: 640390979
Product: chromatin assembly complex, subunit 3
Protein accession: XP_001385247
Protein GI: 150865861
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
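
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length(1255271, 1256560)  # Start bp / End bp from the record
    print(length, protein_length(length))
```

With the values from this record, the result agrees with the listed Gene Length (1290 bp) and Protein Length (429 aa).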


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.884795
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.485977
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTCCAG AAGATAGCGA ACAGACTTAT ATTGACGATT TTACCCAGAG AAATTACAGA 
ATATGGAAGA AGAACACACC TTTTCTCTAC GACTACCTTC TGACAAATTC ACTCTTGTGG
CCGTCGCTAA CAGTTCAGTT CTTTCCTGAT CGGACTGATG GACAAATAGA AAGCGGAACT
TCTAAAACTT CATCTGAAGA TTCTGATAAT ATATATTTCC AAAGACTACT TCATGGTACA
TTTAGTTTGG GCCTGTCAGT GGATAGTATC CAGATTCTCC AGGTTCCTGT TTTTGCTGAC
TTGAATCGCA ATCTCCGTAT TGACCGACTT GATTTCAATC TGGAAAAGCA GGAATTCGAA
TTAGCTACTT CTGTCAACAA TAAATTCAAG GTGCTTCAAA AGATAAACCA TATGGGAGAT
GTTAACAAGG TGAGGTATAT GCCCCAGAAA CCAAACATCA TCGCCAGCGC CAACAATATG
GGCGACTTGG CGATATACGA AAGAACAAAA CACAAAAGCT TCAAGAACTC GCTCATAGAC
GATACCGACC TAAATAAGGT CCAGGTATAT CTCAAGAACA GTAACTCCGC AGACGTAGAA
GGTACCGATA TCTTTGCTAT CGATTGGAAC AAACAAAAGG AAGGTACTAT TGTATCAGCC
AGTATGAACG GCGAGATAAA TCTATATGAC ATTCGAAGCA ATTTTGTAAA GGATAAGTCT
GTTGTTAATG AATCCTGGTA CTACCACAAT GAGAGCAGTA CAGGTGTCAA CGATATCGAA
TGGCTCCCTC AACATGACTC CCTATTTAGT GCTGTAGATG ATGCCGGTTT CATTTCTTTG
TTTGACACGA GAGAAGAAAG CAAACTAGTT CACCGTTACA GACTGTCTGA AGTTGGAGTT
AACAGTATCA GTGTCAACCC TGGAATTTCT CATTGCATAG CTACTGGTGA TAGCAACGGT
CTGATCCACG TCTACGATAT AAGAGGTATT GGAAGCGAAA TGAACCCTAT CTACTCGATT
CAAGAACAAA CTGAATCTAT CACACAGCTT AAATGGCATC CACGGTACCA TAATGTGTTG
GGTTCGTCTT CCACAGATCA TCTGGTAAAA TTGTTTGATT TGGAAAACTC TAGTTCTCTT
TTGTTTGCAC ATGCTGGCCA TATGTTAGGA GTAAACGACT TTGACTGGTC TCACCATGAT
GACTGGATGG TAGCCAGTGT TTCTGATGAT AACTCCTTGC ATGTATGGAA ACCATCGCAC
ACGATCACAA GAAAGTATAA CAGTAGATAA
 
Protein sequence
MSPEDSEQTY IDDFTQRNYR IWKKNTPFLY DYLSTNSLLW PSLTVQFFPD RTDGQIESGT 
SKTSSEDSDN IYFQRLLHGT FSLGSSVDSI QILQVPVFAD LNRNLRIDRL DFNSEKQEFE
LATSVNNKFK VLQKINHMGD VNKVRYMPQK PNIIASANNM GDLAIYERTK HKSFKNSLID
DTDLNKVQVY LKNSNSADVE GTDIFAIDWN KQKEGTIVSA SMNGEINLYD IRSNFVKDKS
VVNESWYYHN ESSTGVNDIE WLPQHDSLFS AVDDAGFISL FDTREESKLV HRYRSSEVGV
NSISVNPGIS HCIATGDSNG SIHVYDIRGI GSEMNPIYSI QEQTESITQL KWHPRYHNVL
GSSSTDHSVK LFDLENSSSL LFAHAGHMLG VNDFDWSHHD DWMVASVSDD NSLHVWKPSH
TITRKYNSR
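
The protein sequence follows from the gene sequence under translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. The following is a minimal sketch of that translation, not the database's own pipeline; the names `CODON_TABLE_12` and `translate_table12` are illustrative, and the function assumes an unambiguous A/C/G/T sequence that starts in frame.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), STANDARD_AA)
}
CODON_TABLE_12["CTG"] = "S"  # table 12: CTG encodes Ser instead of Leu


def translate_table12(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE_12[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons of the gene sequence above.
    print(translate_table12("ATGTCTCCAG AAGATAGCGA ACAGACTTAT"))
```

The CTG reassignment is visible in this record: the block `GACTACCTTC TGACAAATTC ACTCTTGTGG` contains a CTG codon and corresponds to `DYLSTNSLLW` in the protein sequence, where the standard code would give leucine at the fourth position.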