Gene PICST_30650 details

Gene Information

Locus tag: PICST_30650
Symbol: MAK32
ID: 4837886
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 594225
End bp: 595286
Gene Length: 1062 bp
Protein Length: 353 aa
Translation table: 12
GC content: 42%
IMG OID: 640389201
Product: Protein necessary for structural stability of L-A double-stranded RNA-containing particles
Protein accession: XP_001383747
Protein GI: 150864776
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0524] Sugar kinases, ribokinase family
TIGRFAM ID:
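
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record itself. The function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive 1-based coordinates and a single terminal stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive range, hence the +1
    codons, remainder = divmod(span, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return span, codons - 1               # the stop codon encodes no residue


# Values copied from the record: Start bp 594225, End bp 595286.
gene_bp, protein_aa = cds_lengths(594225, 595286)
print(gene_bp, protein_aa)  # 1062 353
```

Under these assumptions the result agrees with the listed gene length (1062 bp) and protein length (353 aa).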


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0154701
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.395285
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTAACA ACAGCGAAAG TGTTATAGTC ACTTCGATGG GCATGTTCAT CATTGACGAC 
AATATATATC CTCCGTCTTG GAACAGAAAG AACGACACAG ACATTATTGG CGGAGGTGGT
CCGTATGCCA TTGTAGGAGC TACCATGATA GCAGGACGGG AAAATGGACA CAGGGTCAGT
GGCATAATCG ATAAGGGTCT GGACTTTCCT AAAAAGGTTG AGGAGCAGTT GAATTCATGG
CAGTCTGGAG TCATCTTTCG AGAGAACCCA GAAAGACTCA CAACCAGAGG AGTAAACACC
TATGACGAGA ACCATATCCG TCATTTTTCC TACAAGAATC CCAAGAAACG TATCGAAGTC
GTTGATATAT TGCAACTGGA TAAATTGTCG ACTTCGCGAT GTTTTCATTT GATTTGCTCT
ATTGAACGTT GTGAATCGAT CATAGACGAT CTTAACTCTA AACTAGACCA TACTCCAGTT
TACATATATG AGCCTCTCCC AGACGACTGT ATCTCTACCA ACTTTGACAG GCTCAAACTC
TTGCTTCCTA AGATTGACAT TTTCACACCC AATCTCGATG AGGCCCAGGC ACTCTTGGGC
AGATCAGGTT CACTTCCTAG CACATCGGAA AAGCTTAAGG AAGTAGCGTC CCATTTTATG
CCCTATTTAA AGCTCAAGAA CTCAGGAATT ATCTTGAGAT GTGGTCCACT TGGTTGTTTC
ATAAATACCA TAGACGACTA CAATGTCTTG TTGCCTGCTT ATCACAGCGA TCAGACAAAG
GTAGTAGATG TCACTGGAGG TGGAAACTCT TTCTGTGGAG GATGCATAGC AGGATTTTAC
TTGTCAGGAG GTAACTGGCT AGTAGCAGGA GTAAGTGGAA ATTTGGTCAG TGGGTGTGTT
ATAGAGAAGT TGGGAATGCC TCTTAGACAG TCTGAAACCA ACAAATGGAA TGGTCTGACA
GTTTCAGAAA GATTAGACAC TTATTTGAAA AATAATCCTC AGATTATCGA GGTTCAAAAT
GAACAACTAT TACAGGGTTT GAACGTATTG AAACAAGTAT AG
 
Protein sequence
MTNNSESVIV TSMGMFIIDD NIYPPSWNRK NDTDIIGGGG PYAIVGATMI AGRENGHRVS 
GIIDKGSDFP KKVEEQLNSW QSGVIFRENP ERLTTRGVNT YDENHIRHFS YKNPKKRIEV
VDILQSDKLS TSRCFHLICS IERCESIIDD LNSKLDHTPV YIYEPLPDDC ISTNFDRLKL
LLPKIDIFTP NLDEAQALLG RSGSLPSTSE KLKEVASHFM PYLKLKNSGI ILRCGPLGCF
INTIDDYNVL LPAYHSDQTK VVDVTGGGNS FCGGCIAGFY LSGGNWLVAG VSGNLVSGCV
IEKLGMPLRQ SETNKWNGST VSERLDTYLK NNPQIIEVQN EQLLQGLNVL KQV
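
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence relates to the protein sequence under that table. The helper names (`codon_table`, `translate`, `gc_percent`) are hypothetical and written with the standard library only; this is not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
STANDARD_AA = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def codon_table(table_id=12):
    """Build a codon -> amino acid map for table 1 or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if table_id == 12:
        table["CTG"] = "S"  # the only difference from the standard code
    elif table_id != 1:
        raise ValueError("only tables 1 and 12 are handled in this sketch")
    return table


def translate(dna, table_id=12):
    """Translate DNA in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping above) is ignored.
    Ambiguous bases such as N are not handled and raise KeyError.
    """
    table = codon_table(table_id)
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGACTAACA ACAGCGAAAG TGTTATAGTC ACTTCGATGG GCATGTTCAT CATTGACGAC"
last_row = "GAACAACTAT TACAGGGTTT GAACGTATTG AAACAAGTAT AG"

print(translate(first_row))   # MTNNSESVIVTSMGMFIIDD
print(translate(last_row))    # EQLLQGLNVLKQV
print(translate("GGTCTGGAC", table_id=12))  # GSD
print(translate("GGTCTGGAC", table_id=1))   # GLD
```

The first and last rows translate to the start and end of the listed protein sequence. The `GGTCTGGAC` fragment comes from the fourth row of the gene sequence and shows why the table matters: it gives `GSD` under table 12, matching the `KGSDFP` stretch of the protein, whereas the standard code would give `GLD`. Passing the full gene sequence to `gc_percent` gives a value to compare with the listed GC content.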