Gene PICST_33873 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33873 
SymbolMSP1 
ID4841135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp670230 
End bp671303 
Gene Length1074 bp 
Protein Length357 aa 
Translation table12 
GC content42% 
IMG OID640392450 
Product40 kDa putative membrane-spanning ATPase 
Protein accessionXP_001386533 
Protein GI150866810 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0464] ATPases of the AAA+ class 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.767325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0397668 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGT TCAAACTCGA TATAAAGTTT CTTGGAGACT TGTTTATTCT TGCGGGAGCT 
GGGCTCTCAG TATATTTCAT ACTCAACAAT ATACTCAACG ACTATGTAGA CAACTCTATG
AAGAACAAGG AATCGGAGAA AAAGGGCAAG GGCATTCTCA AGAAATTACA GAGTTCTAAT
CCGCACCTCC GCAACATCTC CCTCAACCAG TACGAGAAGC TGTTGCTCAA TTCGTTGGTT
ACACCAGAAG AAATATCAGT AACTTTCAAT GATGTGGGCG GTTTGCAAGA CATAATTGAC
GAGTTGAGGG AAGCGGTGAT TCTTCCTTTA ACAGAACCAG AGCTTTTCGC TACCCATCTG
GATTTGATCC AGTCACCTAA GGGTGTCTTA TTCTACGGTC CCCCTGGATG TGGTAAGACA
ATGTTGGCCA AAGCTATTGC CAAAGAAAGT GGTGCGTTTT TCTTGCTGAT AAGGATGTCG
ACTATTATGG ACAAATGGTA CGGTGAGTCG AACAAGATCA CCGATGCCAT CTTCTCACTT
GCTAACAAAT TGCAACCCTG TATTATTTTC ATTGACGAAA TAGATTCGTT TTTGAGAGAC
AGGTCTTCCA GCGACCACGA AGTCAGTGCC ATGTTAAAGG CTGAATTCAT GACGTTGTGG
GACGGCTTGA AGTCTAACGG AAGAATCATG GTAATGGGCG CTACCAATCG TAAGAGTGAC
ATTGACGAAG CCTTCTTGAG AAGATTACCC AAAACTTTTG CTATTGGCAA ACCAAACGAA
TCGCAAAGAC GTTCCATATT GTCTAAAATA CTAAGTGGAG CCAAGCTCGA TGAAAAAGAT
TTTGACTTGG AATACATCGT TGCTAATACA AAAGGCTTTA GTGGTTCTGA CTTGAGAGAA
TTATGTCGTG AAGCTGCCAT TCTTCCCGTC AGGGAGTACA TCAGAGAGAA TTACAACTAT
AGGAGTGGGA AGTTGAGCAA GGATGAAAAT GAAGACATGC CTGTTAGACC GTTGAAGACT
TCTGATTTTG TAAAGACTGC CGAGTCCACC ATTCAACCTG CTACCATTGA TTAG
 
Protein sequence
MAKFKLDIKF LGDLFILAGA GLSVYFILNN ILNDYVDNSM KNKESEKKGK GILKKLQSSN 
PHLRNISLNQ YEKSLLNSLV TPEEISVTFN DVGGLQDIID ELREAVILPL TEPELFATHS
DLIQSPKGVL FYGPPGCGKT MLAKAIAKES GAFFLSIRMS TIMDKWYGES NKITDAIFSL
ANKLQPCIIF IDEIDSFLRD RSSSDHEVSA MLKAEFMTLW DGLKSNGRIM VMGATNRKSD
IDEAFLRRLP KTFAIGKPNE SQRRSILSKI LSGAKLDEKD FDLEYIVANT KGFSGSDLRE
LCREAAILPV REYIRENYNY RSGKLSKDEN EDMPVRPLKT SDFVKTAEST IQPATID