Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_33873 |
Symbol | MSP1 |
ID | 4841135 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 670230 |
End bp | 671303 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640392450 |
Product | 40 kDa putative membrane-spanning ATPase |
Protein accession | XP_001386533 |
Protein GI | 150866810 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.767325 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0397668 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAGT TCAAACTCGA TATAAAGTTT CTTGGAGACT TGTTTATTCT TGCGGGAGCT GGGCTCTCAG TATATTTCAT ACTCAACAAT ATACTCAACG ACTATGTAGA CAACTCTATG AAGAACAAGG AATCGGAGAA AAAGGGCAAG GGCATTCTCA AGAAATTACA GAGTTCTAAT CCGCACCTCC GCAACATCTC CCTCAACCAG TACGAGAAGC TGTTGCTCAA TTCGTTGGTT ACACCAGAAG AAATATCAGT AACTTTCAAT GATGTGGGCG GTTTGCAAGA CATAATTGAC GAGTTGAGGG AAGCGGTGAT TCTTCCTTTA ACAGAACCAG AGCTTTTCGC TACCCATCTG GATTTGATCC AGTCACCTAA GGGTGTCTTA TTCTACGGTC CCCCTGGATG TGGTAAGACA ATGTTGGCCA AAGCTATTGC CAAAGAAAGT GGTGCGTTTT TCTTGCTGAT AAGGATGTCG ACTATTATGG ACAAATGGTA CGGTGAGTCG AACAAGATCA CCGATGCCAT CTTCTCACTT GCTAACAAAT TGCAACCCTG TATTATTTTC ATTGACGAAA TAGATTCGTT TTTGAGAGAC AGGTCTTCCA GCGACCACGA AGTCAGTGCC ATGTTAAAGG CTGAATTCAT GACGTTGTGG GACGGCTTGA AGTCTAACGG AAGAATCATG GTAATGGGCG CTACCAATCG TAAGAGTGAC ATTGACGAAG CCTTCTTGAG AAGATTACCC AAAACTTTTG CTATTGGCAA ACCAAACGAA TCGCAAAGAC GTTCCATATT GTCTAAAATA CTAAGTGGAG CCAAGCTCGA TGAAAAAGAT TTTGACTTGG AATACATCGT TGCTAATACA AAAGGCTTTA GTGGTTCTGA CTTGAGAGAA TTATGTCGTG AAGCTGCCAT TCTTCCCGTC AGGGAGTACA TCAGAGAGAA TTACAACTAT AGGAGTGGGA AGTTGAGCAA GGATGAAAAT GAAGACATGC CTGTTAGACC GTTGAAGACT TCTGATTTTG TAAAGACTGC CGAGTCCACC ATTCAACCTG CTACCATTGA TTAG
|
Protein sequence | MAKFKLDIKF LGDLFILAGA GLSVYFILNN ILNDYVDNSM KNKESEKKGK GILKKLQSSN PHLRNISLNQ YEKSLLNSLV TPEEISVTFN DVGGLQDIID ELREAVILPL TEPELFATHS DLIQSPKGVL FYGPPGCGKT MLAKAIAKES GAFFLSIRMS TIMDKWYGES NKITDAIFSL ANKLQPCIIF IDEIDSFLRD RSSSDHEVSA MLKAEFMTLW DGLKSNGRIM VMGATNRKSD IDEAFLRRLP KTFAIGKPNE SQRRSILSKI LSGAKLDEKD FDLEYIVANT KGFSGSDLRE LCREAAILPV REYIRENYNY RSGKLSKDEN EDMPVRPLKT SDFVKTAEST IQPATID
|
| |