Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_86369 |
Symbol | SAP11 |
ID | 4851556 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 2126266 |
End bp | 2129065 |
Gene Length | 2800 bp |
Protein Length | 810 aa |
Translation table | |
GC content | 43% |
IMG OID | 640393264 |
Product | member of the AAA ATPase family of proteins |
Protein accession | XP_001388033 |
Protein GI | 126274846 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0464] ATPases of the AAA+ class |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.435802 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATCTATACGC TGTCTGTTCA TTGTCTGTAG CTGAGCAATA TAGATCAGAA TGTTCCGAAA AAAACCCCCT CAGCTCTCGC TCCTCGAGGA ACTCAGACAG ACCTATGACG ACTGCTGCAA CTTCCTGATC AAGAACCTCG CCCTAGAGGA AAACAACCAC GTAGAGGATG CCCTCAAAGG CTGGAAGGGC TTGCACACGT CCCTTCTCTA CAAAATCGAC TTGTTTGACA GGCTGGCTCA CAAACTCCTG AACGATGAAA TCCAGATCCT CACAGAGTTG AAAGCCATCA GAGACCAGAA CGTCAAGCAT TTGATCCGCG TTCAGTTACG AATGGATGAA GTAAATCGCA AGAAGAGCAG AGATGCCACC AATGCCGTAC ATGCTGCCTC TAAACAGGCT TCTTCTTCGT CGTCGTCCTC GCTATCTGTT CCTTCACTCA GAAACCCAGC CTCCAGATCG CTTTCAAGTA ACTCAGTGAA CCAACGCTCG ATGTTGAAGT CTCTAAGACC AGGAATGGTG CTGGGACTGA GTTCACAGGG ATCTAGTAGC TCGTATAATG TTCCCAAGGT GAAAACTGCT CAGGCTTCGC AGGCTGCAAA TGTCTCCTGG CAGGGTCCCC AAAATCGCCA TAAACCTTCC CAGAAGTCAG CAAAAGAACT TACCGAAACT GAAAATGAGC TCTTTGATGA TTTCGACGAT GATGGATTCG AATTCACCGA CAATTGGGAA CGCATCAACG ACAACAATCT TAACAATGTC AACAACAATA CCAACAATAA CAACAACCAT AATGGCTCTC GCTCATCCTC AGGCTTGTCA GAACTATCTT CAGGAAGAGG ATCTTCATCA TCCAATGTTA ATCTCATAGA TCTCGATGAC GATAACTGCA ACTATGGTAC CAGCTACAAT AACAATAACA CCAACCTCAA TAGTAACAAC AACGGTATTA ATGGAAATCT CAAACAGATC CAAGACTACT TCTCAACGTC CCCCGAAGAT GACCTAGTGC GTTCCATGGA TGACATCACT TTGAAACCAC ATGCATCACC AACAAGGCAG CATCAACTGT TACTGGTACC AAAGGTTACT CTTACCAACC ATAGTTCTGG CAGTATCAAC AAGACCTCGG CTACCAATGG AACAACAACT AACCGTCGAG TAGATCACAT TTCCAAAAGC ACATCCGACA TTAGGACAAA CCATGCGCCA CAGACAAAGC AGCCGTATAT CTATAACAAG CCAAAACCTA TAGACGTCAG CAAGATCATG AAGAAACAAT CCCAGAATCG TTTAAATCCG ACAAAGAGCC CTACAAAGCA ACCTACCACT GTCAAAAGTC AACCTTCCAA AACAGCTGCA GCTCAAAGAA CTACACGCAC ATCCAATAAT AGCCACACTG CAAAGCCGAA GACGAATCCA AGCAGTAACA TATCTTACAA CTACGTAAAA ACTTCTAAGA CTGTGAACCC TGCTGTTAAA AAGACAGTAA AGCCAGCTAC ACTTCAGCCA AATAAAATTA TGACAAATCA GAAGAACAAG AGTAGCGATG CTATCAATAG TCCAACCATG GACGATTTAT TGAGTGGCTA TGATAATGAC GAAGTAGATG TTAATTTATT AACAAATGAA GAGGAACAAG AGGCTCTCAT CAACTCAGTT AGAGGAGTAG ACCCTGATGC GGCCAAACAC ATTTTAAACG ATATTGTTAT CCATGGCGAC GAGGTTTATT GGGACGACTT GGTCGGATTA GAAAGTGCTA AATACTCGTT GAAAGAGGCC GTAGTATATC CATTTTTGCG CCCTGATTTA TTTAAGGGGT TGAGAGAGCC CACTAGAGGC ATGTTGTTGT TTGGTCCGCC AGGAACCGGT AAAACAATGT TGGCGAGAGC TGTTGCTACG GAATCGAAGT CAACATTTTT CTCTATTAGT GCATCATCCT TGACATCTAA GTATTTGGGA GAGTCCGAGA AGTTGGTGAG GGCTTTGTTT TTGATGGCAA AGAAGCTAGC TCCATCTATA GTCTTTGTGG ATGAAATAGA TTCTTTGTTG AGTTCCAGAA CAGAGGGAGA AGTAGAAAGT ACAAGAAGAA TTAAAAATGA ATTTTTGGTA CAGTGGTCCG AACTTTCCAG TGCAGCAGCC GGAAGAGAAT CTGACAACGA TGATGTTTCT AGAGTTTTGA TTCTAGGAGC CACAAATCTA CCTTGGTCTA TCGATGAGGC AGCCAGAAGA AGATTCGCAA GAAGGCAATA CATTCCTTTA CCAGAGGCGG ACTCTAGACT GGCACAGATT CGAAAATTAT TACAATACCA GAAAAATACT CTATCGGACG AAGATTACGA GGTCCTCAAG GACTTGACCG ATGGATTCAG TGGCTCAGAC ATAACTGCTC TTGCAAAGGA TTCAGCTATG GGACCGTTGA GAGCTCTTGG AGAAAAACTT CTACTGACAC CTACAGAGCA AATTCGACCA ATTAACCTTG AAGACTTTAA GAACAGTTTG AAGTACATCA GGCCAAGTGT TTCATCAGAA GGGTTGCAGG AGTATGAAAA ATGGGCCGAA AAATTTGGAT CTTCAGGTGC ATAGGGGTGG ATTGATTGAC ACCTCAGGAA TTCAAATGCC CAGAAAAGAC CAGCTTGTTC TGTTGGCTAC TAGATGCAGC GACTTATCAG ATATACGAAT ATTATTGGTA TATATTTAGG TCTCAGTTAG TACTTTTGCG TGCATTCACA AATCGACTTC ATCAACGTTT CTATTTTGAT TTTATCAGGG GAACAATAAA AGAAATAAAT GAAAAGAATG
|
Protein sequence | MFRKKPPQLS LLEELRQTYD DCCNFLIKNL ALEENNHVED ALKGWKGLHT SLLYKIDLFD RLAHKLLNDE IQILTELKAI RDQNVKHLIR VQLRMDEVNR KKSRDATNAV HAASKQASSS SSSSLSVPSL RNPASRSLSS NSVNQRSMLK SLRPGMVLGL SSQGSSSSYN VPKVKTAQAS QAANVSWQGP QNRHKPSQKS AKELTETENE LFDDFDDDGF EFTDNWERIN DNNLNNVNNN TNNNNNHNGS RSSSGLSELS SGRGSSSSNV NLIDLDDDNC NYDDLVRSMD DITLKPHASP TRQHQLLLVP KVTLTNHSSG SINKTSATNG TTTNRRVDHI SKSTSDIRTN HAPQTKQPYI YNKPKPIDVS KIMKKQSQNR LNPTKSPTKQ PTTVKSQPSK TAAAQRTTRT SNNSHTAKPK TNPSSNISYN YVKTSKTVNP AVKKTVKPAT LQPNKIMTNQ KNKSSDAINS PTMDDLLSGY DNDEVDVNLL TNEEEQEALI NSVRGVDPDA AKHILNDIVI HGDEVYWDDL VGLESAKYSL KEAVVYPFLR PDLFKGLREP TRGMLLFGPP GTGKTMLARA VATESKSTFF SISASSLTSK YLGESEKLVR ALFLMAKKLA PSIVFVDEID SLLSSRTEGE VESTRRIKNE FLVQWSELSS AAAGRESDND DVSRVLILGA TNLPWSIDEA ARRRFARRQY IPLPEADSRL AQIRKLLQYQ KNTLSDEDYE VLKDLTDGFS GSDITALAKD SAMGPLRALG EKLLLTPTEQ IRPINLEDFK NSLKYIRPSV SSEGLQEYEK WAEKFGSSGA
|
| |