Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_36036 |
Symbol | |
ID | 4838737 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1408821 |
End bp | 1410026 |
Gene Length | 1206 bp |
Protein Length | 323 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390052 |
Product | predicted protein |
Protein accession | XP_001384228 |
Protein GI | 126135408 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5034] Chromatin remodeling protein, contains PhD zinc finger |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGAACC AGCAGCAACA ACAGCAGTTC CAGCAGTACA AACAGCGTGC AAAGAACATT GCAAACATCA ACGAGCTTCT TCCAGGGTTA AACGACATTT CCGATGCGTT TGAAGTGTTA CCTATGGACC TCATAAAGTA CTTCACACTA CTTAAGGAAA TCGATGCCAA ATGTATTAAC ACCGTTCCGC AGATCAACTT CTTGATCAAA AAGTACATTG GAGGTTTACA TGAGGGCAAG CTGTCCAAGG ATGGAGCCGA AGGAGGAGAA GACAGCAACG AAAATAACAA TGACAAATCC GAAAATGACA AAGACAAGCT TGAAAACGAC AAGGAATCAA AAGAGACTTC CAAGAGAGAC CTGGAATTGA GAAGGTTGAC ACTCATCAAA GACAAGATCA ACGAAATAAT TCCCTGCTTA GAAGAGAAGA TGCACGTTAC GTCGGTGGCG ACCGACTTGT TGAGCAAACA TATGTTCCGT ATCAACAATG ACTACAAACT AATTGTCAAC AACAACGAGA TTCCAGAGTC AATCCGTATC GGTCCTCTAA GCCACCCGGC AATGATCATG GATGCTAACG TAGCCAACGG GTCAAGTGTT GACAGATCAG CACAGGCACA AAGAAGTGAA AGTAGAAGAG AAGCTCTAGC AGCCAAGAAA GCCAGCAAAG AAGACAATTC TGATGACCAT CTTTCTGTGA AGAAGAAGAA GAACAAAGAG AGCACACCTT CAGAAGCACT CAAGGGCACA CCGAATGGAA ACCCAGGAGT AACCAACACC AAGAAGAGAT CGAGAAAGGA AAATGATGAT ATCCAGAGAC CAGTTACCCC TGGCGGCGGC GCTACCAACG GAACAGCACC AGCCAAGAAG AAGAATAAGC CCAAGAAGGA AGACGACAGT AGCAATAATT CACGTGCAAA CAACACCAGC AGCAATAATG TCAACTCCAA CGAACTTGAC AACGATGACA ATAACACTAT AGACAGAGTC AAGGCTGAAG TTCCAGCAAG TAAGGCCAAG AATACAGGCG AACCTACGTA CTGTTACTGT AACCAAGTTT CCTTTGGTGA GATGGTGGGA TGCGACGGTG ACGACTGTAA GCGTGAATGG TTCCATTTGC CATGTATTGG ATTCAAGAAC CCACCTAAGG GTAAATGGTA CTGTGACGAC TGTCTTGTGA AAATGAAGAA AATGAAGAAA TTATGA
|
Protein sequence | MKNQQQQQQF QQYKQRAKNI ANINELLPGL NDISDAFEVL PMDLIKYFTL LKEIDAKCIN TVPQINFLIK KYIGDKINEI IPCLEEKMHV TSVATDLLSK HMFRINNDYK LIVNNNEIPE SIRIGPLSHP AMIMDANVAN GSSVDRSAQA QRSESRREAL AAKKASKEDN SDDHLSVKKK KNKESTPSEA LKGTPNGNPG VTNTKKRSRK ENDDIQRPED DSSNNSRANN TSSNNVNSNE LDNDDNNTID RVKAEVPASK AKNTGEPTYC YCNQVSFGEM VGCDGDDCKR EWFHLPCIGF KNPPKGKWYC DDCLVKMKKM KKL
|
| |