Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_66957 |
Symbol | HMO1 |
ID | 4837216 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1370484 |
End bp | 1371607 |
Gene Length | 1124 bp |
Protein Length | 232 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640388531 |
Product | high mobility group-like protein |
Protein accession | XP_001383030 |
Protein GI | 150864277 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5648] Chromatin-associated proteins containing the HMG domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTGAAT CCTTCAAGAA CGCCAAGGAC CTGCTCGTAG CCTCATTATT CGAGTTGTCC AAATCTGCCC AAGATGCCGC CAATGCTACC GTCAACTTCT ACAAGCTCGC CAGCGAAGAG GGTGACGCCG AAACCCTCAA GCTGTTGAGT GAGACCTGGA AATCTGTCGC GTTCGCTACC GATCCCACCA ATTCCAACGG AAACGGTTCT AAGGTCAATG GCTCTAAAGT CGACGACGAT CTCATCGCTA CTGCTGCCGC TGCTGCTGCC GCAGTTTCCT CGGTAGAGAT CCCCACCGTT CCCTCAGTCG GAAAGGCTAA GGTCACCAAG GCTGAAAAGC CCAGAAAGAA GAAGGTGGAA AAAGACCCCA ATGCTCCAAA GAAGCCTTTG ACTATCTACT TTGCCTTTTC CTTCCATACA AGAAAACTGA TCAAGGATGA CAGAGAAAGA AAAGGCTTGC CAGCTTTGTC TGCTATCGAC ATGAACGAAA TTGTCAAGCA GAAATGGGAA TCCATTACTC CAGAAGAAAA GGAAATCTGG CAAAAGAAGT ACGCCAATGA GTTGAATGAA TACCAAAAGG AAAAGGAAAA ATACAGACTC TCCAAAGAAG ACAAACCAAA TCAAGTAGCA GCTGTTGCAG CAGAAGTTGC CCGTGCTTTT GAACCTACAG TGGACATTCC TCTCTTGTCT CTGGTGGATG CTCCTAAGAA GAAGGAGAAG AAGAGAAAGT CTGAAAAATC TGACAAGAAG ATCGAAAAGA AGTCGAAGAA GGATAAGATT GTCCAGCCTA TCCAATTGCA GCATTAAAGA AGACGTCGAT CATGATTATG ACATGAATTG TTTTAAGAAT TCTCCTGTAA TTGCTATCTT AAAATCGCCA TCCAAAATGA TATAATTGTA TGAAAATATA ATTATATCAT TGTATCAATT ATTATCATGA ATCGGTCTTG AAAAAGATTA TTATTCATAT AGAAGATTTG ATAATGATCT TCATGATTAT CTTTCTATTT ATTGCTAATA GCATTTACCG AATGCAACCT TCACTTCTGC CTCGATGGTC CAGCTAGATA TATAGTGTAT TATTACCCAC TCTCCTCTAA ATGTACAACT GTTTCTATCA GTTT
|
Protein sequence | MSESFKNAKD SLVASLFELS KSAQDAANAT VNFYKLASEE VDDDLIATAA AAAAAVSSVE IPTVPSVGKA KVTKAEKPRK KKVEKDPNAP KKPLTIYFAF SFHTRKSIKD DRERKGLPAL SAIDMNEIVK QKWESITPEE KEIWQKKYAN ELNEYQKEKE KYRLSKEDKP NQVAAVAAEV ARAFEPTVDI PLLSSVDAPK KKEKKRKSEK SDKKIEKKSK KDKIVQPIQL QH
|
| |