Gene PICST_66957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_66957 
SymbolHMO1 
ID4837216 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp1370484 
End bp1371607 
Gene Length1124 bp 
Protein Length232 aa 
Translation table12 
GC content42% 
IMG OID640388531 
Producthigh mobility group-like protein 
Protein accessionXP_001383030 
Protein GI150864277 
COG category[B] Chromatin structure and dynamics 
COG ID[COG5648] Chromatin-associated proteins containing the HMG domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGAAT CCTTCAAGAA CGCCAAGGAC CTGCTCGTAG CCTCATTATT CGAGTTGTCC 
AAATCTGCCC AAGATGCCGC CAATGCTACC GTCAACTTCT ACAAGCTCGC CAGCGAAGAG
GGTGACGCCG AAACCCTCAA GCTGTTGAGT GAGACCTGGA AATCTGTCGC GTTCGCTACC
GATCCCACCA ATTCCAACGG AAACGGTTCT AAGGTCAATG GCTCTAAAGT CGACGACGAT
CTCATCGCTA CTGCTGCCGC TGCTGCTGCC GCAGTTTCCT CGGTAGAGAT CCCCACCGTT
CCCTCAGTCG GAAAGGCTAA GGTCACCAAG GCTGAAAAGC CCAGAAAGAA GAAGGTGGAA
AAAGACCCCA ATGCTCCAAA GAAGCCTTTG ACTATCTACT TTGCCTTTTC CTTCCATACA
AGAAAACTGA TCAAGGATGA CAGAGAAAGA AAAGGCTTGC CAGCTTTGTC TGCTATCGAC
ATGAACGAAA TTGTCAAGCA GAAATGGGAA TCCATTACTC CAGAAGAAAA GGAAATCTGG
CAAAAGAAGT ACGCCAATGA GTTGAATGAA TACCAAAAGG AAAAGGAAAA ATACAGACTC
TCCAAAGAAG ACAAACCAAA TCAAGTAGCA GCTGTTGCAG CAGAAGTTGC CCGTGCTTTT
GAACCTACAG TGGACATTCC TCTCTTGTCT CTGGTGGATG CTCCTAAGAA GAAGGAGAAG
AAGAGAAAGT CTGAAAAATC TGACAAGAAG ATCGAAAAGA AGTCGAAGAA GGATAAGATT
GTCCAGCCTA TCCAATTGCA GCATTAAAGA AGACGTCGAT CATGATTATG ACATGAATTG
TTTTAAGAAT TCTCCTGTAA TTGCTATCTT AAAATCGCCA TCCAAAATGA TATAATTGTA
TGAAAATATA ATTATATCAT TGTATCAATT ATTATCATGA ATCGGTCTTG AAAAAGATTA
TTATTCATAT AGAAGATTTG ATAATGATCT TCATGATTAT CTTTCTATTT ATTGCTAATA
GCATTTACCG AATGCAACCT TCACTTCTGC CTCGATGGTC CAGCTAGATA TATAGTGTAT
TATTACCCAC TCTCCTCTAA ATGTACAACT GTTTCTATCA GTTT
 
Protein sequence
MSESFKNAKD SLVASLFELS KSAQDAANAT VNFYKLASEE VDDDLIATAA AAAAAVSSVE 
IPTVPSVGKA KVTKAEKPRK KKVEKDPNAP KKPLTIYFAF SFHTRKSIKD DRERKGLPAL
SAIDMNEIVK QKWESITPEE KEIWQKKYAN ELNEYQKEKE KYRLSKEDKP NQVAAVAAEV
ARAFEPTVDI PLLSSVDAPK KKEKKRKSEK SDKKIEKKSK KDKIVQPIQL QH