Gene Information |
Locus tag | PICST_42410 |
Symbol | |
ID | 4837015 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | + |
Start bp | 2020258 |
End bp | 2021934 |
Gene Length | 1677 bp |
Protein Length | 558 aa |
Translation table | 12 |
GC content | 45% |
IMG OID | 640388330 |
Product | predicted protein |
Protein accession | XP_001382617 |
Protein GI | 150863958 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG1243] Histone acetyltransferase |
TIGRFAM ID | [TIGR01211] histone acetyltransferase, ELP3 family |
|
|
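The coordinate and length fields in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are consistent with the listed values but are not stated by the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 2020258
end_bp = 2021934
gene_length_bp = 1677
protein_length_aa = 558


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Codon count of an unspliced CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks compare a derived value against the value listed in the record.
coords_ok = span_length(start_bp, end_bp) == gene_length_bp
protein_ok = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_ok, protein_ok)
```

Because the record marks the gene as not spliced, the CDS length can be taken directly from the genomic span; for a spliced gene the intron lengths would have to be subtracted first.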
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTCTG TTCAGAGCAC CAAGGGCGGA AAACAGAAGT TGGCTCCTGA AAAGGAGCGA TTTTTGCAAT GTTGCGGAGA CATTTCGCTC GAGCTTGTAG CTTCCCTCAA GAACTCCAAA GACATCAACT TGAACGGCTT GATCATCAGG TATGCGAAAA AGTACAAGTT GAAGCAGCAG CCCAGACTAA CGGATATCAT CTCGTCCATC CCAGACCAGT ACAAGAAGTA CTTAATTCCA AAGCTCAAGG CCAAACCGGT CCGTACCGCA TCTGGTATTG CGGTTGTAGC AGTCATGTGT AAACCTCACA GATGTCCCCA TATAGCCTAC ACGGGAAACA TCTGTGTATA TTGTCCAGGG GGGCCAGATT CAGACTTTGA ATACTCGACC CAGTCATATA CCGGGTATGA GCCGACTTCA ATGAGAGCCA TTCGGGCTAG ATATGATCCT TACGAACAGG CTCGAGGCAG ACTAGAGCAG TTGAGACTGT TGGGCCATTC CATAGACAAA GTTGAGTACA TTATCATGGG TGGAACATTC ATGTCACTTC CCATCGATTA CAGAGAAGGC TTCATCACCC AGTTACACAA CGCATTAACA GGTTATAACG GTAAGGACAT TGACGAAGCC ATCAAATATT CCCAACAATC ACAGACTAAG TGTGTGGGAA TAACCATTGA AACTAGACCC GATTACTGTA CTGAAACCCA TTTGAGCGAC ATGTTGAAGT ACGGATGTAC CAGATTGGAA ATCGGGGTAC AGTCGGTATA TGAGGATGTA GCAAGAGACA CGAATAGAGG ACATACGGTT AAGGCTGTCT GTGAAACCTT TGCTGTAGCC AAAGATGCTG GGTACAAGGT GGTGAGTCAT ATGATGCCTG ACTTGCCCAA TGTAGGCATG GAAAGAGACT TGGAACAATT TAAGGAATAC TTTGAGAATC CCGAGTTCAG AACTGACGGC TTGAAGTTGT ACCCCACATT GGTCATTAGA GGCACTGGAT TGTACGAGTT GTGGAAGAAA GGGTTATATA AGTCATACAA TGCGAATGCC TTGATAGACT TGGTGGCTCG TATCATGGCC ATGGTACCTC CATGGACACG TATCTATCGT GTGCAAAGAG ATATCCCTAT GCCGTTAGTC ACGTCGGGTG TAGAAAACGG AAACTTGAGA GAATTGGCTC TTGCCAGAAT GAAAGACTTT GGCACCACCT GTAGAGACGT ACGTACAAGA GAAGTCGGAA TCCAAGAAGT TCATCACAAA GTTGTACCAG ACCACGTGGA ATTGATTAGA AGAGATTACT ATGCCAATGG AGGCTGGGAA ACTTTTTTGT CGTACGAAGA CCCAAAGAAG GATATTTTGA TTGGCTTGTT GAGATTGCGT AAGGCTTCTA AGAAGTACAC ATACAGAAAG GAATTCACCA ACCAACCTAC CTCTATCATC AGAGAATTGC ATGTCTACGG TTCTGTTGTG CCCTTGCACT CCAGAGACCC TAGAAAGTTC CAGCATCAAG GGTTTGGTAC CTTGTTAATG GAAGAAGCTG CCAGAATCGC CAAGGAAGAA CATGGTTCTG AAAAGATCTC GGTCATTTCG GGTGTAGGTG TAAGAAACTA CTACGCAAAA CTTGGCTACC ATTTGGATGG TCCATATATG TCTAAATGGC TTAACGACGA GGAATAG
|
Protein sequence | MPSVQSTKGG KQKLAPEKER FLQCCGDISL ELVASLKNSK DINLNGLIIR YAKKYKLKQQ PRLTDIISSI PDQYKKYLIP KLKAKPVRTA SGIAVVAVMC KPHRCPHIAY TGNICVYCPG GPDSDFEYST QSYTGYEPTS MRAIRARYDP YEQARGRLEQ LRSLGHSIDK VEYIIMGGTF MSLPIDYREG FITQLHNALT GYNGKDIDEA IKYSQQSQTK CVGITIETRP DYCTETHLSD MLKYGCTRLE IGVQSVYEDV ARDTNRGHTV KAVCETFAVA KDAGYKVVSH MMPDLPNVGM ERDLEQFKEY FENPEFRTDG LKLYPTLVIR GTGLYELWKK GLYKSYNANA LIDLVARIMA MVPPWTRIYR VQRDIPMPLV TSGVENGNLR ELALARMKDF GTTCRDVRTR EVGIQEVHHK VVPDHVELIR RDYYANGGWE TFLSYEDPKK DILIGLLRLR KASKKYTYRK EFTNQPTSII RELHVYGSVV PLHSRDPRKF QHQGFGTLLM EEAARIAKEE HGSEKISVIS GVGVRNYYAK LGYHLDGPYM SKWLNDEE
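The record lists translation table 12 (the alternative yeast nuclear code), which differs from the standard code in reading CTG as serine rather than leucine. As a minimal sketch of how the protein sequence relates to the gene sequence, the code below builds both codon tables from the usual compact 64-letter encoding (bases in T, C, A, G order) and translates two short excerpts copied from the gene sequence above. The helper names `build_table`, `clean`, and `translate` are hypothetical, and the sketch ignores start-codon handling and ambiguous bases.

```python
BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_table(aas):
    """Map each of the 64 codons to its amino acid letter."""
    return {
        a + b + c: aas[16 * i + 4 * j + k]
        for i, a in enumerate(BASES)
        for j, b in enumerate(BASES)
        for k, c in enumerate(BASES)
    }


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 is identical to table 1 except that CTG encodes serine.
TABLE_12 = dict(TABLE_1, CTG="S")


def clean(seq):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq, table):
    """Translate complete codons in frame 1; trailing bases are ignored."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(table[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence shown above.
print(translate("ATGCCTTCTG TTCAGAGCAC CAAGGGCGGA", TABLE_12))  # MPSVQSTKGG
# Bases 481-489 of the gene sequence, which contain a CTG codon.
print(translate("TTGAGACTG", TABLE_12))  # LRS
print(translate("TTGAGACTG", TABLE_1))   # LRL
```

The second excerpt corresponds to residues 161-163 of the listed protein (LRS), so translating with the standard table would give a different residue at that position.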
|
| |