Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_73625 |
Symbol | |
ID | 4840543 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009047 |
Strand | - |
Start bp | 888704 |
End bp | 891604 |
Gene Length | 2901 bp |
Protein Length | 636 aa |
Translation table | 12 |
GC content | 42% |
IMG OID | 640391858 |
Product | predicted protein |
Protein accession | XP_001386360 |
Protein GI | 150866686 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.171811 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GCCAGCACAT AATATTTTTT CGCCGCCCGC CCATCCAGCC CGCGTTTTGC AGTTTCGTCC ACCGCACCAC AAAGTCTGCA GATTGCGCTT ACGTCAGCTT GCTGGTTCAC CTCGACACGT CTGGCTGTGG GTAATTTTTT AATTTTTTGC AAAACCCGGT CGCAATCCGA AGATATATAA AAAACGCTTT TGCGTCTTTT TTTTTGTCTT TTTCCAACCT GAAAAAACAG ATTTAGTTTT CGTGGGTGCA GAGTGTATGT GTTATATACA GGTGTTTATT TCACCACCAG TGAACGAGAG AGCCGAAAGA TACCAGAGGA TAGGATAGAT ATGACAGTTT GGTGGCCGTT CGAGAGCTGC TGAAGTGATG GCAAATGACT GAGATGTGAA AGAGGGTATA AATTCGAAGT CACTTCCGTA CTTTTTTTTC AGCCTCGTTT CTGCGGTTTT GCCATCAAGT TTATGTGTCT CCTGTTATCT GTCAGTTTCG CTGGATTACC AGTGTCTCAG ATTTGTTCAC AGTGATAAAT TTCACTTGCC AATTCCAAAA ACCAATAGAA TATCCATACT AACAAAACTT CTAGAACTTG GTTTGTTATT TTGCTTAGAC TCCGATACTA TTCTTATCTC TTTACAGATA TATCTTAATT GTATCATAGG TGTTCTCTTT GTTGTTTGTC ATCTTATCAC TTCCGTATAT TGCATCTTTC ATCGCTTCAA GCCTTGTTAA TAATCTTATT CCCTATATCT TTTCAAGAAT ATGTCAGAAG TTGTACCAGA AACGAATACT CCAGTTCAGA CCCCTTCATC CGAAACACAC TTCCACAAAT CAGACACAGC CATAGTCAAC GAATATAAGA AAATGACCCC TGAAGAGCCG GAAAAACCGC TTTCTCCTCC AAATCCCTCT CCGAGCCCTG AGAAGCGAAA GTTAGAAGTG GACGAAAATG AAGAGTCCAA AAGGCAAAAA TACGATTCAG AAGCTCCTGA AGCTGTAGCC AATGAGGCTG CTCCCAATTC GATTAACGTA GAAGAATCTA AAGAAGCTTC TCCAGTTGTT CCTGCAACAG CAGGGACAGC TGTATTTTCG GAACCGGCTC CAAAGCCAGC TGCAGAACCA GATATGGACA ATTTGCCTGC CAACCCATTA CCCCCACATC AAGCCAAGTT TGCCCTCAAC ACCATAAAAG CCATAAAGCG GTTGCGTGAT GCTGTGCCAT TTTTACACCC AGTAGACATC GTCAAGTTGA ACATTCCCTT CTACTACAAC TACATTCCTA GACCTATGGA CTTGTCAACT ATCGAAACCA AAGTACATGT AAATGCCTAC GAAGACTCCA ATCAGATAGT TGAGGACTTC AACTTGATGG TAGCTAATTG TAAGAAGTTC AACGGCGAAA ATGCTGGTAT TTCCAAAATG GCTGATAACA TTCAAGCTCA CTTTGAAAAG CACATGTTGA ATTTTCCTCC CAAAGTTTTA CCATCGGCAG TTGCTGCGGC TAAACCTTCT GCAACTGGAT TGGCTTCGAA GAGAAGAACC GAAGCTGATG CCGTAAAGCA ACAACAGCGC GAGTCAGTAG CTGCTCATAG ACCAAAGAGA ACCATACACC CTCCAAAGTC AAAAGAAATC CCATATGATA CTAAGCCCCG TAAGAAGAAG TTTGCAGCAG AGTTGCGATT CTGTTCCCAG ACTGTCAAGG AGTTAATGTC GAAAAAGCAT AACGGCTACA ACTTCCCGTT TGTCGCTCCT GTAGACCCTG TAGCCTTGAA TATTCCAAAC TACTTCAAAG TAGTGAAAGA ACCGATGGAC TTGGGCACAA TTCAATCCAA GTTGACTAAT AACCAGTACG AAAATGGAGA TGAGTTTGAA CGTGACATAC GTTTGGTATT CAAAAATTGT TACATTTTCA ATCCTGAAGG AAGTGAAGTG AACATGATGG GACATCGTCT TGAAGCAGTT TTCGACAAGA GATGGGCTGC TCGTCCTGTT CCAGAACCAA CGCCTGTCAA TTCTGAAATC GAAGATTCTG AAGAAGAATC GAGCGACGGC GAAGATGAAG AACTGGAAAT CAACGAGTCC ATGTTATCAG ATGTTCCTGC CATTCAGTTC TTGGAAAATC AATTGCTCAG AATGAAGAAG GAACTAGATG AATTGAAGAA GGAACATTTG AAGAAGTTGA GAGAGCAGCA GGCAGCTAGG AAGAAAAAGA GAAAGTCCAA GAAGGCTGCA GCTAAGAAGT CTTCGGCTCC GCCTAGGGCA CCATCGATTT CCTCGACTCC TGTTGTTACT TACGAAATGA AAAAACAAGT CAGTGAGATG GTTCCTAATC TTTCGGACAA GAAATTGCAG TCTCTTATCA AGATCATCAA GGATGATGTT GAAATTAGCA ACGAAGATGA AGTAGAATTG GACATGGACC AATTGGAAGA CCGCACTGTC TTGAAGTTGT ACAACTTCTT GTTTGGCAAG AAGGCTTCAG CCAAGCTTGC TAAGAAGCCA AAGAAACCTG TTATAACTAA CAGTGTTGAT GAATTGGCCC ATTTGAGAAG CCAGTTGGCG TTGTTTGACG ACGACAATAA CAATGGCTCT ACCAATGGAT TCATGAATAT TGGCAACGAC CATGAATCTT CAGAAGACGA TCTCTCCTCT GAAAGTTCTG AAGAGGAATA AGCTCCTCCA CGATATGAAT AAATTAATAG TGGTTTGTTG TCACAAACTA TGGAAATGGA AGCTATGAAT ACTATGTAAA ACTAGCGCCT TGTACAAAGA AAGCTTTTAC TGTTGAGCTA CGCCCACCGC ACCCGTTTCA CACGTATACT ATATTAGTTT GGATATGTTA TTCTTACTTC TATCACAGTT TCAGTTTTAT AGTGTATTAT AAAAGGATTA ATTAATTGAA G
|
Protein sequence | MSEVVPETNT PVQTPSSETH FHKSDTAIVN EYKKMTPEEP EKPLSPPNPS PSPEKRKLEV DENEESKRQK YDSEAPEAVA NEAAPNSINV EESKEASPVV PATAGTAVFS EPAPKPAAEP DMDNLPANPL PPHQAKFALN TIKAIKRLRD AVPFLHPVDI VKLNIPFYYN YIPRPMDLST IETKVHVNAY EDSNQIVEDF NLMVANCKKF NGENAGISKM ADNIQAHFEK HMLNFPPKVL PSAVAAAKPS ATGLASKRRT EADAVKQQQR ESVAAHRPKR TIHPPKSKEI PYDTKPRKKK FAAELRFCSQ TVKELMSKKH NGYNFPFVAP VDPVALNIPN YFKVVKEPMD LGTIQSKLTN NQYENGDEFE RDIRLVFKNC YIFNPEGSEV NMMGHRLEAV FDKRWAARPV PEPTPVNSEI EDSEEESSDG EDEESEINES MLSDVPAIQF LENQLLRMKK ELDELKKEHL KKLREQQAAR KKKRKSKKAA AKKSSAPPRA PSISSTPVVT YEMKKQVSEM VPNLSDKKLQ SLIKIIKDDV EISNEDEVEL DMDQLEDRTV LKLYNFLFGK KASAKLAKKP KKPVITNSVD ELAHLRSQLA LFDDDNNNGS TNGFMNIGND HESSEDDLSS ESSEEE
|
| |