Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_91503 |
Symbol | TAF12 |
ID | 4840894 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009048 |
Strand | + |
Start bp | 615608 |
End bp | 617980 |
Gene Length | 2373 bp |
Protein Length | 520 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640392209 |
Product | Transcription initiation factor TFIID subunit 12 (TBP-associated factor 12) (TBP-associated factor 61 kDa) (TAFII-61) (TAFII61) (TAFII-68) (TAFII68) |
Protein accession | XP_001386520 |
Protein GI | 150866801 |
COG category | [K] Transcription |
COG ID | [COG5624] Transcription initiation factor TFIID, subunit TAF12 (also component of histone acetyltransferase SAGA) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0374369 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.805326 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGGAC TTCCTGGTCT GAGTGGACAG AGAAACCCTA ATGCACCTAA TAGATCAGGT GGAGCTATCA ACATTCATCC CAGTCAAGTA CAACAATTGG TGCAAGTGCT TAAGAACGAA GTTCAGTTGG GAAAGAATGC TTCAGATGAA ACTGAAAAGA AGAAACACTA TGCCAAAGCT GAAGGAATCA GAACCCTTCT TTTGAATTAT CAAGCGCAAC AAAGGGCGAG GTCACAACAA CAGCAAAATC AACAAAATCA AAATCAAAAT CAAAATCAAA GCCAACAAAA TCAGATCCAT GTCCAAACCC AAAATCAACA GAACCAAATG CAACCGGCCC AACAACAAGT AAGGCAGCTA ACAGCTCAAC AACAATTGCA ACAGCAACAG CAACAGCAGC AAGCTCATCA ACAGCAGATT CAGGCCGTCC AGGCTCAGGC TCAGGCTCTA GCTGCTGCCC AGATGCTGAA TCCGTTGATG CAACAGAGTA GCCAATCGCC AATGGCTATG AACTTATCTC AACAGAGCAG ACAACCTTCA AGCCAGCCTA CGCCCGTTTT GCAGTCTCCA CAGTTGAGTC AACAACAGCA ACAACAAATA AGACAATCGC CAATGATGCA ACAGAAACCA ACTCCCCAGA TGCGGCAGTC TCCTATTGTA GCACATCTGC TGCCTCCAGG TTCTCAGCAG ATGAGATCTG GCTCTGCGGG ATCGCAAGGC TCTCCAGCTC CCATTGCAGG TACTCCATCA ATTCCAGCCA ATATGGCTAC TGTAGAACGA TTCAATATTG TCAAGCAGAA GTTAACTGAA GTTCAGCATA GAATCCAATT CCTTGAGCAG AGTAAAAAAG GAAGTAATAT TGGGCCTGAT GAACTTGCAA CTATCGACAA AGAGCTTCTT GAGCAGAAGA CAAAGTTCCT GCAATTCCAG AAACTCGGTC TTTTCATTAG AAATCTGTTG ACTCAACAGG CTCAGGCTAG AGCCAATACA CCACAAGCAG CAACAGGGAC CCCTCCGCAA TTTCAACCAC AACAGCAACA GCAGCAGATG CAACAGCAGC AGCAGTTCAT AAACAAACAG AACCAACAAG CGATTTCATC TCCAGCTTTG CAAAAGAATC AGCTCACTGT ACAACAGCAA CAACAATTGC AACAGCAGAT CCAACAAAGA TCGCAGCAAC AGACTCTCTT ACAACAGAGA GTGCAACAGC AACAGACTAC AATACAGCAG CAACAACAAA TGCATCAACA GCAACAGATG CAACAGCAGC AGCAACAGCA AAAGCAGCTG ATTCAACAGC AACAACAGAT GCAACAAATC AAGCGCGCAC AAGCATCTTC TCCACAGTTG CAACAGCAGC CTCAACGGGT GCCTAGTAGA CAAGCAACAG CAACTCCTCA ACAACATCCC TCTGTTGCCA GTAGACCTGC GTCAATTACT TCAGCCGGTC AAGCTCAGAA GCCTACTGGC AGTAGACCTC CTTCTACGGC ACACAGCCAA TCTACGCCTA GTTTCAGTGG TATACCCAAT GCGACTAACG TTGTAGAATC CGCTAATTCT GGAGCTGCTA ATGCATCTGC TTCAGCGGTT GTTGCTGCTG CTGTTGCTGG TAGTAGCGCA GGCGCTAAAT CAGTGTCGCC AGCTCCTGCT ACTGGAGACA AGTCGTCACC CAAATCGACA ACTCCTCAGA AGCTGCAGTC ATTGAGAACT CCTGGTCCTC CTCCGATCAA TTTAGCAGGA ATCACCAAAC CTCAGGTACC ATCGATTCCA ATCTCTGCAA CAATCAACGT CAAGCCTCCT ACGGCTGTGA CCTTGAAGGC AACAGGTGAT AGTAGACCTA CGTTGACTGG AGGTGAAGCA AATAGTTTGA GTATTCTCTT GAATACCCCA GCTATCACCA AATTGCCTAC TTTTGAGTTG TCTTCTGGAG GAACCAATGG TAGTTTACCT GAAACAGGGC AGAGAGCCTT GACCAAACGG AAACTCTCGG AGTTAATCAG CACTATGGGC GTGGATGAAG GCGATGGTAA AACTAACATA GACGGCAATG TCGAAGAGCT CTTGTTGGAT TTAGCCGATG AGTTCATCAA CTCGGTGACT TCCTTCTCCT GTAGACTAGC CAAACATAGA AAGGTGGATT CCATAGATAC CAAGGACGTG CAATTACATT TGGAAAGAAA CTGGAACATC AAGATTCCTG GCTATGCTAT GGACGAAATT AGATCGACTA GAAAGTTGCA GCCTTCTACA AGCTACAACC AGAAGGTTCA AGGTGTCGAG ATCTCCAGAT CGGTCAATGG TGATATCAAC GGGTAAGAAT TTTTCTCTGT GTAACTATTT AACTGGGTAT AGTAATGTAC TAGAATTTAA CAT
|
Protein sequence | MSGLPGSSGQ RNPNAPNRSG GAINIHPSQV QQLVQVLKNE VQLGKNASDE TEKKKHYAKA EGIRTLLLNY QAQQRARSQQ QQNQQNQNQN QNQSQQNQIH SSQSPMAMNL SQQSRQPSSQ PTPVLQSPQL SSQQMRSGSA GSQGSPAPIA GTPSIPANMA TVERFNIVKQ KLTEVQHRIQ FLEQSKKGSN IGPDELATID KELLEQKTKF SQFQKLGLFI RNSLTQQAQA RANTPQAATG TPPQFQPQQQ QQQMQQQQQP ASITSAGQAQ KPTGSRPPST AHSQSTPSFS AKSVSPAPAT GDKSSPKSTT PQKSQSLRTP GPPPINLAGI TKPQVPSIPI SATINVKPPT AVTLKATGDS RPTLTGGEAN SLSILLNTPA ITKLPTFELS SGGTNGSLPE TGQRALTKRK LSELISTMGV DEGDGKTNID GNVEELLLDL ADEFINSVTS FSCRLAKHRK VDSIDTKDVQ LHLERNWNIK IPGYAMDEIR STRKLQPSTS YNQKVQGVEI SRSVNGDING
|
| |