Gene Information |
Locus tag | PICST_83789 |
Symbol | |
ID | 4839608 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 267188 |
End bp | 268348 |
Gene Length | 1161 bp |
Protein Length | 376 aa |
Translation table | 12 |
GC content | 44% |
IMG OID | 640390923 |
Product | predicted protein |
Protein accession | XP_001384710 |
Protein GI | 150865476 |
COG category | [K] Transcription [L] Replication, recombination and repair |
COG ID | [COG5242] RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit TFB4 |
TIGRFAM ID | [TIGR00627] transcription factor tfb4 |
|
|
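The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, not part of the database record. The function name `span_length` is hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive coordinates. Under that assumption the span reproduces the listed gene length, and the 30 bp left over after accounting for 376 codons plus one stop codon is consistent with the gene being flagged as spliced.

```python
# Values copied from the record above.
START_BP = 267188
END_BP = 268348
GENE_LENGTH_BP = 1161
PROTEIN_LENGTH_AA = 376


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


gene_len = span_length(START_BP, END_BP)

# Bases needed to encode the protein plus one stop codon.
coding_bp = (PROTEIN_LENGTH_AA + 1) * 3

# Bases in the gene span that are not accounted for by the coding sequence.
noncoding_bp = gene_len - coding_bp

print(gene_len)      # 1161
print(coding_bp)     # 1131
print(noncoding_bp)  # 30
```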
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.362151 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGCGA TATCTGACCG AGTCTTCACG GAGACTACTC TGACGGAGAC CTCTAATGAT GACCCGTCAC TTTTGACGGT CATTCTCGAT GTCTCACCAG CTGGTTGGTA CAGAATCCGA GATCAAACAT CAATCGACGA ATTGGCCAAG TCACTTTTAG TTTTCATGAA TGCCCATTTG TCACTCAACA ATTCCAATCA GGTGGCATTT ATAGCGAGTA CACCACAGAA ATCGAAGTTC TTGTTTCCTA ATCCAGAAAT AGACTACGAC GAGATTCGAA CCAGTTCTTC TAGCTCTGGT TCTGCCTCAA ACCAGCATCA GAGTCAAGAT ATCGACTCCA ATGCTAGAGA AGAAACACCT ACTTTAGTAT CTAAGGACAT GTATAGACAA TTTCGAGTTG TTGATGAAGC TGTTCTCGAG GAGTTGAACG TGGTTTTCGA CGAAATTGCT AATGGGATAC AAGATATAAA TAATAACTCT ACTCTATCCG GAGCTCTCAG CATGGCTCTA ACATATACAA ACCGAATGTT GACTCTTGAC CAACTGATTT CTACAACTAC GGCTTCAGCC ATCAACTCTA CTACTAGTAT GGGAGCAGGT TCTGGGTCTG GAAACACAGC TACCAATTCT TCTACCAGCA ATCCTTCCAA CAGCATTACT TCTATGAAAT CGCGTATTCT TATTGTCACA GCCAACGACG AAGACGATGT CAAGTATATT CCCGTGATGA ACTCGATCTT TGCGGCTCAG AAAATGAGGA CTTCCATTGA TATAGCCAAG TTGGGCTTTG AGGACTCGTC ATACTTGCAA CAAGCGGCGG ATGCTACTAA TGGGATTTAC TTCCACGTTC ATGATCCTCG TGGAATTGTG CAGACTTTGA CTTCTGCTTT TTTCATAGAA CCTTCTATCA GACCGTTCAT CATACTCCCA ACCAACTCTA ATGTCAACTA CAGAGCCAGT TGCTTTGTCA CCGGCAAATC CGTGGATATA GGCTTTGTTT GTTCAGTGTG TCTCTGCATC ATGAGCAAGA TTCCACCGTC TGGCAAATGC CCGGCCTGCG AATCGGTGTT TGACGAAAAG ATCATAGCCC AGTTGCTGAA AGGTCCTTCT GTTCTTTCCA AGAAGAAGAG AAAGATAGAT ACGAATGGTG CAGCCAAATA G
|
Protein sequence | MDAISDRVFT ETTSTETSND DPSLLTVILD VSPAGWYRIR DQTSIDELAK SLLVFMNAHL SLNNSNQVAF IASTPQKSKF LFPNPEIDYD EIRTSSSSSG SASNQHQKTP TLVSKDMYRQ FRVVDEAVLE ELNVVFDEIA NGIQDINNNS TLSGALSMAL TYTNRMLTLD QSISTTTASA INSTTSMGAG SGSGNTATNS STSNPSNSIT SMKSRILIVT ANDEDDVKYI PVMNSIFAAQ KMRTSIDIAK LGFEDSSYLQ QAADATNGIY FHVHDPRGIV QTLTSAFFIE PSIRPFIILP TNSNVNYRAS CFVTGKSVDI GFVCSVCLCI MSKIPPSGKC PACESVFDEK IIAQLSKGPS VLSKKKRKID TNGAAK
|
| |
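The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on the first 60 bases of the gene sequence above. It is an illustration only, not the pipeline that produced this record: the helper names `codon_table` and `translate` are hypothetical, and only tables 1 and 12 are handled. Because the gene is spliced, translating the full gene sequence this way would not reproduce the full protein sequence; only the start of the sequence is used here.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(table_id: int = 12) -> dict:
    """Build a codon-to-amino-acid map for table 1 or table 12."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AA))
    if table_id == 12:
        table["CTG"] = "S"  # the one change in table 12: CTG codes for Ser
    elif table_id != 1:
        raise ValueError("this sketch only supports tables 1 and 12")
    return table


def translate(dna: str, table_id: int = 12) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon."""
    table = codon_table(table_id)
    seq = "".join(dna.split()).upper()  # drop the 10-base grouping spaces
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence, as grouped in the record.
first_60 = "ATGGACGCGA TATCTGACCG AGTCTTCACG GAGACTACTC TGACGGAGAC CTCTAATGAT"

print(translate(first_60, table_id=12))  # MDAISDRVFTETTSTETSND
print(translate(first_60, table_id=1))   # MDAISDRVFTETTLTETSND
```

With table 12 the output matches the first 20 residues of the listed protein sequence; with the standard code the CTG codon at position 14 would be read as L instead of S.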