Gene Information |
Locus tag | PICST_90675 |
Symbol | |
ID | 4840120 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 258163 |
End bp | 262014 |
Gene Length | 3852 bp |
Protein Length | 1114 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640391435 |
Product | predicted protein |
Protein accession | XP_001385388 |
Protein GI | 150865960 |
COG category | [K] Transcription |
COG ID | [COG5179] Transcription initiation factor TFIID, subunit TAF1 |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
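The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only. It assumes the start and end coordinates are inclusive, that the gene span runs from the start codon through the stop codon, and that any remaining bases are intronic, which is consistent with the record marking the gene as spliced. The variable names are made up for this example and are not database fields.

```python
# Values copied from the record above.
start_bp = 258163
end_bp = 262014
protein_length_aa = 1114

# Inclusive coordinates: the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the span includes the stop codon, hence the extra codon.
coding_bp = (protein_length_aa + 1) * 3

# Assumption: whatever is left over belongs to introns.
non_coding_bp = gene_length_bp - coding_bp

print(gene_length_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the reported gene length of 3852 bp, and the difference between the span and the coding bases gives a rough estimate of the total intron length.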
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.621061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.00727478 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | CCCAAACTCT AAGCTTCAGC ATCAAACTAG TTCAGTACCA ATTGCATCTC TTTCTTCTCT CATTTGCATC CTCGATCTCC CTTATCTTAA TCTCACAGGC TGCATTTTGT CGTTCATAAA CCATTTTATT AAATCGAAAT TTATAAATAC AGCTCAAAAA CCGCGGAACT CGTATGAAGA TTCCCTACAT ATTGCCTCGT TTGCTGCAAC TTACCAACAA GGAGACGATG GTAGAGAGAA GCCGCAAAGG CAACTCGAAA GTCGAGGATG AGACCGCCTT CAACCAATTT GTAGAATCAG CTGAAGTTCC TGACGATACC GTGATAGAAG CTATTTTTCT GGAAAGGAAA ACGGAACATG CTGTAGACGC TGTAGACTAC GAAGATATCG ATGAGTTGGC AGATGAAGAA GACCTTCCCG AGGAAGAAGT TAGCACCAAC AATTTGGACG ATGTAGACGA AAACGACAAC TTCGAAGCGT TTCTAGATGA ACCTCAGAAA GAAGAAGAGC TCCAAAACAG AGCCAATGAA GAGTTTGATG ACATGTTTGG CGACGGAGGG TTTGAGGATG CTGGTGGAAA CAACAACTTG TTCACTGATT TGGATCACAA TAACCATCTC ATAGATGATG ACGGCTTGAC GGACTTGAAC ATGGGTGGAA TATTTGATGA AGAAACTACT CAAAGTCAGA CAAGCATTCC TTCTATCAAG AAAAGAGTTC TTCTGGAAGA ACAGAGAAAG ATAAAAAGAC AAAGACTAGA GAATATCGTA AAACGACTTG AAAATCAGAG AGTTAGACGT AATATAGCAT ACCACTTCCC ATCCTATTCC AGAAACAAGC CATTTAACTT TCATAACTTT TTTCTTCCTG AGCCCAAATA CTATCGGTAT CAGAAGCCAT ACATCGCGTA CAAGGAGAAT ATCAAGCCTC TTGTGCCTAC AAAGTTAAGT CTTGAGTTGG AAGTTGATGA GAGAAAACTA TTCAAATCTA GAAAGCCTTT ATCAAATTCC AAATCTACTA AAGGTATAGT AAGTGTAACC CAGCAAGATA TTGAATTCAT CAGAGACTTG AACAGTAAGA GCTCCCTGAC AGAGACATTA TTGAAACAGA TCTCGTTCTT GAAGAATGAC TGGACCAATA ACGACAAATT CACAAACTAT TCCAAAGACT TGATTCTTTC TACTACAGAT TGGAACGACG ATGCCATTAT CAATGCTGGC GATACCAAGT TCGCAGTCAA GAAAGTCAAC TTTGAATCGA ACACTGTTGA TACGCTGACA TTTGACGAAG AAGATGAAAA CATATTCAAT GGCCAGTTGA ACACAGATCT ATTGAAGTTG GACATGAATG ATCCTAACTT GTTGTTTGTT CCGGAAAAGA AGTCAAATAA ATCGAAATCG TTGATTCCAA CTAACGAAAA GTTGTTGCTG TTGAAATTCA ACATTTCAAA TGATAAACAG TACGAAATCT TGAAGTCTAA CTACAATACT AAGGTTAGAA GTCAGTTGAG TAACTTAAAT ATCGAACATT CCATCCCTGC ATTAAGACTT CAATCTCCAT ACTACAAGGT CAAGTTGACT AAGGAAGAAT CTCGTTCCTT CCACAGACCC AAGTTTGTTA TCAGACCAGG TTCTTTGATA AGTTTCTCCA AACTTAAATT GCGTAAGAAG AAGAAAGATC GTGGTAAATC ACTACAAGAA GTATTTTCTA AAACTTCAGA TTTGTCTACT ACTGACACTG CTCCCTTGGT TGCAATGGAA TACTCAGAAG AATATCCGCT CATACTTTCA AACTTTGGTA 
TGGGCTCCAA GATGATCAAC TATTATAGAA AAGAAAGGGA AGACGATTCA TCAAGACCAA AGGCTCAAAT TGGCGAGACA CATGTCCTTG GTGTTGAGGA CAGATCTCCC TTCTGGAATT TTGGAGAGGT AGCCCCGGGA GATTTTGTTC CAACTTTGTA TAACAATATG GTAAGAGCTC CTATATTCAA GCATGAAGTT AAGAATACTG ATTTCCTCTT CATCAAATCC CAAGGTGCCG GTTCGCATCA AAGGTACTTC TTGAGGGCTA TTAACTTCAT GTTCTCTGTG GGTAATGTCT TTCCTGCTGT AGAAATCCCA GCACCACATT CCAGAAAGGT TACCAATACG TCGAAGAATA GATTAAAGAT GGTTGTTTTC AGAGTAATGA ATAGCATGGG TGTCGCTCGT GTGTCGGTTA AAGACGTTTC TAGACACTTT CCTGACCAGA ATGATATGCA AAATAGACAG AGATTGAAGG AGTTCATGGA ATATCAGAGA CAGGGTGATG ACCAAGGTTT CTGGAAAATC AGAAATAGCG ACAGTGTTCC AAATGAGAAC GAAATAAGAT CCATGATTAC TCCCGAAGAC ATTTCTTTAT TGGATAGTAT GCAATACGGT CAACAGACAC TAGACGATAC CTACATGTTG TTTAACAATG ATAAAAAGGA CGAAGCCAAA AAGGAAGAAA AGAAGGAGAA AAAAGAAATC GAATCTAATG AAAAAGATAA AGAAAAAGAA AAGGAAAAAG ATAAAGATAA AGAGGACAAT GAAAACGAAG CCCAAGAGAA TGGCTCGAAG AAAGAAAAGG ATAAAGAAAA AGAACGAACA AGAAGACCCA GAGATCCAGA TGCCGAAGTT GATATGGACG AGGAATTGGC TCCTTGGAAC TTGTCTCGTA ACTTTGTCAA CGCAAACCAG ACTAAGTCGA TGTTGCAGCT TAACGGTGAA GGAGATCCTA CAGGTATTGG GTTGGGCTAT TCTCTTTTGA GATCTACACA GAAAAATGGT TTCAAGCCGC TATTTGCACC TGTCAGAGAG AATGTTCCTA AAAATAATAC TGCTGCCTAT CAGCAGAAAT TATATGAGGC TGAGGTCAAG AGAATTTGGT ACTCTCAGAG GAGTTCGTTG GTCGATCACG GACCTGACTT CAATCTTCAA GCCATCTATG ATGAATACAA ACCAGTCAAT CAAATGAAAA CGATAAAAAA CATGGTCAAA CAGGAAGACA AGAATGACTT CGAAAACAAG GTGTTGAGAA TTACTAGAAG AGTCAGAGAC GAGAATGGCA TTCTCCAGAG AAAAGTTGAA ACAATCACCG ATCCGAGGTT GATCACAGCA TACATCAGAC GCAAGAAGCA GATTGAAGAT GAAATGATGA AGAATGCTGA AGTGGGTGAC ATTTTGCCTA CAAATGACAA GGAATTGAAC AAGATTCGTC GTAAGATTTT GGAAGAGAAA TTGGCTTCAG CAGAAAAAAG AGCCAAACAG AACAAATCTA AGAAGCCTCC CAAAGATTCT ATCCATGCTG CTGCCGCTGC TGGTGGTACG ATTATCGATG CCAACACTGT CATGTTGCCG GATGGTTCAT ATGCCATCGG TGGTAAAGGT ATTGGAAAGG GTAAGTCTAA AACACGAAGG TGTGCTTCTT GTGGAGCATT TGGACACATT AGAACTAACA AGACATGTCC ACTTTATGCA ATCACAAGAG GTGGTACTGT TCCTCTCCAG AAAGACGAAC AAGGTAATCC TATAATCCCA GCATCAGTGA TTATTCCACC AGGTATGATT GGTATTACTG CAAGACATGC 
ACCACCAGTA CCTGAACAAG CTCCAGCTTC TGACGTGGCA CAAGCGACTT CTGCGTCCCC ACCAGCTCCA GCTACAGCTG CACCTACGGA TCAATAATAA ATTTTACGCA TAAGTCTATA AGCAACCATG AAACGAAACA AACTATAGTT GTAGTATAAT CTATTCATTG TAAAGTGAAT CTATATAATG TGAGTTGTTA CGGGCGGGCC GTCCACGCAT TTAGAATGTA ATTTGCTTGA TG
|
Protein sequence | MKIPYILPRL SQLTNKETMV ERSRKGNSKV EDETAFNQFV ESAEVPDDTV IEAIFSERKT EHAVDAVDYE DIDELADEED LPEEEVSTNN LDDVDENDNF EAFLDEPQKE EELQNRANEE FDDMFGDGGF EDAGGNNNLF TDLDHNNHLI DDDGLTDLNM GGIFDEETTQ SQTSIPSIKK RVLSEEQRKI KRQRLENIVK RLENQRVRRN IAYHFPSYSR NKPFNFHNFF LPEPKYYRYQ KPYIAYKENI KPLVPTKLSL ELEVDERKLF KSRKPLSNSK STKGIVSVTQ QDIEFIRDLN SKSSSTETLL KQISFLKNDW TNNDKFTNYS KDLILSTTDW NDDAIINAGD TKFAVKKVNF ESNTVDTSTF DEEDENIFNG QLNTDLLKLD MNDPNLLFVP EKKSNKSKSL IPTNEKLLSL KFNISNDKQY EILKSNYNTK VRSQLSNLNI EHSIPALRLQ SPYYKVKLTK EESRSFHRPK FVIRPGSLIS FSKLKLRKKK KDRGKSLQEV FSKTSDLSTT DTAPLVAMEY SEEYPLILSN FGMGSKMINY YRKEREDDSS RPKAQIGETH VLGVEDRSPF WNFGEVAPGD FVPTLYNNMV RAPIFKHEVK NTDFLFIKSQ GAGSHQRYFL RAINFMFSVG NVFPAVEIPA PHSRKVTNTS KNRLKMVVFR VMNSMGVARV SVKDVSRHFP DQNDMQNRQR LKEFMEYQRQ GDDQGFWKIR NSDSVPNENE IRSMITPEDI SLLDSMQYGQ QTLDDTYIPR DPDAEVDMDE ELAPWNLSRN FVNANQTKSM LQLNGEGDPT GIGLGYSLLR STQKNGFKPL FAPVRENVPK NNTAAYQQKL YEAEVKRIWY SQRSSLVDHG PDFNLQAIYD EYKPVNQMKT IKNMVKQEDK NDFENKVLRI TRRVRDENGI LQRKVETITD PRLITAYIRR KKQIEDEMMK NAEVGDILPT NDKELNKIRR KILEEKLASA EKRAKQNKSK KPPKDSIHAA AAAGGTIIDA NTVMLPDGSY AIGGKGIGKG KSKTRRCASC GAFGHIRTNK TCPLYAITRG GTVPLQKDEQ GNPIIPASVI IPPGMIGITA RHAPPVPEQA PASDVAQATS ASPPAPATAA PTDQ
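Translation table 12 is the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates this using the first 60 coding bases of the gene sequence above. The position of the start codon was located by matching against the protein sequence, so treat it as an assumption. The codon dictionaries are deliberately partial, covering only the codons that occur in this fragment, and the names `STANDARD`, `TABLE_12`, and `translate` are hypothetical helpers, not part of any library.

```python
# First 60 bases of the coding region, copied from the gene sequence
# above with the spacing removed (start position is an assumption).
coding_start = "ATGAAGATTCCCTACATATTGCCTCGTTTGCTGCAACTTACCAACAAGGAGACGATGGTA"

# Partial standard genetic code: only the codons present in this fragment.
STANDARD = {
    "ATG": "M", "AAG": "K", "ATT": "I", "CCC": "P", "TAC": "Y",
    "ATA": "I", "TTG": "L", "CCT": "P", "CGT": "R", "CTG": "L",
    "CAA": "Q", "CTT": "L", "ACC": "T", "AAC": "N", "GAG": "E",
    "ACG": "T", "GTA": "V",
}

# Table 12 reassigns CTG from leucine to serine.
TABLE_12 = {**STANDARD, "CTG": "S"}


def translate(dna, table):
    """Translate complete codons of dna using the given codon table."""
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


with_table_12 = translate(coding_start, TABLE_12)
with_standard = translate(coding_start, STANDARD)
print(with_table_12)
print(with_standard)
```

The table 12 translation reproduces the first 20 residues of the protein sequence in the record, while the standard code yields a leucine at position 11 where the record has a serine.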
|
| |