Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_10913 |
Symbol | PPR1 |
ID | 4837524 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 801233 |
End bp | 803161 |
Gene Length | 1929 bp |
Protein Length | 643 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388839 |
Product | Fungal specific transcription factor |
Protein accession | XP_001382926 |
Protein GI | 150864199 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0760081 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.897103 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TCAGGAATCT CAAGGTCTAT ATCGGCCTGT CAGCGGTGTA GAAAGAAGAA AGTGAAATGT GATCAGCGAC TTCCAAAGTG TTCCAAGTGT GAAAAGGCTG GCTCAGAGTG TATTGGCCTT GATCCCATAA CAGGCCGTGA GGTTCCTAGA TCTTACGTGA TGTATTTGGA AGATAGAGTG AACATGCTCG AGCAGAGACT TCGTGACAAG GGGATCACAC TTGACGATCA GGAAACAACC GAAATTCGCA ACAGCAAGGT TAAAGTAGAA GATAGAGTCC TTTCAAAGGG CAGTATTCCT TGTGAGTCTT CTCTGGTGAT AGTGAAAAGC ATTCTGATAA GCTCAGAGAA GACTGATATA CACACATTAG ACGAAAGTCA ACACGCACCA AGTATCGCCT TTTCACGTTT GATGTCAACA GCTGTGAAAA TACAGAAAAG GTCACTTTCT AACCTCCAGC ATCCTCTAGC TTCAAAGAGA TTTGCCAATA ACTCAGGTGC AACTACGGAT CCGAGCAACG GTATTTTACC AGCGATATTG CCTCCTAAAT CAACGGCATT ACAGTTCATT CTGATCTTTT TCACCCAGGC GAATCCTCAG TTGCCAATTT TCCATCGTGA AGAGTTCATC TCCAAATACT TCATACCTAT CTACGGCTAC TTCAACTATG AAGAAGTGTC TATTGCGTCA GATTATACAT CTATAAATGC GGCTTTCTTC GAGAACGAAA ATATGAAAGT ACCGTGGACA CCGTGGTTTG AGCAGTATAA GGACAATTTC CAGTCAATCT TGGGCACAAA TGCTAGCAAA GCTTCCAATG ACGAAATCAA AAAGATTTCC AACTCTATTA TTCCACCAAA ACAATATAGA AAGGGACTCT TCTTCCTAAA CATAATTTTC GCAATAGCGT CTTCTGTGAA TCATCTACAA TACCCGATTT CCATCTCGGA GTCGTTTCGC CTTGCAGCCA ACAAGTACTT CGAAGAAGTG AATAGCTCTA CAGATCAACT TGAATCCTTG CTGGCTATAT TAACATATTC ATTATATTCT ACCATGAGAC CTTCAAATCC TGGTGTCTGG TACACGATGG GATCGGCATT GAGAACTTGT GTAGATTTGG ACTTGCACAA CGAAAGCAAC TCATCAAGTG CCCATAATAT CGATTCCTTC ACTAAGGAAA AAAGAAGAAG GCTCTTTTGG TGTACTTATT GCATAGATAG ACAAATCTGT TTCTATCTAG ATCGTCCCGT GGGAATTCCT GACGAGAGCA TCAATACGCC TTTCCCAACA GTATTGGACG ATTCTCTAGT TTATCCCAAT GAAACGATCA AGGATTTCTC ACTTCTTGCT AACTCTACGC CCTCATACAA GACAGTCTTT TTGTCCATGC TTCAGATGAG AAAGATTCAG TCAGAAGTGC AAAAGGTTTT GTATACTAGT TTCGAATTGC CTCGTCGTTA CGATGGGTTG GATAGCTGGA AGAAATCGAT CTTGAACAGG CTCGGTCTTT GGAAGCAGAA TATCCCTAAA TCTCGTAAAG AAATGAACTG TGACTTCAAT CTAAACTTCT TCCATTTGAA TTACTATCAT GTCAAGCTCA TGATCCACGG CTTGTCCCCC AAAAATTACA AGCTTTCATC TAACGATTAC ACACAAGTCA AGTATGCTGC CAAGCAGATG ATCAATTGTT ATGCGCAATT GTTCTCAACA AAGAGTATCA ATTATACGTG GGCAGCAGTC CACAATATTT TCATGGCAGG AACATCCTAT CTACACACGA TATACAACAG TGAAGATGCC AGAGTCGATG AGCCATTGCA TGAAGTGAAA CGAGTATCGT CTGATTGTTT GAATGTGCTC AATTCCTTAA TCGACAGCTG CGAGGCTGCT TCTCACTGTG TAGAAGTATT TCAAAGTTTG ACAATGGTT
|
Protein sequence | SGISRSISAC QRCRKKKVKC DQRLPKCSKC EKAGSECIGL DPITGREVPR SYVMYLEDRV NMLEQRLRDK GITLDDQETT EIRNSKVKVE DRVLSKGSIP CESSSVIVKS ISISSEKTDI HTLDESQHAP SIAFSRLMST AVKIQKRSLS NLQHPLASKR FANNSGATTD PSNGILPAIL PPKSTALQFI SIFFTQANPQ LPIFHREEFI SKYFIPIYGY FNYEEVSIAS DYTSINAAFF ENENMKVPWT PWFEQYKDNF QSILGTNASK ASNDEIKKIS NSIIPPKQYR KGLFFLNIIF AIASSVNHLQ YPISISESFR LAANKYFEEV NSSTDQLESL SAILTYSLYS TMRPSNPGVW YTMGSALRTC VDLDLHNESN SSSAHNIDSF TKEKRRRLFW CTYCIDRQIC FYLDRPVGIP DESINTPFPT VLDDSLVYPN ETIKDFSLLA NSTPSYKTVF LSMLQMRKIQ SEVQKVLYTS FELPRRYDGL DSWKKSILNR LGLWKQNIPK SRKEMNCDFN LNFFHLNYYH VKLMIHGLSP KNYKLSSNDY TQVKYAAKQM INCYAQLFST KSINYTWAAV HNIFMAGTSY LHTIYNSEDA RVDEPLHEVK RVSSDCLNVL NSLIDSCEAA SHCVEVFQSL TMV
|
| |