Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_51550 |
Symbol | RFX1 |
ID | 4851288 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1417970 |
End bp | 1420198 |
Gene Length | 2229 bp |
Protein Length | 719 aa |
Translation table | |
GC content | 39% |
IMG OID | 640392996 |
Product | DNA binding protein |
Protein accession | XP_001387499 |
Protein GI | 126274274 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.0273348 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACCTCTCGA TTTCGTCTCA TTTTAATCTC TTCTCGTTGA GTGAAGATAC CTCTAAGCGT ATGGCTCCTG CTCCGAACGA TACCTCCATA ATCAACGAAC TATTGATTAG TTTGACAAAC GTGGACGGCT CCAACATTAA TAACTACCTC TTGTCGCTTC TTGTAAGATT GAATCTTCCT TTCCCCATCG ACGACTTTTA CAATTTGTTG TACAACAATG ACAGAGCAGG CTCCTTGATC CAGCTTGATC AGCCGCAGAA GCTTGATAAA ACTCCAGTAG ATAGAAACAA CGAGTTAGTT ATACGCACAA TCACCCAGTT GCTCAATATC TTCAAAAGAC CAGATCTCCT ACTGGACATG TTGCCCAACT TGGAAGACAA AGAGAACAAG TTGGTAAGCA TCAACTACCA TGAGCTTCTT CGTAGTTTCT TGGCAATAAA GGTCCTCTTT GACATCCTAA TACAGTTGCC TCTTTCGTCA GACGATGATC CCCAGAACTA CACCATTCCC AGACTCTCCA TCTACAAAAC CTACTATATT ATCTGCCAGA AGTTGATCTT GACATATCCT AGCTCTTCTA ATACTACTAA CGAGCAACAG AAGTTGATTC TAGGCCAGTC CAAGCTCGGA AAGTTGATCA AACTCGTGTA TCCCAACTTG TTAATCAAAA GATTGGGAAG TAGAGGTGAA TCCAAGTACA ACTATTTGGG CGTCATCTGG AACGACAAGA TCATCAATGA CGAGATCGCG CGTCTTTGTG AAAATAACGA CTTGCTCGAG TTAACTGAGA TCTTCAAAGG AGAGCAGATG ATGCCTAGAA CCTCTATTTC GGGTCCTGTT AAAAAAGTTC ATAGACGATC CTCCAGTAAG GGAAAGGTCA AATTGGAACA CATCAAGAGA ACTTCCGTAG ATCATGGCTA TCCAGGAAAA TCTGGAGTAA TAGAGCCAGT CTTGCCACCT CCTCTAGCAA CGAGTACTGA ATCGAGTAAC CAGATTTCCA GTCCATCTTT ATCGTTTATA AAACCATTTT TACGGTATCC TCAGGATGAG AACTTTACTG CCTTGAACGA TGAGGAGAAT TGGTTCAATG AGATCAGATT GCAAGTTTAC TCTTCGCATA CTGTTATCAA TCGTGACATG ATCCATCAGA TCTTTCTTGA CAGCTCTAAC TTGTCGACCA ACTCGAGTCT CTTGGATAAC TTATTGGAGA TGTTAATGAA ACCGATTGAT GTCCTTAATA ATGAACCTAA CATAGATTTG AACTTGTATT TGATTATCAT AGTGGAAATT CTTCCGTATT TGATGTTGAT AAAATCATCT ACCAACATCA ATTTCTTGAA AAATCTCCGG TTGAACTTAT TACATTTGAT CAACAATCTT AATCCCAACA TCAAGAAGCT CAACTTGAAG ACTTTCAGCA TCAACAACTC CAGCATCTTC TTGGTTCTTG TCAAGAAATT GATCAACCTC AACGACTTGT TGATTACCTT CATCAAGTTG ATCATCAAAG ATGATACTAC ATCCATCATG TCAACCGATA TTGAAACGTT TTTGAAGGTA AACTCCAGTC AATCAGCTAG CGGTGTCAAG CTAGAAGATC AGGATGAAGA TTCATTTTTC CTCAACTTGA ATGCCAACTT GAGCAATTCT TTAGGTGATT TAAATTTCAA CTTCAAGAAT GATATCTTAT CGAATGATCT CGTATATGCT TTGGTTGGCT ACAATTTTGA CCCAACATTA TTTCAGGATT TGAAATCATC GATTTCGATG AGCTTCATCA ACGAAGAAAT CAACATCATC GACGACTTCT TCAAAAAGGA CTTGTTCGCG TTTTTGAACG ATAACAGCTT CGATTCTGAG TCTGAAGCTA CTAGCGGTAC TCATTCTGAG ACTGGCGAAG CCAAGAGAGA GTCTGATCTG ATCTTATCTA CAAAGGAGTT GTCCAAATTG TACTCCTTGA TTAGTCTCAT TGACAGGAAG TTGTTAGCAA ATCACTACAA GTCAAAATAT CCCATCATGA TCTACAACAA CTTTGTAAGC TACATCCTTA ATGATATTTT GAAATTCATC TTCTTGAAGC AACAGCAGGT GCAACTTCAA AACCTACAAA CCAACACGGA AGTTGAGGCT CCCAATAGCT TCGGTAATTG GTGGGTCTTC AATTCGTTTA TTCAGGAGTA CTTGAGTTTG ATGGGCGAGA TTGTTGGATT ACATGATTCA ATCTCCTAG
|
Protein sequence | NLSISSHFNL FSLSEDTSKR MAPAPNDTSI INELLISLTN VDGSNINNYL LSLLVRLNLP FPIDDFYNLL YNNDRAGSLI QLDQPQKLDK TPVDRNNELV IRTITQLLNI FKRPDLLLDM LPNLEDKENK LVSINYHELL RSFLAIKVLF DILIQLPLSS DDDPQNYTIP RLSIYKTYYI ICQKLILTYP SSSNTTNEQQ KLILGQSKLG KLIKLVYPNL LIKRLGSRGE SKYNYLGVIW NDKIINDEIA RLCENNDLLE LTEIFKGEQM MPRTSISGPV KKVHRRSSSK GKPVLPPPLA TSTESSNQIS SPSLSFIKPF LRYPQDENFT ALNDEENWFN EIRLQVYSSH TVINRDMIHQ IFLDSSNLST NSSLLDNLLE MLMKPIDVLN NEPNIDLNLY LIIIVEILPY LMLIKSSTNI NFLKNLRLNL LHLINNLNPN IKKLNLKTFS INNSSIFLVL VKKLINLNDL LITFIKLIIK DDTTSIMSTD IETFLKVNSS QSASGVKLED QDEDSFFLNL NANLSNSLGD LNFNFKNDIL SNDLVYALVG YNFDPTLFQD LKSSISMSFI NEEINIIDDF FKKDLFAFLN DNSFDSESEA TSGTHSETGE AKRESDLILS TKELSKLYSL ISLIDRKLLA NHYKSKYPIM IYNNFVSYIL NDILKFIFLK QQQVQLQNLQ TNTEVEAPNS FGNWWVFNSF IQEYLSLMGE IVGLHDSIS
|
| |