Gene PICST_51550 details


Gene Information

Locus tag: PICST_51550
Symbol: RFX1
ID: 4851288
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 1417970
End bp: 1420198
Gene Length: 2229 bp
Protein Length: 719 aa
Translation table:
GC content: 39%
IMG OID: 640392996
Product: DNA binding protein
Protein accession: XP_001387499
Protein GI: 126274274
COG category:
COG ID:
TIGRFAM ID:
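
The gene length follows from the start and end coordinates listed in this record. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it matches the 2229 bp reported here). It also compares the span with the coding bases implied by the protein length, assuming the gene span includes one stop codon that is not counted in the 719 aa. The function name `span_length` is hypothetical and used only for illustration.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information fields of this record.
gene_length = span_length(1417970, 1420198)
print(gene_length)  # 2229

# Assumption: 719 codons for the protein plus one stop codon.
coding_bp = (719 + 1) * 3
noncoding_bp = gene_length - coding_bp
print(coding_bp, noncoding_bp)  # 2160 69
```

Under these assumptions the 69 bp difference is consistent with the record marking the gene as spliced, but the record itself does not list intron coordinates.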


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0273348
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
AACCTCTCGA TTTCGTCTCA TTTTAATCTC TTCTCGTTGA GTGAAGATAC CTCTAAGCGT 
ATGGCTCCTG CTCCGAACGA TACCTCCATA ATCAACGAAC TATTGATTAG TTTGACAAAC
GTGGACGGCT CCAACATTAA TAACTACCTC TTGTCGCTTC TTGTAAGATT GAATCTTCCT
TTCCCCATCG ACGACTTTTA CAATTTGTTG TACAACAATG ACAGAGCAGG CTCCTTGATC
CAGCTTGATC AGCCGCAGAA GCTTGATAAA ACTCCAGTAG ATAGAAACAA CGAGTTAGTT
ATACGCACAA TCACCCAGTT GCTCAATATC TTCAAAAGAC CAGATCTCCT ACTGGACATG
TTGCCCAACT TGGAAGACAA AGAGAACAAG TTGGTAAGCA TCAACTACCA TGAGCTTCTT
CGTAGTTTCT TGGCAATAAA GGTCCTCTTT GACATCCTAA TACAGTTGCC TCTTTCGTCA
GACGATGATC CCCAGAACTA CACCATTCCC AGACTCTCCA TCTACAAAAC CTACTATATT
ATCTGCCAGA AGTTGATCTT GACATATCCT AGCTCTTCTA ATACTACTAA CGAGCAACAG
AAGTTGATTC TAGGCCAGTC CAAGCTCGGA AAGTTGATCA AACTCGTGTA TCCCAACTTG
TTAATCAAAA GATTGGGAAG TAGAGGTGAA TCCAAGTACA ACTATTTGGG CGTCATCTGG
AACGACAAGA TCATCAATGA CGAGATCGCG CGTCTTTGTG AAAATAACGA CTTGCTCGAG
TTAACTGAGA TCTTCAAAGG AGAGCAGATG ATGCCTAGAA CCTCTATTTC GGGTCCTGTT
AAAAAAGTTC ATAGACGATC CTCCAGTAAG GGAAAGGTCA AATTGGAACA CATCAAGAGA
ACTTCCGTAG ATCATGGCTA TCCAGGAAAA TCTGGAGTAA TAGAGCCAGT CTTGCCACCT
CCTCTAGCAA CGAGTACTGA ATCGAGTAAC CAGATTTCCA GTCCATCTTT ATCGTTTATA
AAACCATTTT TACGGTATCC TCAGGATGAG AACTTTACTG CCTTGAACGA TGAGGAGAAT
TGGTTCAATG AGATCAGATT GCAAGTTTAC TCTTCGCATA CTGTTATCAA TCGTGACATG
ATCCATCAGA TCTTTCTTGA CAGCTCTAAC TTGTCGACCA ACTCGAGTCT CTTGGATAAC
TTATTGGAGA TGTTAATGAA ACCGATTGAT GTCCTTAATA ATGAACCTAA CATAGATTTG
AACTTGTATT TGATTATCAT AGTGGAAATT CTTCCGTATT TGATGTTGAT AAAATCATCT
ACCAACATCA ATTTCTTGAA AAATCTCCGG TTGAACTTAT TACATTTGAT CAACAATCTT
AATCCCAACA TCAAGAAGCT CAACTTGAAG ACTTTCAGCA TCAACAACTC CAGCATCTTC
TTGGTTCTTG TCAAGAAATT GATCAACCTC AACGACTTGT TGATTACCTT CATCAAGTTG
ATCATCAAAG ATGATACTAC ATCCATCATG TCAACCGATA TTGAAACGTT TTTGAAGGTA
AACTCCAGTC AATCAGCTAG CGGTGTCAAG CTAGAAGATC AGGATGAAGA TTCATTTTTC
CTCAACTTGA ATGCCAACTT GAGCAATTCT TTAGGTGATT TAAATTTCAA CTTCAAGAAT
GATATCTTAT CGAATGATCT CGTATATGCT TTGGTTGGCT ACAATTTTGA CCCAACATTA
TTTCAGGATT TGAAATCATC GATTTCGATG AGCTTCATCA ACGAAGAAAT CAACATCATC
GACGACTTCT TCAAAAAGGA CTTGTTCGCG TTTTTGAACG ATAACAGCTT CGATTCTGAG
TCTGAAGCTA CTAGCGGTAC TCATTCTGAG ACTGGCGAAG CCAAGAGAGA GTCTGATCTG
ATCTTATCTA CAAAGGAGTT GTCCAAATTG TACTCCTTGA TTAGTCTCAT TGACAGGAAG
TTGTTAGCAA ATCACTACAA GTCAAAATAT CCCATCATGA TCTACAACAA CTTTGTAAGC
TACATCCTTA ATGATATTTT GAAATTCATC TTCTTGAAGC AACAGCAGGT GCAACTTCAA
AACCTACAAA CCAACACGGA AGTTGAGGCT CCCAATAGCT TCGGTAATTG GTGGGTCTTC
AATTCGTTTA TTCAGGAGTA CTTGAGTTTG ATGGGCGAGA TTGTTGGATT ACATGATTCA
ATCTCCTAG
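
The gene sequence above is printed in blocks of 10 bases, 60 per line, so the whitespace has to be removed before the sequence can be measured. The following is a minimal sketch of how the length and GC content could be recomputed from such a listing. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes GC content means the percentage of G and C bases over the whole sequence.

```python
def clean_sequence(text: str) -> str:
    """Drop all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence in this record, used as a short example.
first_line = "AACCTCTCGA TTTCGTCTCA TTTTAATCTC TTCTCGTTGA GTGAAGATAC CTCTAAGCGT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_percent(first_line), 1))
```

Passing the full listing to `clean_sequence` should yield a string whose length can be compared with the Gene Length field, and `gc_percent` can be compared with the GC content field.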
 
Protein sequence
NLSISSHFNL FSLSEDTSKR MAPAPNDTSI INELLISLTN VDGSNINNYL LSLLVRLNLP 
FPIDDFYNLL YNNDRAGSLI QLDQPQKLDK TPVDRNNELV IRTITQLLNI FKRPDLLLDM
LPNLEDKENK LVSINYHELL RSFLAIKVLF DILIQLPLSS DDDPQNYTIP RLSIYKTYYI
ICQKLILTYP SSSNTTNEQQ KLILGQSKLG KLIKLVYPNL LIKRLGSRGE SKYNYLGVIW
NDKIINDEIA RLCENNDLLE LTEIFKGEQM MPRTSISGPV KKVHRRSSSK GKPVLPPPLA
TSTESSNQIS SPSLSFIKPF LRYPQDENFT ALNDEENWFN EIRLQVYSSH TVINRDMIHQ
IFLDSSNLST NSSLLDNLLE MLMKPIDVLN NEPNIDLNLY LIIIVEILPY LMLIKSSTNI
NFLKNLRLNL LHLINNLNPN IKKLNLKTFS INNSSIFLVL VKKLINLNDL LITFIKLIIK
DDTTSIMSTD IETFLKVNSS QSASGVKLED QDEDSFFLNL NANLSNSLGD LNFNFKNDIL
SNDLVYALVG YNFDPTLFQD LKSSISMSFI NEEINIIDDF FKKDLFAFLN DNSFDSESEA
TSGTHSETGE AKRESDLILS TKELSKLYSL ISLIDRKLLA NHYKSKYPIM IYNNFVSYIL
NDILKFIFLK QQQVQLQNLQ TNTEVEAPNS FGNWWVFNSF IQEYLSLMGE IVGLHDSIS