Gene PICST_68747 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68747 
SymbolRLM1 
ID4851331 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp1519243 
End bp1521492 
Gene Length2250 bp 
Protein Length517 aa 
Translation table 
GC content43% 
IMG OID640393039 
Producttranscription factor of the MADS box family 
Protein accessionXP_001387522 
Protein GI126274372 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TGAAAAGAAA TCTCGTGCTG TAGACCGAGA CTGAAATTAA AAGTGTTTGT CAGCATCACG 
TTCTTGAATA GTAGAGTGAA TCGTGTCTTG AAGTGTATCC ATCGTCATTT TATTTGGCTA
TTTACATTTG AGTTTTGATA TTGAAACCGA TATTGCCAAC ACTATCAACA AGGCACCCGA
CTATATCTGC TATTTTATCA TTAACATTTA CATTCTCAGT ACTTTTCGCA AATTTTTCAC
AAACAAGCGA GAGTCATACT AGTGATACTT TATTCAATTA ATTCAATTGA ACAATCATCA
TAACATTCGA AAACAGCCCG TACAATCAAA CAAAACATCG AGTAAGACTG AGTCTAACAA
CCACTCTGCA TTGTATCCAT CATACTATAA GACTATTTTA ACAATAATAA AAGTGATATA
GCACCATGGG CAGAAGAAAA ATAGAGATCC AGCCCTTGAC GGATGACAGA AACAGAACAG
TAACGTTTGT CAAACGTAAG GCTGGGCTTT TCAAGAAAGC ACACGAGTTG GCCGTGCTTT
GTCAGGTAGA TCTTGCGGTG ATCATAGTTG GTTCCAACAA CAAGATCTAC GAATTTTCGT
CTGTGGACAC CGGCGAATTG ATCAAATCGT ACCAGCATAC ATGTAAAACC AAGCAGCCGC
AGGAGCTGAA ACTGCCGGAG AACTACGGAA ACTACAGGAA GAAAAAGTTT CTAAATGAGT
CTTTGTCGAG AAAGTCTGGA GCGTCCGTGC TAGAAGAGGT TGATCATGAT ATGGATGAAA
ACGAGGTCGA TTCTGAATAC GATAGCGATT CACCAGAGCC TAAGAGACAC AAAAGATCGT
ACAGCGATTA CTCTAAGAAT ACTAATTCAA AGGTGTTCAA CAGCAACAGG CCGCCTCCAC
CACCTCCACC TCACCATATC TCTCTTTCAA ACATGCCTAC TTTCAACAAT CCTACAACGT
TTAGAATGAA AGAAGACGAC ATCAACACTC CTGGAAGCAC TGCTTCAAAT GCTACGGACA
AGGTCTCACA CAAGCGGGAT GACTCGACTT CTAGTAACCA AAGACCGGTG CTAAGAGTCC
AAATTCCAAC TGATGCCAAG AGTAATTCAC TGAATAACAA GAGCTCCATC AACGATAGTG
CCCGTACCAT TACTGCCATC GACACCAACA TGCAAAATAC GTCTGTAACA GGTTCAGGAA
ACAGCAATTC CAATGCGGGT AGTAAAAATG GCAATGATAG TACCAGTTCG CACGAGGGAG
ACAATAGCTC TGATGTAAAT GGTTCATCCA ATATTCAGAC CAATGCCAGT AGCAACAACT
CCATAATAGT TGGCGGCAAC AACCATAACA TCAACAATAA CCAGAACAAT CTTAACATCA
ACAATAATCA TATTAACCAG AATCATAACC AAAATCATAC TAATAATAGT AACTTGCCGA
ACATCAACAC CCCGAAATAC TCGAGCTTTG CATCGTTCAG ATCACCTGAT TCACGTAAAC
CAACGCTACC TCTACCTATA CAATCAAAGT CACAGACTTC ATCACCTGCC AGTGCCACGG
CTCCTCCTTT GCCAATCGTT AATAACGGAG CCAACATGGC CATTAACAGT AATGTAGCCA
ACCCTAACGC CATGTACTAC TCATCTATAC CCCAGGGATC ACCGTCAAAT CCATATCCCA
ATGGAATTCT CCCAACACCC ATACTTAACC AGGTATTCAA CCAACAGTAT GGTCAGCAGC
TTGCGAATGG AAATGATCCT CATTCTGGGA ATGCAAAGTT CCGACCTCCA ATATTCACCA
ACTTACCCAA CAACGTTGGA GAACAGACTC CTATTTCGGG ACTTCCCTCG AGATATGTCA
ACGACATGTT TCCATCGCCT TCCAACTTCT ACGCACCTCA AGATTGGCCT AGTGGAAACA
CTGGAATGAC GCCCATCCAT GCCAACATCC CTCAATACTT CATGAACATG TTACCCTCGG
CCGGTCCGTC CAGCGCATTT CCGGGAAATC CAGGGACCAA GTTGCCTGCG AATGTACAGC
AGCAACAGAC ATCTCCAGCT CCCGCAACAG TGCCTCCTAA AGACGGACCT CTTTCACCAA
CTATCTTCAT GGGAACGAAT GCGAAGATTA AGCTAGAGGA GAAGAAGGAT AGTAAGTGAG
ACTGCCACTT TAGCAGTATA ACTTGTACTT ATAGACATAT ACCTTTTGAA ACGAGATCCA
ATGAACATTT ACCAGAGATG AGTAGCACAC
 
Protein sequence
MGRRKIEIQP LTDDRNRTVT FVKRKAGLFK KAHELAVLCQ VDLAVIIVGS NNKIYEFSSV 
DTGELIKSYQ HTCKTKQPQE LKLPENYGNY RKKKFLNESL SRKSGASVLE EVDHDMDENE
VDSEYDSDSP EPKRHKRSYS DYSKNTNSKV FNSNRPPPPP PPHHISLSNM PTFNNPTTFR
MKEDDINTPG STASNATDKV SHKRDDSTSS NQRPVLRVQI PTDAKSNSLN NKSSINDSAR
TITAIDTNMQ NTSNNLNINN NHINQNHNQN HTNNSNLPNI NTPKYSSFAS FRSPDSRKPT
LPLPIQSKSQ TSSPASATAP PLPIVNNGAN MAINSNVANP NAMYYSSIPQ GSPSNPYPNG
ILPTPILNQV FNQQYGQQLA NGNDPHSGNA KFRPPIFTNL PNNVGEQTPI SGLPSRYVND
MFPSPSNFYA PQDWPSGNTG MTPIHANIPQ YFMNMLPSAG PSSAFPGNPG TKLPANVQQQ
QTSPAPATVP PKDGPLSPTI FMGTNAKIKL EEKKDSK