Gene PICST_86817 details


Gene Information

Locus tag: PICST_86817
Symbol: RSA2
ID: 4851965
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3315939
End bp: 3317614
Gene Length: 1676 bp
Protein Length: 506 aa
Translation table:
GC content: 43%
IMG OID: 640393673
Product: Ribosome assembly protein
Protein accession: XP_001387204
Protein GI: 126276160
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
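
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (the record does not say so, but the numbers are consistent with that reading), and the helper names span_length and cds_length are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3315939
end_bp = 3317614
gene_length_bp = 1676
protein_length_aa = 506


def span_length(start, end):
    """Length of a coordinate range, ASSUMING 1-based inclusive positions."""
    return end - start + 1


def cds_length(protein_aa):
    """Nucleotides needed to encode a protein plus a single stop codon."""
    return 3 * (protein_aa + 1)


# The reported gene length matches the coordinate span under this assumption.
assert span_length(start_bp, end_bp) == gene_length_bp

# Bases in the reported gene span beyond an unspliced coding region.
extra = gene_length_bp - cds_length(protein_length_aa)
print(extra)  # 155
```

Under these assumptions the reported gene span is 155 bp longer than the 1521 bp needed to encode 506 residues plus a stop codon. The record itself does not explain the difference.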


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.189505
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTAAAA GATCTGCTCC TGAGGACGCT ATCGAACAGC TGAGTGTATC TGCTGTCAAG 
ACCTCCAAGG AATCCGTACC TAAATCCAAT GTCGTAGAGG AAAATGACAT CGACATGGGG
GAGTTTGAAG ATCCGTACGG AGACGAGTTT GAGAGTGATG GGGAAATTAT TGAAGTCGAT
TCTGCTGAAG ATGACGATGA GGAATTGGAC CCAGAAGCTG TTGCCAAGAA GATCGAGCTG
GAAGAACAGA GAGAGCAACA AGAAGAAGAA TCCACTATTT ATCTTCCACA TAGATCCAAA
CCATTGGGAC CGGATGAAGT CTTAGAAGCC GATCCTACAG TATACGAGAT GTTACATAAT
GTTAATTTGC CATGGCCCTG TTTAACTGTT GATATTTTAC CGGATAACCT TGGTAACGAA
AGAAGAACAT ATCCCGCTTC GCTTTATTTG ACTACTGCTA CCCAGGCTTC GCGGGGAAAT
GCCAATGAGT TGATCACTAT GAAGTTGTCT TCCTTAGCGA AAACTTTGGT TAAGGACGAT
GAAGAAGATG ATGAAGACGA CAATGAAGAC GAAGACGAAG ATGTGGACCC AGTTATGGAC
TCAGAAATTA TTTCATTGAA GCACACCACT AATAGAATCA GAGTGTCTCC TCATGCTTCG
CAAACTGGCG AGTACTTGAC AGCCACCATG TCTGAAAGTG GCGAAGTTCT CATTTTCGAC
GTAGCATCGC AATTTAAAGC ATTTGACACT CCTGGATTTG TAGTCCCCAA GGGTGCCAAA
AGACCTATCC ATACTATTAG AACTCATGGA AATGTAGAAG GCTATGGTTT GGATTGGTCC
CCTTTAATCA ACACTGGTGC CTTGTTATCT GGAGACTTGA CTGGCAGAGT TCATTTGACT
TCTAGAACTA CATCCAACTG GGTTACAGAC AAAACCCCTT TTTTTGCATC TCAATCCTCC
ATTGAAGACA TCCAGTGGTC TACAAGTGAA AACACTGTTT TCGCAACTGC TGGAACTGAT
GGGTACGTCC GGATCTGGGA TACGAGATCC AAGAAACACA AGCCTGCTCT TTCTGTGGTT
GCTTCTAACA CTGATGTTAA CGTCATTTCT TGGTGCAACA AGATCAGCTA CTTGTTGGCT
TCTGGTCACG ACGATGGATC CTGGGGCGTC TGGGATTTGA GAAACTTCAA TGCCAATACA
ACTCCTACTC CTGTAGCTAA CTACGACTTC CATAAGTCTG CCGTTACGTC CATCTCATTC
AATCCATTGG ACGAATCAAT CATTGCCGTT TCGTCGGAAG ACAACACTGT CACTCTTTGG
GACTTGGCCG TTGAGGCCGA TGACGAAGAA ATCTCTAATC AAAGAAAAGA AACCAAGGAG
TTGGATGATA TTCCTCCACA ATTGTTATTT GTGCATTGGC AAAAAGATGT AAAGGACGTC
AGATGGCACA AACAAATCCC AGGATGCTTG GTCTCTACTG GTGGAGATGG TTTGAACGTA
TGGAAAACCA TTTCTGTCTG AGTAAAGGAG CACCTGAAAG TACCCACCTT CCCATATATG
CATTGCTCCA TCATTTGTAC AGTAGTCTAC CTTTAGAATA TGTTAATTAG ATCTTCTTGG
AAATATAACT TCTTTCTCAG TTCAACGCTT GGCGTGCCTC GATGAACATA TACTTA
 
Protein sequence
MSKRSAPEDA IEQLSVSAVK TSKESVPKSN VVEENDIDMG EFEDPYGDEF ESDGEIIEVD 
SAEDDDEELD PEAVAKKIEL EEQREQQEEE STIYLPHRSK PLGPDEVLEA DPTVYEMLHN
VNLPWPCLTV DILPDNLGNE RRTYPASLYL TTATQASRGN ANELITMKLS SLAKTLVKDD
EEDDEDDNED EDEDVDPVMD SEIISLKHTT NRIRVSPHAS QTGEYLTATM SESGEVLIFD
VASQFKAFDT PGFVVPKGAK RPIHTIRTHG NVEGYGLDWS PLINTGALLS GDLTGRVHLT
SRTTSNWVTD KTPFFASQSS IEDIQWSTSE NTVFATAGTD GYVRIWDTRS KKHKPALSVV
ASNTDVNVIS WCNKISYLLA SGHDDGSWGV WDLRNFNANT TPTPVANYDF HKSAVTSISF
NPLDESIIAV SSEDNTVTLW DLAVEADDEE ISNQRKETKE LDDIPPQLLF VHWQKDVKDV
RWHKQIPGCL VSTGGDGLNV WKTISV
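
Both sequences above are printed in space-separated blocks of ten residues, so they need to be joined before any downstream use. The following is a minimal sketch of that step, not part of the original record: the function names join_blocks and gc_fraction are hypothetical, and the sample input is only the first line of the gene sequence.

```python
def join_blocks(text):
    """Remove all whitespace (spaces and newlines) from a block-formatted sequence."""
    return "".join(text.split())


def gc_fraction(seq):
    """Fraction of G and C bases in a nucleotide sequence (case-insensitive)."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTAAAA GATCTGCTCC TGAGGACGCT ATCGAACAGC TGAGTGTATC TGCTGTCAAG"

seq = join_blocks(first_line)
print(len(seq))          # 60
print(gc_fraction(seq))  # GC fraction of this one line only, not of the whole gene
```

Pasting the full Gene sequence block into join_blocks in the same way should give a single string whose length can be compared against the Gene Length field, and whose GC fraction can be compared against the reported GC content.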