Gene PICST_82779 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_82779 
SymbolRAD23 
ID4838295 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp818254 
End bp819740 
Gene Length1487 bp 
Protein Length366 aa 
Translation table12 
GC content45% 
IMG OID640389610 
Productnucleotide excision repair protein (ubiquitin-like protein) 
Protein accessionXP_001383791 
Protein GI150864814 
COG category 
COG ID 
TIGRFAM ID[TIGR00601] UV excision repair protein Rad23 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
CCACCGTTGA ATTGACCCCG TGCACTGTTT TCGTAATGCA AGTGATTTTC AAGGATTTCA 
AGAAACAGAA AGTGCCCTTG GAGGTCGAGT TGACCGACAC TGTATGTATA TGAAAGATAG
AGACTTACTG AAATTTTGCA GAGAGACGTG AATATTTCGT TGAGAGGAAC ACAATTTGGT
TGGTTAGATT GAGGATTGGC TGATATAGGG ATTCAAATTA CTTATTAAAT GAATGGTTTT
ATAAAGACTC ATAATTTGCT TCCAAATTTT ATCAAATATA AAATATCTAA CACCAACGCG
ATTATATCAG CGCATTAAAT GAAAATCTTC ATTTGTTAAT TTATTTCAAC TTTCGGTAGT
TGTCAATGTC AATAATAGTT GCTTTTACTA ACGCTCTTCA GGTGTTGGCG ACCAAGGAAA
AACTCGCTGC CGAAAAGGAC TGCGAAGCCC CGCAATTAAA GTTCGTCTAC TCCGGTAAAG
TGTTACTGGA CGAAAAGACT TTGGAGGAGT TTAAGATCAA GGAAGGCGAC TCGATCATTT
TCATGATATC CAAGGCTAAA AAGACTCCTA CGCCTGCTCC TGCTGCTGCG CCAATTACGA
CAACATCTTC CGAACAGTCA TCTGCCACCC CTGGATCTAC CACTGCCACC ACTACTACTG
GTAATGCTGA GGGTTCTACC GACTCTGGTA TTTCTTCTGG AAATGCACCA GAACCAGAAG
CAGCTGCACC AGAATCTTCT ACAACTTCGG AACCAAGTTC TACTTTTGCC CAAGGCTCAG
AGCGCGAAGC TAGCATTCAG AACATTATGG AAATGGGATA TCAAAGAGCC GAAGTTGAAA
ATGCATTGAG AGCAGCTTTC AACAATCCTC ATAGAGCCGT AGAGTATCTC TTGACTGGAA
TTCCGCAATC TTTGCAACGT CCGGAAGTGC CAGCCGCCGT AGCTCCTGTA GCTGACTCAA
CTCACGAAGA GTTGGCGCAG GATCACGACA TTGACGATGG CGAAGAACAG GGTGAAAACT
TGTTTGAAGC TGCCGCAGCC GCCCAGGCCA GAAGCCAAGG GGCTGGTGCC GTAGAACAAC
CGGCAACTGG TGGAGGATTA GCGGAAATGG GCGACGATGA ACAGATGAAC TTGTTGAGAG
CATCGTTGCA ATCAAACCCC GAGTTGATCC AGCCTATTTT GGAACAATTG GCCCTGTCCA
ATCCCCGAAT CGCTACTTTG ATTCAGCAAG ACCCAGAAGC GTTTATCAGA ACATTTTTGG
GAGCTGGTGC CGACGAATTG GGATACGAAA TAGAAGGCGA TGACGGAGCT GAAGGAGCTG
ACGCTACCGG CCAACAGCCA ATTCGTATTC CCTTGACAGA ACAAGACCAG AATGCAATTG
AAAGATTGTG CGAGTTGGGC TTTGAACGTG ACTTGGTGAT CCAGGTTTAT TTGGCCTGCG
ACAAGAACGA GGAAGTAGCT GCTGACATCT TATTTAGAGA TATGTAA
 
Protein sequence
MQVIFKDFKK QKVPLEVELT DTVLATKEKL AAEKDCEAPQ LKFVYSGKVL SDEKTLEEFK 
IKEGDSIIFM ISKAKKTPTP APAAAPITTT SSEQSSATPG STTATTTTGN AEAAPESSTT
SEPSSTFAQG SEREASIQNI MEMGYQRAEV ENALRAAFNN PHRAVEYLLT GIPQSLQRPE
VPAAVAPVAD STHEELAQDH DIDDGEEQGE NLFEAAAAAQ ARSQGAGAVE QPATGGGLAE
MGDDEQMNLL RASLQSNPEL IQPILEQLAS SNPRIATLIQ QDPEAFIRTF LGAGADELGY
EIEGDDGAEG ADATGQQPIR IPLTEQDQNA IERLCELGFE RDLVIQVYLA CDKNEEVAAD
ILFRDM