Gene PICST_81646 details

Gene Information

Locus tag: PICST_81646
Symbol: RFA1
ID: 4836927
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 133334
End bp: 135336
Gene Length: 2003 bp
Protein Length: 616 aa
Translation table: 12
GC content: 42%
IMG OID: 640388242
Product: DNA replication factor A
Protein accession: XP_001382257
Protein GI: 150863697
COG category: [L] Replication, recombination and repair
COG ID: [COG1599] Single-stranded DNA-binding replication protein A (RPA), large (70 kD) subunit and related ssDNA-binding proteins
TIGRFAM ID: [TIGR00617] replication factor-a protein 1 (rpa1)
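
The Start bp, End bp, and Gene Length fields above are mutually consistent if the coordinates are read as 1-based and inclusive. The page does not state the coordinate convention, so that reading is an assumption; the helper name `span_length` is illustrative only. A minimal sketch of the arithmetic:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1

# Values copied from the Gene Information fields of this record.
START_BP = 133334
END_BP = 135336
REPORTED_GENE_LENGTH = 2003

gene_length = span_length(START_BP, END_BP)
print(gene_length)                          # 2003
print(gene_length == REPORTED_GENE_LENGTH)  # True
```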


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.272017
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CTTGAACAAC AAGCACTGCC CACTATGAGT TACAATCTTT CCGGTGGAGC CTTGAGAGAT 
GTGTTCTCCA TCAAGAAGCA CGCGCTGGTG AACACTCCCA TTATTTTACA GGTTACTAAC
ATCAAGCCTG TGCTTTACAA GGACGAAGTG AAGAAGTACA GACTTCTTCT CAATGATGGC
CAATACGCAG CCCAGGGTCT TGTGGATGAA CCTTGTATCA CGTATTTGCA GAATAACAAC
TTTTCCCGTT ACTCCATTAT TGAAGTTAAG GAGTTCTCGA CGTTTCCTAC TCAGAAACAC
ATTTTCATAG TAAAGGACGC GGTCATCGTT TCGCCAACTA GTGAAAAGGG CAACTCTAAC
GATTATATCA GCGTTGATAC GTACTATGCC GAACATCCGG ACGATGATAA TTTACAAATC
GCCCAGAGGC AAGCTGCTGG TGCCACTAAT GGAGCTGGAG CGCGTTCTGA ATCCCCAATT
CCTGGTAGTG AACAGAACAG ACCTCAGCAA TTTCAACAAC CACAACAGTA CCAAAACCAG
CCATCCAATC GAAACGGATT CCAGGGTAAG ATTACGCCCA TCGAGACATT GTCTCCGTAC
CAAAACAACT GGACCATCAA GGCTAGAGTG TCGTACAAGG GAGATCTCAG AACGTGGACT
AATGCCAAAG GAGAAGGCAA GTTGATTTCC GTCAACTTTT TGGATGAATC AGACGAAATA
AAGGCTAGTG CCTTCCAAGA TGTTGCCATA TCTGCCCATA AGTTATTGGA GGAGGGTAAG
GTCTACTACA TTTCCAAGGC TAAGGTCCAG GCTTCCAACA AGAAGTTCAA CACCTTATCC
CACCCATATG AGTTGGTGAT GGACAGAGAC ACCAAAATTG AAGAGTGTTT TGATGTAGAC
AACGTACCCA AGATGCACTT CAACTTCATC AAATTGAACC AAATTCCTAA CCTTGATCCA
AATGCCATTA TCGATGTCCT TGGAGCCTTG AAGATTGTTA ATGAGCCTTA CAAAATAACA
GCCAAGTCAA CAGGTAAGGA ATTCGACAGA AGAAACGTCA CCATTGTGGA TGAAACCGGT
TTTGCTATAG ATGTGGGTTT GTGGAATAAT ACCGCCACTG AGTTCAGTAT TCCTGAAGGC
TCGATTATCG CTTTCAAGAG TTGTAGAGTT CAGGATTTTA ATGGCAGATC TTTAACGTTG
ACTCAGACTG GTTCCATGTT GCCTAATCCA AACACCCCGG AACTGTACCT GTTGAAGGGC
TGGTACGATA ACCAAGGTGT CAATGCCAAC TTCAACAACT TGAAGGTAGA GAGTAGTGGT
GGCGAAACAA AAATTGGTGA TCGTAAAACA ATTGCACAGG CTCAAGACGA GAGCCTCGGG
CTTCGTTCGG AGAAGGAACC AGACTATTTC ACCGTCAAAG CAAGTATCTC TTTCATTAAA
ACTGATCCCA ACTTCTGCTA TCCTGCTTGT ACAAACGAAG TTCAGTACAA CAACAGAAAG
CTGGCATGTA ACAAGAAGTT AGTAGAACAA CACGATAATT CCTGGAGATG TGAAAAATGT
GACAAGAACT ACGCACAACC AACCTATCGT TATATTCTTA CCTGTTCCAT TATGGATGAG
ACCAATCAAA TATGGGTTAC GCTCTTTGAG AGAGAGGCTC TTAAGATTTT GGGCAAGGAT
GCCAATGAAC TTATTGCCCT CCAGGATGAC TCTGCCGCTT TCAAAGATTA CATTCAGGAA
AAGTGTTTTC AAGAACATGT TTTCAGAATT AGGGCAAAGC AAGATACTTA TAATGATCAG
GTAAGAGTCA GATACCAGTG TGTTGCCCTT TATGATATCG ACTATAACGC CGAAGCCATT
CATTTGAGTG AGCAATTGGA TTCATTATTG GTTTGATTTT ATTGTTTTCT ACCCATGTAT
TTCTACCATT CGTTATGTAT TCTGCTGTTT CCTCTTAATT GGTGTCTATA TTGTTAAATA
AGAGTTAATG TTATGTACAT GTT
 
Protein sequence
MSYNLSGGAL RDVFSIKKHA SVNTPIILQV TNIKPVLYKD EVKKYRLLLN DGQYAAQGLV 
DEPCITYLQN NNFSRYSIIE VKEFSTFPTQ KHIFIVKDAV IVSPTSEKGN SNDYISVDTY
YAEHPDDDNL QIAQRQAAAR SESPIPGSEQ NRPQQFQQPQ QYQNQPSNRN GFQGKITPIE
TLSPYQNNWT IKARVSYKGD LRTWTNAKGE GKLISVNFLD ESDEIKASAF QDVAISAHKL
LEEGKVYYIS KAKVQASNKK FNTLSHPYEL VMDRDTKIEE CFDVDNVPKM HFNFIKLNQI
PNLDPNAIID VLGALKIVNE PYKITAKSTG KEFDRRNVTI VDETGFAIDV GLWNNTATEF
SIPEGSIIAF KSCRVQDFNG RSLTLTQTGS MLPNPNTPES YSLKGWYDNQ GVNANFNNLK
VESSGGETKI GDRKTIAQAQ DESLGLRSEK EPDYFTVKAS ISFIKTDPNF CYPACTNEVQ
YNNRKSACNK KLVEQHDNSW RCEKCDKNYA QPTYRYILTC SIMDETNQIW VTLFEREALK
ILGKDANELI ALQDDSAAFK DYIQEKCFQE HVFRIRAKQD TYNDQVRVRY QCVALYDIDY
NAEAIHLSEQ LDSLLV
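
The record lists translation table 12, the NCBI alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on a short fragment copied from the first two lines of the gene sequence, starting at the first ATG. Treating that ATG as the start codon is an assumption, although the result agrees with the opening residues of the listed protein sequence. The names `codon_table`, `translate`, and `FRAGMENT` are illustrative and not part of the source record.

```python
BASES = "TCAG"
# Amino acids of the standard genetic code, in TCAG codon order (TTT, TTC, TTA, ...).
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

def codon_table(amino_acids: str) -> dict:
    """Map each of the 64 codons (TCAG order) to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, amino_acids))

TABLE_1 = codon_table(STANDARD_AAS)
# Table 12 differs from the standard code in a single codon: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")

def translate(dna: str, table: dict) -> str:
    """Translate in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))

# First 66 coding nucleotides, copied from the gene sequence in this record.
FRAGMENT = (
    "ATGAGT TACAATCTTT CCGGTGGAGC CTTGAGAGAT "
    "GTGTTCTCCA TCAAGAAGCA CGCGCTGGTG"
)

print(translate(FRAGMENT, TABLE_12))  # MSYNLSGGALRDVFSIKKHASV
print(translate(FRAGMENT, TABLE_1))   # MSYNLSGGALRDVFSIKKHALV
```

Only the table 12 translation reproduces the `HASV` stretch at the start of the listed protein sequence; the standard code would give `HALV` at the CTG codon.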