Gene PICST_39705 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39705 
SymbolPRP1 
ID4851966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3318177 
End bp3320882 
Gene Length2706 bp 
Protein Length901 aa 
Translation table 
GC content39% 
IMG OID640393674 
ProductPre-mRNA splicing factor prp1 
Protein accessionXP_001386957 
Protein GI126276163 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.215456 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGAGAC CGGCATTTCT AGACCAAGAA CCTCCACCTG GATATATATC TGGAATTGGA 
CGAGGTGCCA CGGGATTTAC AACTTCTGCT GATACTGGCT CTTTACAACC TGGCTTTACT
ATAGAAAATG GAGAAGAATC CGATGACAAT TTAGCTGGAG AAATTGGTGA TGAGGGAGCT
ATCCTTGCGA GCGGAAAAAA CCGCGATAAA GAAGACGAAG AAGCAGATCA AATTTATGAG
GAAATTGAGA GGAAATTGCA AAGAAGAAAA GTGTTAAAAG AAGAAGCAGT TTCAGAAGCA
GAAAATCAAA GTTATGAAAT CAAGAGCAAG TTTTCGGATT TGAAGAGATC GCTTTCGAGC
ATTTCAGCCG AACAGTGGGA AGCTTTACCA GAAGTTGGAG ATATTACAAG ACGTAACAAG
AGAACGAGAC TTTATGAGCA GCAACAGCAG AGAACGTATG CTGTTCCAGA TAGTGTTATA
GCTGGAAGTA TAGCGAGAGC ATCACCTACC AACTTCCAGT CAATTTCCGA GTCTCGAGAC
CAATTACTTC TGAGCCAACT TGACTCCTTA TTACCTAAAC ATGAGATAGA GTTTAATGCA
GAAGAAGCAA CCGAGATATT GAAGAATGAC CAGCATGTAC AAATTGCAGA TATCAGGAAG
GGCAGGCAAA TTCTTGCTTC CTTGAGAAGA ACACAGCCAA ATCTGTCAAA CTCGTGGATT
GCATCTGCAA GACTTGAAGA ACAGGCAACC AACTACACTA TGGCCAAAAA GCTCATTATA
GAAGGGTGCA AGCACGCTCC AAAAAGTGAG GATATCTGGC TTGAAAGTAT ACGTATACAC
AAACTAACTT CTGAGGGAGC GAAAATGTGT AAGGTTATCG TAGCCGAATC TTTGAAGTAC
AACCCTGGTT CGGAAAAACT TTGGATCCAA GCAGAAAACT TGGAGAATTC TATAGATGTG
GTCTCCAGAA AACGGATTTT GATGAGAGCT ATAGAGGCTA TACCATCCAG TGCCAGTCTT
TGGAAGAGAC TAGTGGATTT GGAGACTAAC CAAGATGACG TAAGCAAGAT TTTGACCAAG
GCAATAGAGT TGTGTCCTAC TGAATGGGAC TTCTGGTTAA CTTTAATCAA TTCGTCCGAA
TACAAAGACG CAAAAACACT TCTTAACAGA GCCAGAAAGG CATTGAATGG AAACTACCAG
GTATGGATTA CCGCAGCTAA ATTGGAAGAG AGAGAAAATT CAACAATTGA TTCATCAAAG
ATATCAAAAT TGATGGATAA GGCATTTAAG GAAACCGAGA AGAGTACGAC TACAATTCTG
CGAACTACTT GGTTAGAAGA AGCCATCAAA GCAGAAGAAG AAGGATTCAG AAATACTTGT
AGAGCCATTG TTAACAGTCT ACTTTCTTCT GAAATAAACC AAGATAATCC AGAAGAAAAC
CTTGTAACTT GGTTCCAGGA TGCTGAAACT CTAGCACTGA AAGAGAGTGT GGAAGCTGCG
AACTATATCC ACCAATTCAT TGTTGAGACC AATCCACATA GCATTAACAG TTGGAAGGAG
TTGTTCTCAT TTTTAAAGAA CTCATCAAAC AGGAATTTGG ATACTTTATT TAACTACTAC
AAGAGATCTA TAGAGTTAAA TCCGAAGGTC GAGGTTTTGC ATCTCATGTA TGCAAAGGAT
TTGTGGCAAT TGGCTGGCAA TATTGTCGAA GCGAGAAAGG TTTTGAATGC TGCTAGTCAT
GGACTTGAAA ATAATGAAGA AGTGTGGTTC GCAAGAATCA AATTAGAAAT CAAATCCGGT
AACTTTGAAC AGGCTCTATC CATTTCTTCT AAAATGATCA AAGCTATTCC TACTTCTTCT
GATCGCGTTT GGTATAAACA CATTCACTTG GTGCGATGTA TGAACAATAG AGAACAAAAT
CCTAACTACG AGGGACAGAT ATTGGCGTTA CTAGAAAGTG GGTTGGATTC TTTTCCAGAA
TCTCCTAGAC TTCATTTGCA GAAAATACAA GTTCTTCTTA GAGACTTGAG AAAACCTGAC
ATCGCTAGAG AATCTGCCCG TGCTGCAGTT GAAAAACTCC CCAGCATAGT TGAACTTTGG
ATCTTGTTGT CTCATATTGA CGAACAACAT TTGAATATTT TGATCAAAGC CAGATCTGTA
TTGGATACTG CTATCCTCAA AAATCCAACA AGTGACAAAT TATGGACAGC AAAAATACAA
CTAGAAAGAA GAAATAAGGA CTTTGTGGCT GCTAGACAAC TTGTTAACAA GGGATTGAAA
GCTTTTCCTA AGTCTTCCAG AATCTGGATC GAGTACTTAT CTTTGATTCC CAAAATGTCA
CACAGAAAAA CTGCGTTCTT GGATGCTTTG AAAAGTACCG AAAATTCTCC AGAAATCCTT
TTAGGTATTG GTGTATTCTT CTGGCTTGAC AGTAAACATT CCAAGGCTAA ATCGTGGTTC
GAAAGAGCGT TGTCAAATGA TCGCAATAGT GGGGAATGCT GGGGCTGGCT CTTCAACTTC
ATGAAATCAT ATGGAAAGCA ACAAGAGAAA GAAAATTTGA TAAACTCATT CGAGAGTCAC
TACGAAGAAA TCAATAAAGG AGATATTTGG AATTCTACGA ACAAAGATAT AACCAACTTT
GAGAAAACTC CAAAGGAAAT CCTTGAACTT GTGTCCAAGG TTCTTATATC TAAATATAGT
ACATAA
 
Protein sequence
MLRPAFLDQE PPPGYISGIG RGATGFTTSA DTGSLQPGFT IENGEESDDN LAGEIGDEGA 
ILASGKNRDK EDEEADQIYE EIERKLQRRK VLKEEAVSEA ENQSYEIKSK FSDLKRSLSS
ISAEQWEALP EVGDITRRNK RTRLYEQQQQ RTYAVPDSVI AGSIARASPT NFQSISESRD
QLLLSQLDSL LPKHEIEFNA EEATEILKND QHVQIADIRK GRQILASLRR TQPNLSNSWI
ASARLEEQAT NYTMAKKLII EGCKHAPKSE DIWLESIRIH KLTSEGAKMC KVIVAESLKY
NPGSEKLWIQ AENLENSIDV VSRKRILMRA IEAIPSSASL WKRLVDLETN QDDVSKILTK
AIELCPTEWD FWLTLINSSE YKDAKTLLNR ARKALNGNYQ VWITAAKLEE RENSTIDSSK
ISKLMDKAFK ETEKSTTTIL RTTWLEEAIK AEEEGFRNTC RAIVNSLLSS EINQDNPEEN
LVTWFQDAET LALKESVEAA NYIHQFIVET NPHSINSWKE LFSFLKNSSN RNLDTLFNYY
KRSIELNPKV EVLHLMYAKD LWQLAGNIVE ARKVLNAASH GLENNEEVWF ARIKLEIKSG
NFEQALSISS KMIKAIPTSS DRVWYKHIHL VRCMNNREQN PNYEGQILAL LESGLDSFPE
SPRLHLQKIQ VLLRDLRKPD IARESARAAV EKLPSIVELW ILLSHIDEQH LNILIKARSV
LDTAILKNPT SDKLWTAKIQ LERRNKDFVA ARQLVNKGLK AFPKSSRIWI EYLSLIPKMS
HRKTAFLDAL KSTENSPEIL LGIGVFFWLD SKHSKAKSWF ERALSNDRNS GECWGWLFNF
MKSYGKQQEK ENLINSFESH YEEINKGDIW NSTNKDITNF EKTPKEILEL VSKVLISKYS
T