Gene PICST_39588 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_39588 
SymbolRRP1 
ID4851894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp3125678 
End bp3127345 
Gene Length1668 bp 
Protein Length556 aa 
Translation table 
GC content42% 
IMG OID640393602 
ProductRhomboid-related protein 1 (RRP) (Rhomboid-like protein 1) 
Protein accessionXP_001386926 
Protein GI126275943 
COG category[R] General function prediction only 
COG ID[COG0705] Uncharacterized membrane protein (homolog of Drosophila rhomboid) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0102742 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCAACG ACAACCATAA TAATCAACAC TACAATCTCA ACCTCGGCAA TAGTAGCAAT 
AAAAATAGCA ACAAAAACAA TAATAATGTC AGGAATATAC AAAAAATCAA CAGCAACTAT
TCAGCCAACG AAGCGACCTA TCCCCATTCA ATCTTTACAG AGAACAACAC TCACGAACCT
CCGGTAAACT CATTCCCATC TGGGACCCAG TATAGACCCA CACCTAAATT GGAAAAAGAG
CTTCCGAATC CAAGAAACAA CGAATACGAG CTTTCAAACA TGGATGAATC CAATAATATC
AGAAGATACT CATATACTCC GTATCTGGCG GAAACGTCTA ACTACAACAA CTATGACGAA
ATAAGAGCTT ATAGCCCCAA TCCTTTTGCG GATAGAAACC TTGCACCACT TCCATCGATT
CCTCACTCAG ATCCTTTTGA AGAGACACCG TTGGAAACCT CAAGCAACGA TCTCTTGAAA
GATGAGTCTG ACAGCCATGA CAGACAGCAA TTTAACGAAA GAGAAAGAAT CAAGCTTTTG
CGCAGAAAGC CGAGATTCCA CTATACGAGG CTTCCGTACT TCACGATTCT CGTGACTTTG
ATCCAGGTAA TCGTCTTCAT TGTAGAATTG GCGCGTATGG CTCAATTAAC TGGTTCGGCA
TTTCAAACCA AGCCTTACTT CAATCCGATG TTAGGTCCAT CTACATACCT CTTGATCTAT
ATGGGTGCAA GATACGTTCC TTGTATGCTG CAGATTGTAG GAATCACGGA CGACACATCG
ATCATGTTTC CCTGTGCAAA CTCGACCACA GTAGACACCA ATGTTTGCAA TTTGAGCGAG
CTCTGTGGCT TGGGAGGTGT TCCTATTGTA GATAATAAGT TCATTCCAGA CCAATGGTAT
CGTGTGATTA CACCTATCTT TTTGCACGCT GGGTTTCTTC ATATTATATT CAATCTTCTC
TTACAGATCA CCATGGGTTC TTCCATAGAA CGTCATATTG GTGTACTCAA GTATGCTATC
ATTTATCTCC TGAGTGGTAT AGCTGGTTTC TTGCTAGGAG CAAACTTCAC TCCACAAGGT
ATCGCGTCCA CCGGAGCTTC AGGTGCCTTG TTTGGAATCG TCGCTACCAA CATATTGCTA
TTCATATATT GTGGCAGAAA AAATACCAAT CTCTATGGAA CTCGCCATTA CGTCTTATTC
ATCTGCATCA TGGTAGGCGA AATCATCATT TCTCTAGTTC TAGGTTTATT ACCTGGTCTT
GATAACTTTA GTCATATTGG TGGGTTTGCT ATGGGTGTCT TGACAGCAGT TGTATTCTTG
CCAGATCCCT TCTTTGTATA CATAGATGGT ATCATTACCT ACAAAGGAAA TGCAACCACA
TGGGAACAGT TTGTGAACGC CTGGAACCCT TTCTATGCCT GGGAAGACAA AATCCCCTTA
CGATTCTATA TTTGGTGCGG TTTTAGAGTC GTTTGTCTCG TACTTGCCAT AGTCTATCTC
GCGATGTTGA TCAAGAACTT TTTTACTAAC ACTGAGTCAC CAGAATCTCG CTGTTCCTGG
TGTAAGTACA TCAATTGTAT TCCTGTAAAT GGCTGGTGTG ACATTGGAGA AGTGACTATT
ACTACATCTA CAGTTGCACA ACCTACTGCT ACTCCACCTC CTACAATG
 
Protein sequence
MFNDNHNNQH YNLNLGNSSN KNSNKNNNNV RNIQKINSNY SANEATYPHS IFTENNTHEP 
PVNSFPSGTQ YRPTPKLEKE LPNPRNNEYE LSNMDESNNI RRYSYTPYLA ETSNYNNYDE
IRAYSPNPFA DRNLAPLPSI PHSDPFEETP LETSSNDLLK DESDSHDRQQ FNERERIKLL
RRKPRFHYTR LPYFTILVTL IQVIVFIVEL ARMAQLTGSA FQTKPYFNPM LGPSTYLLIY
MGARYVPCML QIVGITDDTS IMFPCANSTT VDTNVCNLSE LCGLGGVPIV DNKFIPDQWY
RVITPIFLHA GFLHIIFNLL LQITMGSSIE RHIGVLKYAI IYLLSGIAGF LLGANFTPQG
IASTGASGAL FGIVATNILL FIYCGRKNTN LYGTRHYVLF ICIMVGEIII SLVLGLLPGL
DNFSHIGGFA MGVLTAVVFL PDPFFVYIDG IITYKGNATT WEQFVNAWNP FYAWEDKIPL
RFYIWCGFRV VCLVLAIVYL AMLIKNFFTN TESPESRCSW CKYINCIPVN GWCDIGEVTI
TTSTVAQPTA TPPPTM