Gene PICST_31379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31379 
Symbol 
ID4839032 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp647470 
End bp649943 
Gene Length2474 bp 
Protein Length722 aa 
Translation table12 
GC content40% 
IMG OID640390347 
Productpredicted protein 
Protein accessionXP_001384089 
Protein GI150865039 
COG category[A] RNA processing and modification 
COG ID[COG5107] Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.828555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATAA ACGACAACAA CAGAAAACGT TTGGCGTTGG ATGTGGTTGG CCAACTAGAG 
GAAGATTTAG AGGCAGATCC CTTGAACTAT GCCAAATGGA ACAGACTCAT TAAACATGTG
GTGGCCAAAG ACAAAGAAGA ACAAGTCAGG GCCATCTACA CCAAGTACCT CGGCATCTTC
AAGTCTGACG TATGTATACG CTAAGCTGCT TATTGCTGTT AGAAAGGCAT GCTCATTTTA
GTTCTTGTCA TTTGTATATA ATTAGATACT ATTGTTTGAT ACAATAGTAT AGTACTGCTG
CTGTTGATAC AATAGCAGTC GAAATTCTTC TGTATATATC GGCACTGTCT CTCTGGAAGA
ATTCAAATGG AATAATTAAT AGACACAGAA TCTTATAGAT GAGTTCTATA TCAGGTCTAG
AGCAAAATTA AGCATTGCCT TTCTGGCAGA GTAAATTTCT AATATCGATT TAACAATTAC
TAACATATCT ATAGGGAGAG CAGTGGTGCA AATATATCAA CTACGAGTTG AACCGAGGCG
AGTTCCAAAA GGTAGAATCT CTATTTCACC AGTGTTTCTT GATTACAGAC AACGTAGAAC
TTTGTCGTTT ATACGTATCG TACGTACGTC GTGTCAACGA TGTTATTACT GGTGGTGAAA
AAGCCAGAGG CACGGTCATT CAGGCGTTTG AATTCGCCAT CAACAAAGTT GGAATAGATA
TCAATAGCAC GGCTTTGTGG AATGACTACT TAGAATTCCT CAAGTCGTGG ACTCCTGCTG
CTAGTTGGGA ACAGCAACAG AAAGTTGATT TGATAAGAAA AGTCTACAAA AAGTTTTTGA
TTGTTCCCAC GGAAAACTTG GAGAACTCGT GGTCTCAATA TACTAAATGG GAAAATGAAG
TCAACCCAGC TACAGCAGCC AAGTTCATCT CTGAAAAGTC TGCCGAATTT ATGCTTGCCA
GATCGTGGAA TACCGAATGG CAGAACATCA CCGAAAGGAA GTTGATGAGA GACATTTATC
CGTTCTCAGC TACTGGTGAA AAGGAAAAGA TTATTAGGAA CCAAGTCGGT TATTGGCTCA
ATTGGGTCGA GTTAGAAAAG AAAAATATAT TGGAGTTGAA GGAAGATTTG CTTGAGAAGA
GAATAGCATT TACATATAGA CAAGCTACGT TTGCATTGCC GTTTGTTCCG GAATTGTGGT
TTAAAGCTAG TAAATTCTTG CTTCTTAGTA ATGAGGAAGC AAACATTAAT AGATGTGTAG
ATCTCTTAAG TGAGGGTTTG CTGTTAAACC CCAGAAGTCT TCTCCTTTCA TTTCAGCTAG
CCGAATTGCA TGAAAAGGAC GCTGGATTCG AAAAATCAAA GGATATCTAC AATAATCTTG
CGAAGTGGTT AACTATTGAC TACACCAAAA CTACTGAGCA GTTAGAGTCG CTTAGATCAC
GTTTTGAAAT CCCCAGCAAT GGCGATGACA ACGATGAAAA TGATCCAGAG TCCTTCAACA
ACGACGACGA TATGCAAATA GATACCAAGA AAGTGTACCA ACTCACTTCT GAAGACAAAA
AGCATCTTGC GACTTTGAGT AAGAAACAAA CTGAACTTGC AAAATCTGTA ACATTAGTCT
ACGTGAAGTG GATGACAGCC TCCAAACGAG CAGAAGGAAT TAAGGAGGCA CGTAGTGTTT
TCAAGCTGGC CAAGAAGTTT GCCAGCATAG GCAGTGAGTT GTTCGTGGAA AATGCTCTTT
TAGAGCATTA TGCCGACAAC AAGAAAGTAG CCTTGAAGAT CTTTGATTTG GGTATGAAAG
CTTACGCTAC AGACGGAGAC TTCTTATTTT CCTATTTGGA GTATTTAATC ATGATCAATG
ATGTGGATAA CATCAGAATC TTGATCCAAA CGTCAGACAC CAACCTCACG AAAGACATCG
TTTCTTTAAC TGAAGCAGTA CAACTAGGTC TGTTGAATGA ATACTTGAAG GAGTTGAAGG
AAGATGAGAT TGAAGTGAAG AGAGGCTACT TGAGAAAGTT ATTTAAGCGA TACATTTCCT
ATGCCTCAAA ATATGTTTCT TTGGATGTTG CTCAAAGTTT TGTAAATAAG TACGAGCAAA
CTTTTCCCGA TGACGACCCC ATCGAGTTGT TTAGCGATAG ATACACACAA GGTGACGACA
ACTTGATAGA AAAACTTGAT TTGGGAATTG ATAGTTCTAC ATCTTTGCCG CCCAGCAAGA
AGAGAAAGGT CAATGCTAAG TCTGATATCG AAGACAACAA CAGAGACTTG GACGACCGTG
ACGGTATATT CAGCGCTCCA CAACCACCAG TGTTGGAGCC TGAACAGCCC AGCTCTTTTG
TGGGGCCTAC TATCACCACT CTTTTAGCAG CCTTACCCAA CGCCTCTTAC TTTGGACAGC
CTTCAGAGAG TGTCTTCAAT AGTGAGAAAT TGGTCAAGTT GTTTTCCAAT TTGCCCAACA
TTCCTGTTGA TTGA
 
Protein sequence
MFINDNNRKR LALDVVGQLE EDLEADPLNY AKWNRLIKHV VAKDKEEQVR AIYTKYLGIF 
KSDGEQWCKY INYELNRGEF QKVESLFHQC FLITDNVELC RLYVSYVRRV NDVITGGEKA
RGTVIQAFEF AINKVGIDIN STALWNDYLE FLKSWTPAAS WEQQQKVDLI RKVYKKFLIV
PTENLENSWS QYTKWENEVN PATAAKFISE KSAEFMLARS WNTEWQNITE RKLMRDIYPF
SATGEKEKII RNQVGYWLNW VELEKKNILE LKEDLLEKRI AFTYRQATFA LPFVPELWFK
ASKFLLLSNE EANINRCVDL LSEGLSLNPR SLLLSFQLAE LHEKDAGFEK SKDIYNNLAK
WLTIDYTKTT EQLESLRSRF EIPSNGDDND ENDPESFNND DDMQIDTKKV YQLTSEDKKH
LATLSKKQTE LAKSVTLVYV KWMTASKRAE GIKEARSVFK SAKKFASIGS ELFVENALLE
HYADNKKVAL KIFDLGMKAY ATDGDFLFSY LEYLIMINDV DNIRILIQTS DTNLTKDIVS
LTEAVQLGSL NEYLKELKED EIEVKRGYLR KLFKRYISYA SKYVSLDVAQ SFVNKYEQTF
PDDDPIELFS DRYTQGDDNL IEKLDLGIDS STSLPPSKKR KVNAKSDIED NNRDLDDRDG
IFSAPQPPVL EPEQPSSFVG PTITTLLAAL PNASYFGQPS ESVFNSEKLV KLFSNLPNIP
VD