Gene PICST_88664 details


Gene Information

Locus tag: PICST_88664
Symbol:
ID: 4838363
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 763322
End bp: 764922
Gene Length: 1601 bp
Protein Length: 468 aa
Translation table: 12
GC content: 44%
IMG OID: 640389678
Product: predicted protein
Protein accession: XP_001383778
Protein GI: 150864802
COG category: [A] RNA processing and modification
COG ID: [COG5228] mRNA deadenylase subunit
TIGRFAM ID:
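
The gene length reported above follows from the start and end coordinates if they are read as 1-based and inclusive (an assumption, though it is the reading that reproduces 1601 bp). A minimal Python sketch of that arithmetic, using only values from this record; the function and variable names are illustrative:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1

# Coordinates and protein length as listed in this record.
gene_len = span_length(763322, 764922)

# Nucleotides needed to encode 468 residues plus one stop codon.
coding_len = (468 + 1) * 3

# Remainder of the gene span not accounted for by coding sequence.
noncoding_len = gene_len - coding_len

print(gene_len, coding_len, noncoding_len)
```

The remainder is sequence within the gene span that does not encode the listed protein (intron and/or untranslated sequence), which fits a record marked as spliced.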


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.64572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.542479
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGTTA ATCCACATCT CCAGTATCTC CAGCTGCACC AACTCCAGCA GCAGCAACAG 
CCCATGAATG CTACCGGCCA GTCCCCGCAG ATTTCACAGC TCCAGTTGCA ACAGCACCAG
TTACAGAGAC AACAGCTTTT GAACCATCAT CTCCAGCAAC AACACCAGCA CCAGCCCATA
GGAACACCCA CAGCTCAGAA CGTCAGTACT AATCCTTTGT TAGCTGCAAT TAATGGAACT
TCTAACGGAA ACGTCGTTCC AGGGAATGTA AATGGTCCAG GCTCAGGCTT GAACTCTGGT
TTGAACTCTG GTTTGACCAC TGGAGCTCTG GTAAATCCTG TCTTACAATT ACAATTACAA
CAGCAGAAAC AGCAACAGCA GCATCAGTTC CAGCAGTTAC AGGCAGCCCA TCAGGCAGCC
CACCAAGTCC AACAGCAGAT TCCTTCTCAT CAGTTACATA ATCAGGCTCC TCTCATCAAA
GAGGTCTGGG TGCAAAACTT GGAAAATGAG TTTCACACCT TACGAACCTT CATCAACGAC
AAGACATCCA AGATCTTCAT CGCCATACAC GAGGAAATCC CGGGCATCGT AGCAAGACCA
GTAGGCACGT TCAAGTCATC GTCTGACTAT CATTTCCAGA CGTTGCGTTC CAACCTGGAC
TTGTTGAACT TGATCCAGTT GTCATTCTGT GTCACCAAGA TCAAAAACAA CGAGATCAGT
TCCAGCATCA TCTGGCAGTT CAACTTCTTG TATGACTTGA CTAAGGAGAT GTTCAACGAA
GAACATTTGA CCATGTTGTC GCAGTCGTCG CAGATAAACT TCCAGATGCA CATGACTCAG
GGTATTCCTC ATTTTTCATT TGCTGAACTC TTGATTGAAA GTGGTTTGCT TTTGGACCTG
TCCATCAATT GGATCAGTTA TCATGCTGGG TATGACTTGG GCTTCTTTGT TAGCTTGCTC
ATCAACGATA ATCTTCCTGT CGATGAAAAA GACTTCTACT CCTGGTGCTC CAAGTACTTC
CCCAACTTCT ACGACTTGAA GTATATCGGT AGCCAGTTGT TGAACACACC CAATGGCGAA
GATACAGCTA AGGCTTCCAA TAATAAACCA TCCATAGAAT ATTTGGCTGA AGAATTGCAC
TTATTGCCTA TCTCTCCTGC TATCCGTCAA CACTTTGCTG CGTCTATGTC ATCTCACTTT
CCGGGCCATC AACAGCAAAT GACCTCTACT TTACATGCCT ACTTGTCAAT GGAGTGTTTC
AAGGAGCTTT TGAGACAGTC GTCTTTTGAT CTTGCTTCGT TTTCACGCTT CAAGGGCTAC
ATCTGGGGCT TGGGCAATTT GTATGGTAAC GGATCTGTAG ACGAGCAATT TCAGATCAAC
GGGGCCATTC CTCAGCCCTC AACACCCTCG GGAGGCAACT CAAAAAGCGG TGTCGTTCAC
TATGGAAGAC CCCTTTGACT TTTGTTGGTT GCTATGCCGC TTTTAGAACA AACTTGTACT
TGTATGTTTT CCTTCTATAA TATTCTTCAT TTCCATGTAA ATTAGATATT GATTCTTGCA
ATTAATATAT ACTATATACT ACACTATCCA AGCTCTATGA T
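
A GC percentage such as the one in the Gene Information block can be recomputed from a sequence printed in this 10-base grouped layout. The sketch below is one plausible way to do it, not the database's own method: it assumes GC content means the count of G and C divided by the count of A, C, G and T, and it ignores spaces and line breaks. The function name is hypothetical, and the span and rounding used by the database are not stated in this record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq."""
    # Keep only nucleotide letters, so spaces and newlines are ignored.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)

# First line of the gene sequence in this record, pasted with its spacing.
first_line = "ATGAACGTTA ATCCACATCT CCAGTATCTC CAGCTGCACC AACTCCAGCA GCAGCAACAG"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of the first line gives a value to compare against the listed GC content.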
 
Protein sequence
MNVNPHLQYL QSHQLQQQQQ PMNATGQSPQ ISQLQLQQHQ LQRQQLLNHH LQQQHQHQPI 
GTPTAQNVST NPLLAAINGT SNGNVVPGNV NASVNPVLQL QLQQQKQQQQ HQFQQLQAAH
QAAHQVQQQI PSHQLHNQAP LIKEVWVQNL ENEFHTLRTF INDKTSKIFI AIHEEIPGIV
ARPVGTFKSS SDYHFQTLRS NSDLLNLIQL SFCVTKIKNN EISSSIIWQF NFLYDLTKEM
FNEEHLTMLS QSSQINFQMH MTQGIPHFSF AELLIESGLL LDSSINWISY HAGYDLGFFV
SLLINDNLPV DEKDFYSWCS KYFPNFYDLK YIGSQLLNTP NGEDTAKASN NKPSIEYLAE
ELHLLPISPA IRQHFAASMS SHFPGHQQQM TSTLHAYLSM ECFKELLRQS SFDLASFSRF
KGYIWGLGNL YGNGSVDEQF QINGAIPQPS TPSGGNSKSG VVHYGRPL
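
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The sketch below builds that codon table in Python from the standard compact amino-acid string, changing only the CTG entry, and translates the first line of the gene sequence. All names are illustrative. Because the gene is marked as spliced, translating the full gene sequence in a single frame is not expected to reproduce the whole protein, so the example is limited to the first 60 bases.

```python
BASES = "TCAG"

# NCBI translation table 12: identical to the standard code except that
# CTG encodes serine (S) instead of leucine (L).
AMINO_ACIDS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Map each codon to its amino acid, with bases ordered T, C, A, G.
CODON_TABLE = {
    a + b + c: AMINO_ACIDS_TABLE_12[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(seq: str) -> str:
    """Translate DNA in frame 1 up to the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop spaces and line breaks
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

first_line = "ATGAACGTTA ATCCACATCT CCAGTATCTC CAGCTGCACC AACTCCAGCA GCAGCAACAG"
print(translate(first_line))
```

The twelfth codon of that line is CTG, and the twelfth residue of the protein sequence above is S, which is the difference table 12 makes relative to the standard code.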