Gene PICST_63097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_63097 
Symbol 
ID4840559 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp1023892 
End bp1026279 
Gene Length2388 bp 
Protein Length790 aa 
Translation table12 
GC content44% 
IMG OID640391874 
Productpredicted protein 
Protein accessionXP_001386384 
Protein GI150866706 
COG category[K] Transcription 
COG ID[COG5190] TFIIF-interacting CTD phosphatases, including NLI-interacting factor 
TIGRFAM ID[TIGR02250] FCP1-like phosphatase, phosphatase domain 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAGACC TGACACCGAT AACGTTGCCC CAGTCGGTGC CTTTTCCTGT GACCATCACT 
TCCATCGACT GTTCTGTTGG CCAAACGGTT GCAAAGCACA AGCCCATTTT CAAGTACAAG
TACTGGGAAT ATCAGGATGA TCCCAATTCC CAGGAAGCTG TCCCTCCCAA AATCAGAGTT
GAACGTATTG GAACATTCGA AAGCCCTATT GAAGGTGAAG TGCAGCAAAT AAACATTGCT
GTTCTCGACG AAGTAGCCCA TTCCGGTATT GAACTCTGTC TGATTAGAGA ACCATGTAGT
CATACTGTCC AATACGGAGG TCTTTGTGCA CTTTGCGGTA AAGCTGTTGA AGATGAGAAA
GACTACTCAG GCTATACTTT CGAAGATCGA GCCACTATTT CCATGTCACA TGATAACACT
GGACTCAAGA TCTCGCTCGA TGAAGCTGCT AAAATCGAGC AGTCAACTAC CGACAGACTC
AATGAAGAGA AAAAGCTCAT TCTCGTAGTA GACTTGGATC AAACAGTCAT CCACGCCACT
GTGGACCCCA CTGTAGGCGA ATGGCAACTG GATCCATCCA ATCCCAATTA TCCTGCCATT
AAGGATGTCA AGACATTCTG TCTTGAAGAA GAAGCCATTG TTCCTCCAGG ATGGACGGGA
CCTCGCTTGG CTCCTACCAA ATGCTGGTAC TACGTTAAAG TGAGACCAGG GCTTTCGGAC
TTTTTGGAAG AGATTGTTAA CCTCTACGAA ATGCATATCT ACACAATGGC AACTAGGAAC
TACGCCCTTG CCATAGCCAA AATAATCGAT CCCACTGGAA AATACTTCGG TGACAGGATA
TTAAGTCGTG ACGAGAGTGG CTCTTTGACG CACAAGAACT TGAAGCGTCT TTTCCCAGTA
GACCAGTCCA TGGTAGTAAT CATTGACGAT AGAGGTGATA TTTGGCAGTG GGAGAGCAAC
CTTATCAAGG TAGTACCGTA TGACTTCTTT GTAGGCATTG GAGATATCAA TTCCAGTTTC
TTGCCTAAGA AGAATGGGCA ATTGACAGGT CCTACCAAGA AGAGAAAGTC CATTGCCAAA
TTGGAAGCTC AGGCCACAGC TGAGCTTCAA GTAGAAGAAG AACCTTCGTT TGATGATGAT
GAAGACAAAG CCGCTGCTGA TTTAGATGCT GACTCAGCAT CACCAGTGGA TAGAATTCTC
GAACTTGGTG GAGGCGAAGG TAACACCGAT CTCTTGATAG AACAGTCAAT CACCAGAAAC
CAATCTATAG AGCAACAGCA GTCCGAAAGA CCGCTTGCTA AATTGCAACA TGATTTAGAG
AAAATGCATC ACCATGAGCA GGATGGACAT TCGAGGTCCA ATTCTCCTGA CGGAACATCG
GCTGATGAGG AAGAAGATGA CGACGATGAT GAAGATGATA ATTTGCTTTA CGACGACGAT
AACGAATTGA CTGCATTAAA CAAAGTGTTG ATCAACATTC ACAACCAATA CTATAAGATC
TTGGGTGAAA ACATCCTTAA GAACTCTTCA TTGAAGCCAG ACTTGACCAA GATCATTCCA
TACATGAAGA GTCAAACTCT TGAAGGAATT ACTGTGTTGT TCTCAGGAAT AATTCCCTTG
GGAATTAACT TTGAAAATGC TGACATCGTC ATCTGGTGCA AGCAATTTGG CGTCAAGGTA
GTAAATGAAG TGTACCCAGA AGTGACCCAT GTAGTTTGTA GGGATCCTAG CAACGGTCAG
GGACCTACAT TTAAGGCACG AGTTGCCAGA AAGATACTAC CTGATGCTCA TATCGTCAAC
CCTGATTGGT TGTTTGCGTG TCTTAGTGCA TGGAACAAAG TTGACGAAGC TGACTACTTG
ATCCCGCTTG GGGACGAGAA GTTATGGGTT GTTAGAAGTA AAGATATTGA AAAGTACCAA
AAGGCACTTG AGAATCAGAA ACAGAAAGAA GCCGACCGCT CAATTCAAAG GCCTCGTTTT
GGATCCATAG ATTCAATCGA GGAGTACGAT CTCGAGCAGG CTAATCAAGA GGTTGATGAA
TTCTTGGCAG GCATCAGCGA CGATGACGAA GATGACGAAG ATGACTTGGA AGCCACATTG
GTGGACGAGA TTAGAAATGG GGGAAATTAT AAGAATGTCA ATGAAGAAGA TGAATACGAC
GAGGAACTTG AAAACGGCAA GTCCGCTGAT TCCTTCATTA AAGAACTCTA CAGCACCAAG
AAGAGAAAAC ACGAACAAGA GGAAGAAGAC GAGAGTGAGA ATGAGGTTGT GGAGGAGCCA
GAGTCAAATG GCCAGGTGCA CAAAAAACAG AAACCTTCGA AGGATGAGGA CCTTGATGAG
CTTGAACAGG AATTGCTTGA TGGGTTTGAC GATTTGGAGG AGGACTAG
 
Protein sequence
MSDSTPITLP QSVPFPVTIT SIDCSVGQTV AKHKPIFKYK YWEYQDDPNS QEAVPPKIRV 
ERIGTFESPI EGEVQQINIA VLDEVAHSGI ELCSIREPCS HTVQYGGLCA LCGKAVEDEK
DYSGYTFEDR ATISMSHDNT GLKISLDEAA KIEQSTTDRL NEEKKLILVV DLDQTVIHAT
VDPTVGEWQS DPSNPNYPAI KDVKTFCLEE EAIVPPGWTG PRLAPTKCWY YVKVRPGLSD
FLEEIVNLYE MHIYTMATRN YALAIAKIID PTGKYFGDRI LSRDESGSLT HKNLKRLFPV
DQSMVVIIDD RGDIWQWESN LIKVVPYDFF VGIGDINSSF LPKKNGQLTG PTKKRKSIAK
LEAQATAELQ VEEEPSFDDD EDKAAADLDA DSASPVDRIL ELGGGEGNTD LLIEQSITRN
QSIEQQQSER PLAKLQHDLE KMHHHEQDGH SRSNSPDGTS ADEEEDDDDD EDDNLLYDDD
NELTALNKVL INIHNQYYKI LGENILKNSS LKPDLTKIIP YMKSQTLEGI TVLFSGIIPL
GINFENADIV IWCKQFGVKV VNEVYPEVTH VVCRDPSNGQ GPTFKARVAR KILPDAHIVN
PDWLFACLSA WNKVDEADYL IPLGDEKLWV VRSKDIEKYQ KALENQKQKE ADRSIQRPRF
GSIDSIEEYD LEQANQEVDE FLAGISDDDE DDEDDLEATL VDEIRNGGNY KNVNEEDEYD
EELENGKSAD SFIKELYSTK KRKHEQEEED ESENEVVEEP ESNGQKPSKD EDLDELEQEL
LDGFDDLEED