Gene PICST_29003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29003 
Symbol 
ID4851740 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2709410 
End bp2711905 
Gene Length2496 bp 
Protein Length831 aa 
Translation table 
GC content36% 
IMG OID640393448 
Productpredicted protein 
Protein accessionXP_001387086 
Protein GI126275441 
COG category[R] General function prediction only 
COG ID[COG5141] PHD zinc finger-containing protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.596418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.417698 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACC AAACTTCATT CACAACAGGC AGATCGATTG CTGATCCCAA CTACGCCCAG 
GTGGCGCTCT TTGACAGTCA CTCCAAGCCA CGTGAGGAAA GGGACTTCAA AGAAATATAC
CCCGATCTTG ACGAAGGCAT CCAGCTCAAC ATGTTTGTAT TGAATAATGA CGCGGAGGAT
GACAATGACA TCGACATGGT TAATCCTGGT CCTGTAATTA TGTCCGAATT AAAGCAACCC
GTGTTCAATA AGATAGATGA TGGAGCGGGT GACGCAAATC GTTTAAATGT CAAATTTAGT
AAGTCTATAA CAGCGTATGG CTTCCAGGAA CAAAATAGAG TAATTAGTAA AACTATCCAA
GACACATATA TTCGTCCGTT CCAGTTGCCT GGTTCAAACG AAAATGGTTC CGATGATGTT
TCCATAATTG GAAGAATTTT GGAAAGAAAA CGAAACGCCG TAGAATATGA TATGGACGAA
CAGGATTGTT TGTTCTTGAG TTATAGAAAC CAACAACCCA ACAATGTAAT CAAAATAACT
CCAGAAGTGT TTGAAATTAT GATAACGACA TTAGAAAATG AATGGGACAA ACTAGAACAC
CAAATGAATT CGATAATAAA TAATAATGGC AGTCTGGAAA GAGCCTATGA TGGCTCTACG
ACGTTTCTTC TGTTGGACCG AGATGATATT GAGAAATACG GAACCGACGA CGGCATAATT
CCAGGTTCTA TATATGATCA AAGGTGTGCC GTATGCAATG ATAGTGATTG TGACAATTCG
AATGCTATTG TTTTTTGTGA TGGTTGCGAT ATAGCTGTAC ACCAAGAGTG TTATGGAATA
GCATTTATTC CTGAAGGGCA ATGGCTTTGC CGAAAATGTA TGATAAACAA GAACCGTAAA
ACGGATTGCG TTTTTTGTCC GAGTAAAACG GGTGCATTTA AGCAACTTGA TAATAGTTTG
TGGAGCCATG TCATTTGTGC TTTATGGATT AATGAATTAT ATTTCGCCAA TCCGATTTAC
ATGGAACCTA TTGAAGGCAT AGATTTAATT CCGAAAAGCC GTTGGAAATT GGTTTGCTAT
ATTTGTAAAC AGAGGATTGG TGCTTGCATC CAATGTACTA ACCGAAACTG CTTCCAGGCA
TATCATGTCA CATGTGCTAG AAGAGCTGGT CTCTACATGG AAATGACTAT GGGAATGCAA
GGTGCTATTA GCAACAAGAT GACATTGAGG ACTTTTTGTG ACAAGCACAG TCCGCCAAGT
TGGAATGCAG AAGATATTCC CAGGGGCATT CAACGGACTA GGTTGTTCTA TCGTGATACG
AATATTCTAA ATGAAAGAAA TGCCAAACTC AGTAGCTATC AAAAGACTGC CAACAAACTT
AATATATTCA AATGGAAAAC AGAAAATAAT ACTCCAATTG CCCCCAAGGT TTTTAGTGAT
GTTTTGTTTA ATATTTTGCA AGCATTAAAA GTTGAAAATC AAGTTTCCTT AGAAGAAAAT
AATCAATTGA AAGACTTGAA TGTCTTGCCT AATCGATCTC GTGAAGATAT TTGGGCGGAC
ATGAGAAGTA TCAGCAATGA AATTTGCCGC TATTGGTGTT TAAAAAGAGA ACTGAAAAAG
GGAGCTCCAT TAATAAGAAA GAATAACAAT TTCGTTTCCA CGAGCTCCAT TTTATATAAC
GCTAATGGAT CTGATACTAG AAGTGACGGA GATTATTATG AACAAATTGA CGAAATTAAA
GGACGGATAG ATTTTGCAGA AGTTTTGGTC AGAGATTTAG ATAAGGTGAT TGATATGAAT
AAGGATTCAT TATCGAGACA AGTCTATTCA CGAGAAGTGC ATAACTTAGA ATTTGAATCA
ATTGATGCTG TTTATTTTCC TATCAAGAAA GTCATCGAAA ATTGTTTGAA TACTTTAAAT
GAAAAATTCG ATGCTAATAG AATTATTCAA GGATATCAAG TCAAGGTGGA AAATGAAAAT
GAAAATAGAA GGAAATTTGG TGATTTACCT GTGCAAATAT CTGAACAAGA TAATAAAAAT
GGTGAGGGGG TGCTGGATTT TCTACAGCGG GATTTTACTG ACATGTTACC GCGACCACTT
ACATTGTCTC AAATAATTTC AAAAAACGCA AAGTACGGTT ATTTCACAAT AGAGGAGTTC
AATAAAGATC TTCAGAAATT TAGTGAAATT GTTCTTCAAC GAAGTAAATC ATCAAACAAT
ATTCACAGGC TAATGAAGAG ATGGATGAAA GAGTATTACA AGATACTTCC AGAGATGTTT
TCCTTTGAAA ACAAGGTTAA AAGGGAATTA AATAATAACG AGAAAATATT GACAAGTAAT
TATTTTAGTT CCCCTTCCCT TGGCGTGCAT GGGAGTGAGA TTATGTTTAA ATCATATGAT
GTCAAAGAAT TATTGGACGA GAATGACTTG AGTGAAGTGG AAAATGATAT GAACGATGCA
AATCAACATG AGCTACATAG ATTTTTAAAT GGTTGA
 
Protein sequence
MSNQTSFTTG RSIADPNYAQ VALFDSHSKP REERDFKEIY PDLDEGIQLN MFVLNNDAED 
DNDIDMVNPG PVIMSELKQP VFNKIDDGAG DANRLNVKFS KSITAYGFQE QNRVISKTIQ
DTYIRPFQLP GSNENGSDDV SIIGRILERK RNAVEYDMDE QDCLFLSYRN QQPNNVIKIT
PEVFEIMITT LENEWDKLEH QMNSIINNNG SLERAYDGST TFLLLDRDDI EKYGTDDGII
PGSIYDQRCA VCNDSDCDNS NAIVFCDGCD IAVHQECYGI AFIPEGQWLC RKCMINKNRK
TDCVFCPSKT GAFKQLDNSL WSHVICALWI NELYFANPIY MEPIEGIDLI PKSRWKLVCY
ICKQRIGACI QCTNRNCFQA YHVTCARRAG LYMEMTMGMQ GAISNKMTLR TFCDKHSPPS
WNAEDIPRGI QRTRLFYRDT NILNERNAKL SSYQKTANKL NIFKWKTENN TPIAPKVFSD
VLFNILQALK VENQVSLEEN NQLKDLNVLP NRSREDIWAD MRSISNEICR YWCLKRELKK
GAPLIRKNNN FVSTSSILYN ANGSDTRSDG DYYEQIDEIK GRIDFAEVLV RDLDKVIDMN
KDSLSRQVYS REVHNLEFES IDAVYFPIKK VIENCLNTLN EKFDANRIIQ GYQVKVENEN
ENRRKFGDLP VQISEQDNKN GEGVLDFLQR DFTDMLPRPL TLSQIISKNA KYGYFTIEEF
NKDLQKFSEI VLQRSKSSNN IHRLMKRWMK EYYKILPEMF SFENKVKREL NNNEKILTSN
YFSSPSLGVH GSEIMFKSYD VKELLDENDL SEVENDMNDA NQHELHRFLN G