Gene PICST_33495 details

Gene Information

Locus tag: PICST_33495
Symbol:
ID: 4840628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009047
Strand:
Start bp: 785147
End bp: 787372
Gene Length: 2226 bp
Protein Length: 741 aa
Translation table: 12
GC content: 42%
IMG OID: 640391943
Product: hypothetical protein
Protein accession: XP_001386346
Protein GI: 150866678
COG category:
COG ID:
TIGRFAM ID: [TIGR00756] pentatricopeptide repeat domain (PPR motif)
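
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_cds(cds_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


gene_length = feature_length(785147, 787372)
print(gene_length)                           # 2226
print(protein_length_from_cds(gene_length))  # 741
```

Under these assumptions the computed values agree with the reported Gene Length (2226 bp) and Protein Length (741 aa).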


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.375404
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0265895
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATCCCCT TGAGACAGAC GGTGTTCAGG AACTTAACCA GGACAAATAT TGTGAAACTC 
AGAAGACTGG GGCAATTTTT GAGTCGTTTT CAATCTACAG CAGCTGACAC TTTGAAGGCT
ACTACTTCTA CGAGCAGCCC AATAGTGAAA AATGACAAGA ATGTCTCAGA AAAGGCACCA
GAAGGTTCAA CTAAAAACGG AAACCACTCC AAAAAGGCTC CTCATCGTAA TGTAAAACTC
AGAGATATCA GTAACCAGAT CCAAGATTTG GTTAAGTCGT CCAAATCTGA TCTCACAGAA
GCCATAGAGA TCTTGGAAGA AGGGTTATCG TACTTGAGAG AAATTCAATT GGCTGAAAAC
ATCTCCGACA ACTCGATCTA CTTCCGTTTC CAGCCAATTG TGACCGAGTT ATTGTTTAAA
GCTTTAGACC CCGCTACTTC GCTCGGAAAT AAATCAGTGG AGGACGTTTT GGAAATCTTC
ACCAAGTATG GCGTCTGCCA CAAATACCAT TACACTGTAG TAGCTTCCAG GTACTTCAAG
AGTGGAGAAG ATAAGGCTGT GATCTATCAG AATGTGCTTA AGCTCTGGCT CCAATTTTTG
GAATTCGAAA AATCGAACAA TGCTACCGGA ATGGCTTCTG TCAAGGCTGG AGAAATAGAA
TACCGTCCTT ACTACCTCCC CAACCTTGTT TATTTCGCAT ACAGCCAGAC ATGTAGTCTT
CAGGGGGTCA AGTTTTCGTT TGAAGATGCT TCCAAGCTCT TGAATCAGAC GTTGCCCAAT
CCAGTTTTGA TCAGAAACTC GCTTATGGAC TTGCGTATAT TCGACGGCTA CAAAAAGGAG
TTCCAAAGTT TTCAAAAGAG CATCCACGAG TTGTCTGCTG AGTCACTTGA TCCTAACGGT
CCTGAAGTCT ACAGGAAAAT CAGAGAAGCT GGTGAAAAGA AGAATCAGGT AGCCTTGAAT
ATAATCCAAA AAGAAATCCA GGAAGCTGCA GCTAGAAACA ATAATCCAAT CAATGAAGAC
ACCTTGATCA GATTAATGGA TGGATACTAT GAGGCTGATA GACCAGATGA GGTTTTTGCT
ATTTTCCAGA ACTTGCTTCT GCACGGAATC GAGAAACCCT CCATTCGTGC CTGGGATGTA
GTGTTGAGAA CAATGGGTAC TCCATCATAT ATCTCTAAGA TTTCATCTGC TCAAAGAGGC
AAGCTTATTA AAAGTATTGA GCTGACCATC GAAACTATCT TGAACAATGG CACAGAGATC
ACTGCTAAGA CCCTTTCTAT CATCATTGGT AGTTTTGCCA ACTTGAACAA GTTCGACAAG
GTTGACGAAT ACTTGCAGAG ATTTAGCATA GAAGGCGAGG GTAAGCTTCC TGTGATTGCT
CCAACCAAGA ATAACATTTT GATTGGATTA GCGTTGAATA AAAAGATTAG CGAAGCGGAA
GAAAAGTTGA AGGAGTTTGT CAGAGCTGGT GGGTACGTTC CTTCTACGTC TGTGATGAAT
ACATTTTTAG GCTACTATGC TAAAATTAAT AACTATGCTG CCGTGGAAGG TATTCTTGAG
TTCATGAAGA AGCACAACAT TCCAGAAGAA GTTGGAACTT ACACTTCAGT TATCGACATC
TACTTTAAGA TGCACCGTGA AAAGGGATTG GTAGCTGACG TTGACAAAGT TCTCGACAAC
ATCTCTGCTT CCAAATCTAT ACCATTGAAC GACTTCACTT ACACCGCATT GATTGACGGT
TTGGTGAAGA ACGGAGCCAA TATAGAAGCT GCTCGTTCCA TCTTCGAAAA GGCTTCGAAG
AAGTACCCTG CTTCTGCTCA CTTGTACACT GCCATGTTGA GAGGTGAGTT GGACCAAGGC
TCTGTCAGTT CAGCAGAAAA GTTGTTTGAT GTCTACATTA AGAAGATTAG AAACGACGCC
AGAATCTGGA ACACAATGAT AAACTCGTTG TTGAGCAAAC GTGAAGAGCT TGCCTTGCAA
TACTACGAAA ACTTGAAGAA TGATGCCCAT TCTTCGCCAA ACCACTTCAC ATACTACTTC
TTGTTCCATC ACTTCATAAA GAGAGGAAAC AAAGAAACAG TGCAACACCT AATTGACGAC
TTGTCACAGA AACCTCTCAG AGACTTCGGT AATGAGTTGC CCAAGATGTT GGGCAAATTA
ACGGGAGAAT ACAAATTTGG TCCAGAACTC ATCAACATTC TCTCCAACCA AAAGAAGCAG
AACTAG
 
Protein sequence
MIPLRQTVFR NLTRTNIVKL RRSGQFLSRF QSTAADTLKA TTSTSSPIVK NDKNVSEKAP 
EGSTKNGNHS KKAPHRNVKL RDISNQIQDL VKSSKSDLTE AIEILEEGLS YLREIQLAEN
ISDNSIYFRF QPIVTELLFK ALDPATSLGN KSVEDVLEIF TKYGVCHKYH YTVVASRYFK
SGEDKAVIYQ NVLKLWLQFL EFEKSNNATG MASVKAGEIE YRPYYLPNLV YFAYSQTCSL
QGVKFSFEDA SKLLNQTLPN PVLIRNSLMD LRIFDGYKKE FQSFQKSIHE LSAESLDPNG
PEVYRKIREA GEKKNQVALN IIQKEIQEAA ARNNNPINED TLIRLMDGYY EADRPDEVFA
IFQNLLSHGI EKPSIRAWDV VLRTMGTPSY ISKISSAQRG KLIKSIESTI ETILNNGTEI
TAKTLSIIIG SFANLNKFDK VDEYLQRFSI EGEGKLPVIA PTKNNILIGL ALNKKISEAE
EKLKEFVRAG GYVPSTSVMN TFLGYYAKIN NYAAVEGILE FMKKHNIPEE VGTYTSVIDI
YFKMHREKGL VADVDKVLDN ISASKSIPLN DFTYTALIDG LVKNGANIEA ARSIFEKASK
KYPASAHLYT AMLRGELDQG SVSSAEKLFD VYIKKIRNDA RIWNTMINSL LSKREELALQ
YYENLKNDAH SSPNHFTYYF LFHHFIKRGN KETVQHLIDD LSQKPLRDFG NELPKMLGKL
TGEYKFGPEL INILSNQKKQ N
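
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The sketch below illustrates the effect on the first 90 nucleotides of the gene sequence above. It is a hand-built, minimal codon table that changes only the CTG sense codon relative to the standard code; the helper names `build_codon_table` and `translate` are hypothetical, and start-codon handling is not modeled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_codon_table(ctg_as_serine: bool) -> dict:
    """Map each codon to an amino acid; optionally apply the table 12 CTG change."""
    codons = ("".join(triplet) for triplet in product(BASES, repeat=3))
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # the sense-codon difference in translation table 12
    return table


def translate(dna: str, table: dict) -> str:
    """Translate in frame 1, ignoring whitespace, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 90 nt of the gene sequence listed in this record.
fragment = (
    "ATGATCCCCT TGAGACAGAC GGTGTTCAGG AACTTAACCA GGACAAATAT TGTGAAACTC "
    "AGAAGACTGG GGCAATTTTT GAGTCGTTTT"
)

print(translate(fragment, build_codon_table(ctg_as_serine=True)))
# MIPLRQTVFRNLTRTNIVKLRRSGQFLSRF
print(translate(fragment, build_codon_table(ctg_as_serine=False)))
# MIPLRQTVFRNLTRTNIVKLRRLGQFLSRF
```

With CTG read as serine, the output matches the first 30 residues of the protein sequence listed above; with the standard code, residue 23 would be leucine instead.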