Gene PICST_73033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_73033 
Symbol 
ID4840227 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1514390 
End bp1516205 
Gene Length1816 bp 
Protein Length519 aa 
Translation table12 
GC content41% 
IMG OID640391542 
Productpredicted protein 
Protein accessionXP_001385641 
Protein GI150866147 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.365315 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TATGAGGACA CCCAGAGCCT CTATACTGTC ACGGCGCTAA TAGTGACACC CTTGTCGTTG 
TTTGCTCTCA AGAAAATTGT GCCTATCGCG ACCTCAAGAT CCCGGTCCCA ATCCAAAATC
GATGGCTCCA AGATACTCCT GGGGATTTTG GTGGCTCTCA TCATAGTTCT TAACCTTAAT
GTGTGGTACT ATCTTGTCAA CAGTATGATT GACTCTCATA ACATTTACAA GTCTGTCTCG
GGTCCTTCAT TCAACTTGTT GAACTCGCCT ATTTCTGAAA AGATAGTCCT GTTGGATTAT
TCTGACGTCA AAATCCCTGG ATATTCCTTT ATTGAAAAGG GCATCGACGA AATAGAAAAC
TTCAAGTGTG GCGATGTTCG CTTCCACGAC GACCATGGAG TCAAGGCTTC CAAGCCACAT
AGTCTTGACC AAAAGAGGGA TATGAAAATT ATCAGAGACA AGGCACTTAT AATAAACGGC
ACGGGTGAAT ATCCCATTAT CAGAAAATGT TTTCTTGACA AAGCCTGGGA AAAAGAAGAT
GTAATTGTCA AAAAGAAATG GTACAAGTTC GCTGGTTCTT CTACTTGGTT GGATAAATAT
CAAGTGTTCT TTCTTGTCAG CAGAGTAGCC TACAGTCACC GTTCCCTTCG TAACAAGGCA
ACTATCAGTA TCTTGTATGC CCAAGTTTTC AACAAAAATT GGGAAGAAAT ACTCGACTAC
CAATTTCCAC AGTCTGATAT TGTGTTCCCT GCCATTCTTC CAGTTAATTT AGATGAAAAT
CCTAGAGGAG ATAATGCCTT TTTGGGAGCC GATGATCCTC GTGTAATGTT GAGAAACTAT
AACGATACCA CTAGTGGAAC ACAAGAACAA GAGCCAGTCA TAATCTTCAA TACCTATCGT
GCCGATCTTG GCTGGAAGAG AGCTATACAT GTGTATCGTC CATTGACAAA TGTTAAAGAA
GCCATTCCAA TGAGGTTAGT TGGCATGGAA CCCAGACAGA GAGAAAAGAA CTGGGCTCCT
TTCTTTGACG AAGATGCTTC ATCCATTAAT TTCGTGTACA GTTTGAATCC TTTGCGTATT
GTCAAATGCG ACTTCAACAA CGGAGCTTGT AACAAGATTT CCGGTGACGA TTTTGAAGAA
GATGAAGCCA GACCCTTGAG AGGAGGAACC AACGTTGTCA GAATTCCAGC GTCTTTCCTT
CCTAAACATC TTGCCGAAAA GAGAGAATAC TGGTTTGGAA TTGCTCGTTC ACATGACCAC
AAATGTGGAT GTATCGAGAG GATTTATCGC CCTCACTCTT TTGTCATTTC CAAAGCCTAT
AAAACTGACG ACTACACAAT GGACTACGTC AGTTCATTTG TCGACTTTAA TATCAATACT
ATGGCATGGA ATCCGGCGTT AGAGAAAGCG AAGTGTACGG ACAGTAAGAG CGTATTAATT
CCCAACTCCA TTGCATACTG GGACGTCATT ACTACAAAGG ACAAGAATGG CAAGGATCAG
CTTGAAGACA TAATGGGTGT TACATACTCA GAAGCAGATA TTAACAACCG TCTTATCCAC
GTTAAGGGGT TCTTGCAACA TGTCGCTAAG ATCTTCAGTG GTCAGAAAGA AACTGTCGTA
AACCACTACG CCCAAGTTGA GACTGCCAGA GAGGAAAATA ATTTGTTAAG TAATTGTGCC
ACCTCACTTG CCCAAGAATA CTGTAAGCTG GCTGAAAAGA AGTTCAAATG GGGCTACGAC
AAAAACGGCA AAATGAGCAC ATGAATAGAA TAACTTTATA TCGAAGAATA CCGGGAAATA
TACCATTTTT TAATCT
 
Protein sequence
MIDSHNIYKS VSGPSFNLLN SPISEKIVSL DYSDVKIPGY SFIEKGIDEI ENFKCGDVRF 
HDDHGVKASK PHSLDQKRDM KIIRDKALII NGTGEYPIIR KCFLDKAWEK EDVIVKKKWY
KFAGSSTWLD KYQVFFLVSR VAYSHRSLRN KATISILYAQ VFNKNWEEIL DYQFPQSDIV
FPAILPVNLD ENPRGDNAFL GADDPRVMLR NYNDTTSGTQ EQEPVIIFNT YRADLGWKRA
IHVYRPLTNV KEAIPMRLVG MEPRQREKNW APFFDEDASS INFVYSLNPL RIVKCDFNNG
ACNKISGDDF EEDEARPLRG GTNVVRIPAS FLPKHLAEKR EYWFGIARSH DHKCGCIERI
YRPHSFVISK AYKTDDYTMD YVSSFVDFNI NTMAWNPALE KAKCTDSKSV LIPNSIAYWD
VITTKDKNGK DQLEDIMGVT YSEADINNRL IHVKGFLQHV AKIFSGQKET VVNHYAQVET
AREENNLLSN CATSLAQEYC KSAEKKFKWG YDKNGKMST