Gene PICST_41699 details


Gene Information

Locus tag: PICST_41699
Symbol:
ID: 4837361
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 253197
End bp: 254948
Gene Length: 1752 bp
Protein Length: 557 aa
Translation table: 12
GC content: 43%
IMG OID: 640388676
Product: predicted protein
Protein accession: XP_001382814
Protein GI: 150864112
COG category:
COG ID:
TIGRFAM ID:
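
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 253197
end_bp = 254948
gene_length_bp = 1752
protein_length_aa = 557

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding portion is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Whatever remains is not accounted for by codons (for example, intronic sequence).
non_coding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span matches the reported gene length, and the bases not accounted for by codons would be consistent with the gene being flagged as spliced. The record itself does not list intron coordinates.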


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.517646
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGTCT ACAAAGATGT CGAGAGAACA TCGAAATTGA AGACGTTCTC CAACCAGTCC 
ATCTTGATGT CGGCAATGGA AAGCTACAAC GATCAAGACG ACGACGACGA CCAAAGCAGT
GATGTTGACT TCAGTTTCGC TTCCAGTGAA GAGGAGGAAG AGGAGGACTA TAACGACATG
ACGCCGGAAG CAGTGGAAGC GGTGAAATAC TTGAAAGGTG TCATCGGCCA GATTGCGAGC
CAGACTAAGA AGTTGAATAA CGAGTTCGAG AAATTGGCGA ACAAGAAGTT AAGAAAGAAC
AACTTGAGCA CTATCGAAAG TAAAAAGGAG AAGATACAAT CTACAGTGGA ACTGAATAAG
TTCCACACCA AAAAGCTCTT TAAGGTCATC AAGTATGTCC GAGCCAACAA GATGTCCGAT
ATGAACTTAA TCTGGTTGAT AAAGGACGAC TTGAACAACT ACTTGGAGAA CAACGGCAAT
ATTGATTTTA CAGACGACAC ATCCATATAT GATGACATCT TCAATCTGGT AGTCATAGAA
GATGACTACT CCGAATTCAA CGACTCAGAA ATACACTCCA ATACAAGTAG AGACCCAGAA
GAGGTACCAA TCAAGAACGG AAGTGCCAAC AACAATGTGC TAGGAAGACT AAGCTCTGGC
ACTGTGGAAA CGAGGACACA ACCGAACCAC ATCAACACAT CCATACCAGC GTCACCTGTA
AATAAACACA TGAGCCCAGA GTTAGCAAGT CCAGCTATTG TCAGAACACT TAAGCCAGCT
TCCACACCTT CAAAGCCCGT AGGGAACTTA AAATGGTCTA CAGCAGCAGC AGGTATCCTG
GAGGTTTCCG AAGAAAGTCA CTACGAAAGC AGTAGAGCTT CGGCAGCTTC TTCCGTAAGT
CCCAAAGTGA CTAATGGTTC TACTACTGTT GCTCCATTGA GCACCGTGAA ATCTTCTTCC
AGTAGAGTAG AGACCAAATT CGTCCATGTC TTGGAGAATT CGTCATTGCC GCAATCTGAG
TTGAACTTGT TCAGTGACTT AAACTTAGTC AAGTTGCCGC CAGGAATGCA AGATTTGATA
ATATCATTCA CATCTAAAAG AAATAACCCC GAAGACTTCA AGTTGCTCTG TAGCACCCGC
AGCTACAATC AGTACGTGAC TCCAATCAAG AAGTGCAACT TCCCAGAACT TGATGCAGCA
GGAAATGTAG GTGGAAACAA CAATAATAAA CAATTCAAGC CACCGGTGCA GTTATTCAAG
TTGCTGTCGT ACTGGAATAG AATCAGAGCT AATGACGAGT TTGATAGAAT CTTGGAAGAG
ATACAGACAT TAAGTGAAAA AGACTCTGGC GAAGGCAATC CAATAGCAAA CGAGTTGACG
TTGGTGTTGT TTTATGGATT CTACTTTGGT TTCACGCCTG TGGAAAACTT GATTGCCGAA
TCGTGCTTGT TCAAGTTAGG CTGGAGACCA TACAATACCA ATCACAGCGA CTCTTCACAG
TTAAATCAAA GCCAAAACCA GATATCGTCA CCATCCAGTA ACGGAAAGGT TTCGCAGTCT
TCAAAGGACA AAGTAACAGT GCACAGCTGG GTTAGACGTA TTAAGTTATT GTCGAATTCA
GAAGAATCCA CAGCCTTTGA GATTGGAGAC TATCAGGTGT TTGACTTGTC TTTCTGGGAA
GTCTACATCA AGTACGGCTT CACATTGGAC CTCAGTCTCT GTAAAACAGA GCCAACCAGC
GCCATTTGCT AG
 
Protein sequence
MEVYKDVERT SKLKTFSNQS ILIEEEEEED YNDMTPEAVE AVKYLKGVIG QIASQTKKLN 
NEFEKLANKK LRKNNLSTIE SKKEKIQSTV ESNKFHTKKL FKVIKYVRAN KMSDMNLIWL
IKDDLNNYLE NNGNIDFTDD TSIYDDIFNS VVIEDDYSEF NDSEIHSNTS RDPEEVPIKN
GSANNNVLGR LSSGTVETRT QPNHINTSIP ASPVNKHMSP ELASPAIVRT LKPASTPSKP
VGNLKWSTAA AGISEVSEES HYESSRASAA SSVSPKVTNG STTVAPLSTV KSSSSRVETK
FVHVLENSSL PQSELNLFSD LNLVKLPPGM QDLIISFTSK RNNPEDFKLL CSTRSYNQYV
TPIKKCNFPE LDAAGNVGGN NNNKQFKPPV QLFKLSSYWN RIRANDEFDR ILEEIQTLSE
KDSGEGNPIA NELTLVLFYG FYFGFTPVEN LIAESCLFKL GWRPYNTNHS DSSQLNQSQN
QISSPSSNGK VSQSSKDKVT VHSWVRRIKL LSNSEESTAF EIGDYQVFDL SFWEVYIKYG
FTLDLSLCKT EPTSAIC
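
The record lists translation table 12, under which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how that affects translation, not the database's own pipeline. It builds the codon table by hand from the standard code and applies it to the sixth line of the gene sequence (bases 301 to 360), which is assumed here to lie in frame within an exon. The helper `translate` and the variable names are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
STANDARD = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

codons = ["".join(c) for c in product(BASES, repeat=3)]
table12 = dict(zip(codons, STANDARD))
# Translation table 12 differs from the standard code in reading CTG as serine.
table12["CTG"] = "S"


def translate(dna, table):
    """Translate an in-frame DNA string, ignoring spaces used for layout."""
    dna = dna.replace(" ", "").upper()
    return "".join(table[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# Sixth line of the gene sequence above (bases 301-360); it contains one CTG codon.
fragment = "AACTTGAGCA CTATCGAAAG TAAAAAGGAG AAGATACAAT CTACAGTGGA ACTGAATAAG"
peptide = translate(fragment, table12)
print(peptide)
```

The resulting peptide can be compared with the second line of the protein sequence, where the residue encoded by CTG appears as S. Translating the full gene this way would first require removing the intron, whose coordinates are not given in this record.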