Gene PICST_32194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32194 
Symbol 
ID4839584 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp912203 
End bp914275 
Gene Length2073 bp 
Protein Length673 aa 
Translation table12 
GC content40% 
IMG OID640390899 
Productpredicted protein 
Protein accessionXP_001385185 
Protein GI150865815 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.707923 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCCCCT TACCAAGTCT AGAGGAGTTA GAAGTCGACT TCAAGAGTCT TGTAGTGATG 
TCAAATTCAG TAGAAGGTAC ACCTGAAAAT GAGCTGGGCG ACAAAGAATC CTGGAGCGAC
GAACTTACTG AGCCCAATTC CAGCGCCACT AGTCCCCGTA TTGACCCTAA CATTGTTCTC
TACAGTCATA TTGCATTTCT CTACGAGCAG AGGAAATTTT ATGAGCTTCT TGCTTCCATT
CCAAATATCA ATACCAACAG TCTGTTACCA CTGGAGATCC ATTTAATGCT TGCCTGTACT
CTTTTTAGGC TCGGAAGAAT CACGGGTGGA TTAAGGGAGA TGTCCTTGGC TATTGAGTTG
GAACCTATCT ATTACAAAAG AGAGAAATAT ATCAAGACAC TCAGTCAATT CTTCAACAAA
ATTGGCATGA AGGAAGAAGC TGTGTCTTGT TTTGGAGAGA TCATCCATAT GGCCAAAGCT
GAGGCTCTAA AAAAAAATGA CGGTGAACAT GGGCAGCAGA TGAGTCTAAA GCTCAGAAGA
TACCAAGAAG AATTGGAAGC CTTGTTAAGA GGAGATGATG AAGAAGAAGT GTATGAAATA
CGTTAATTGA GTTTATAAGA AGGTTACTAA CAACATTTTA GAGTTATGGT TGTTGAAGAT
TGTTCAAATC TCAGCTTGGT AGAACAAGCA AAACGGTACC TTTCAGCTAA TGGAATGATT
CCAAAAATGT CCATGGTTAC TGATTGTATG GCAGTACTGG ACTTATTGGG AAAAGCTAAT
AAAGCTTTGG AAGAATGGAA TGACGAAGTA GACAAACGAC CTGCAAGTGA TATCTTGAAC
TGGATATTTG ATGCCGTGAA GTATCTTGCT CCACTAAGCT TGTTCGATGA GGTTTACCAG
ATTCTACCGT TGTTGGAGAC ATTTATGAAT TGGGAGATGT GCCGATTCCC TGAAATTGAT
TACGGTATGG ACCCGATTAA GTTTGCCAGG AAACTCAGAA CTTGGGGTAT ATCTAACAAG
ACTGGCATTC GTAGTTTGTG CTGTATAATA GTAAAGCATA AAGTAGTAAT GGCACATATC
GAGTACTACA AGAAGAACTA CTCTGCTGCT ATAATTCACT TTACTTGGGT CTTCAAGGTG
ATGAAACGTC TTGAGAGAAC AGTGCCCATT ATCCAAACCC CAGTTGGTTG TCTAAGTTTG
GAAACCAAGC AAGTGGTCTT ACTTTACTTG TGCAATTGTT ATTGCTTGGA TCCAGCATAT
TCGAACTCGA AGCTCCCGCA TGTCATTACC CAGATGGCCA CACTCGAGAG CCCTCCCGAC
TCAACATACG GCCGTAGTTG TAAGTACTTC ACTTTCCTTG GATACGCTTA TGAAAAACTA
GCCTACAACG AAGCCCAAAG TATTATGATT TGGAAGAATG GAACTGGGAC AAGTGCTTTC
AGGTTGAAAC AGAGTCACAT TATGGATATG CTCAGGAAAT ACATCCTCGC CACAGCAAAC
GGCCTAGATG ATGACCCCTC AGTACTTGAA GTTTATGATA GGATAATATG GGGGATTCTT
GTTCATGGAG GTATTCATCT CAGAACCCTT TGGTTCTTCA TTGTGTTGAA AAACTATTTC
TATCTTGAGT TCGATTTTGG GTGTTTTCAA TTATCAAAGG GACATAGATA CCTTCAATTT
GAGATTAATC GGATCCTTGA TTTCTTCGTC AACGGATGGG AGATAGTTGA CAAATGCAAG
GATCTCTGTT CAGATATCGA AGGATTCGAA AGCGATGACA TCTGGAATGT CGACCATGGC
AATCTCTACT TAATCCCTCA AATCTATGTG ACTCTGAAGA GCCTTGTTCT TATGAACGAA
CAATATGACG ACAGACTAGA TGTCAATTCA TTTGTCTATA TGTCCAAATA TCAGATTAAA
CCAAAGATCA AGTCATGCTT CAAAAGAACG AAAGAGACAT CTATATACAA GGAACGTATA
GATGCCAGTC AGGAGTTGAT TCATCTCTGG TGTACTGCTT ATCGGGAACA TCATCATACC
ATTCCCGGAA TTATCTCTGA TTTAGTTGTT TAG
 
Protein sequence
MVPLPSLEEL EVDFKSLVVM SNSVEGTPEN ESGDKESWSD ELTEPNSSAT SPRIDPNIVL 
YSHIAFLYEQ RKFYELLASI PNINTNSSLP SEIHLMLACT LFRLGRITGG LREMSLAIEL
EPIYYKREKY IKTLSQFFNK IGMKEEAVSC FGEIIHMAKA EALKKNDGEH GQQMSLKLRR
YQEELEALLR GDDEEEVVMV VEDCSNLSLV EQAKRYLSAN GMIPKMSMVT DCMAVSDLLG
KANKALEEWN DEVDKRPASD ILNWIFDAVK YLAPLSLFDE VYQILPLLET FMNWEMCRFP
EIDYGMDPIK FARKLRTWGI SNKTGIRSLC CIIVKHKVVM AHIEYYKKNY SAAIIHFTWV
FKVMKRLERT VPIIQTPVGC LSLETKQVVL LYLCNCYCLD PAYSNSKLPH VITQMATLES
PPDSTYGRSC KYFTFLGYAY EKLAYNEAQS IMIWKNGTGT SAFRLKQSHI MDMLRKYILA
TANGLDDDPS VLEVYDRIIW GILVHGGIHL RTLWFFIVLK NYFYLEFDFG CFQLSKGHRY
LQFEINRILD FFVNGWEIVD KCKDLCSDIE GFESDDIWNV DHGNLYLIPQ IYVTSKSLVL
MNEQYDDRLD VNSFVYMSKY QIKPKIKSCF KRTKETSIYK ERIDASQELI HLWCTAYREH
HHTIPGIISD LVV