Gene PICST_35594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_35594 
Symbol 
ID4837953 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp1190610 
End bp1191866 
Gene Length1257 bp 
Protein Length418 aa 
Translation table12 
GC content45% 
IMG OID640389268 
Productpredicted protein 
Protein accessionXP_001383855 
Protein GI150864860 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000667051 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGGTC CTATTAAGAG CAAGAACGGT GACATTTCCA AGGCGCCAAC ACCGCAAAAC 
ACCCCTGCTT CGGTTACTAA TTCCTATTTG AGATCGCAGC CTCCTACCGT TTCCACGATT
GAAGAAACCA ACGAGGAAAA TGTTGGTCAG CAACTCGCTA ACAATCCGGC ACTTTTGCTG
ATGATCCAAG GCAAATTGGG GGATCTTGTC GGAGCACAGA GCGGATATAT TGACTCTTTG
CCAAAGTCTG TCAAAAAGAG AGTCTGGGGT TTGAAGGCGA TCCAACAACA GCAGATGAAG
TTAGAGGCTG AATTCCAGAA GGAGCTCTTG AGTTTGGAGA AGAAATACTT TAAGAAGTAT
GAGCCTTTGT ATGCAAGAAG AAAAAAGATC ATCAATGGCG CTGAAGAGCC CACTACTGAG
GAGATTGAAG AAGGTGAAGC ATTGGAGGAA AATGACGACG AAGATACCGA AGAAGCAAAG
ATCCAGGAAT TGAAGGATTC CAAGGCAGAA GAAGACGATG AAGAAGAAGA AGATGACGAA
GAAGCTGCTG CTGGTATTCC CGGCTTCTGG TTGACATCGT TGGAGAACTT ATCAACTGTA
TCTGAGACCA TCACAGACAG AGATTCGGAA GTGTTGGAAC ACTTGATAGA CATCAGAATG
GAGTACTTGG AAACCCCAGG CTTCGAATTG ATCTTTGAGT TCGAAGAGAA TGAATTCTTC
TCTAACCAGA TCTTGACGAA AACTTACCAT TACCAGGCCG AACTCGGTTA CTCTGGAGAC
TTTGTCTACG ATCATGCAGA TGGCTGTGAA ATTAACTGGA AGCTGAAGGA GAACAATGTT
ACTATCAATA TCGAAAGAAG AAAGCAGAGA AACAAGAACA CCAAGCAGAC CAGAACCATC
GAGAAGTTGA CTCCTACAGA ATCCTTCTTC AACTTCTTTG ATCCACCTAA GCCTCCTAAG
AGGGATGAAG AAGATGATGA AGAAGAGAAG GACGATGAAG ACGAAGAAGA CGAGGAAGAC
GAGGACTTGG ATGCCCGTTT GGAATTGGAC TACCAGTTGG GTGAAGAAAT TAAGGACCGT
TTAATCCCCA GAGCCATTGA CTGGTTCACT GGAGATGCTG TTGAGTACAA CTTTCCAGAA
GACTTTGACG GACAAGAAGG AGAAGAGTTG GACAGTGAAG AAGACGAAGA TGACGAGGAC
GACAGCGAGG ACGAGGGCAA ACCAAAGGAA AACCCTCCAG AATGCAACCA ACAGTAA
 
Protein sequence
MSGPIKSKNG DISKAPTPQN TPASVTNSYL RSQPPTVSTI EETNEENVGQ QLANNPALLS 
MIQGKLGDLV GAQSGYIDSL PKSVKKRVWG LKAIQQQQMK LEAEFQKELL SLEKKYFKKY
EPLYARRKKI INGAEEPTTE EIEEGEALEE NDDEDTEEAK IQELKDSKAE EDDEEEEDDE
EAAAGIPGFW LTSLENLSTV SETITDRDSE VLEHLIDIRM EYLETPGFEL IFEFEENEFF
SNQILTKTYH YQAELGYSGD FVYDHADGCE INWKSKENNV TINIERRKQR NKNTKQTRTI
EKLTPTESFF NFFDPPKPPK RDEEDDEEEK DDEDEEDEED EDLDARLELD YQLGEEIKDR
LIPRAIDWFT GDAVEYNFPE DFDGQEGEEL DSEEDEDDED DSEDEGKPKE NPPECNQQ