Gene PICST_43698 details

Gene Information

Locus tag: PICST_43698
Symbol:
ID: 4838194
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009043
Strand:
Start bp: 1006758
End bp: 1008614
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 12
GC content: 42%
IMG OID: 640389509
Product: predicted protein
Protein accession: XP_001383821
Protein GI: 150864838
COG category: [S] Function unknown
COG ID: [COG0397] Uncharacterized conserved protein
TIGRFAM ID:
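
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 1006758
end_bp = 1008614


def gene_length(start, end):
    """Length of a span given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the stop codon


length = gene_length(start_bp, end_bp)
print(length)                  # 1857, matching "Gene Length"
print(protein_length(length))  # 618, matching "Protein Length"
```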


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.237593
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.455803
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAGTAAAT TATCGGAATT GCCAAAGACT TCGTCTTTTT CAAGCTATAT AGAACCAGAT 
GGGAAGATCG CCTCTACGGA AGTAGCTGCG AAGAATGAAG ATGGCATTAT CAATAAACCA
AGAATACTCT CGTCAGGAGG ATTTTCCTAT TCGTTGCCCG AGTTGAGAAA GGAGTATCGA
TTCTTGACTG CTAACGAAGC GGCCTTGAAC GATCTTGGAC TTGATCCGGA ACAAGTCAAC
GATAAAGAGT TCCAAGAATT AGTCAGCGGA GAATTCTACC TAATGTACAA AGATACGTTT
CAAGATAAAG GTTATCCATT TCCATACTCC CAGGCATATG CTGGCTGGCA ATTTGGTCAA
TTCGCTGGAC AATTGGGCGA CGGAAGAGTG GTGAACCTCT TTGAAGTGCC GAAAGCAAAA
GTGCAGTCAA ATAATAGGCA CAAGTATGAA GTGCAATTGA AGGGCTCTGG TAAGACTCCG
TACTCGAGAT TTGCTGACGG AAAGGCTGTT CTCAGGTCTT CTATTCGTGA ATACATAATC
TCTGAGCACT TGAATGCCAT TGGAATCCCA ACCACCAGAG CTTTGTCGCT CACGTATCTT
CCTGCTACTT ACGCCCAGAG ACATGCTGCC GAGAAATGTG CCATTGTGTC TAGATTCGCT
GAGCTGTGGA TCAGGTTGGG CACGTTCGAT CTTTACAGAT GGAGGGGCGA TAGAAGTGGT
ATAAGGAAGT TGAGTGATTA TGTCATTGAT GAGCTCTTCA CTGTAGAGGG TACTAAATTC
TGCAACTTTG AAAACCTTCT CAGGGAAAAG TCTGACTTCT TTGATAACAC TACCGAATCA
CTCGGTGAAC TAACTGACTA CGATAAAATG TATTACGAAA CTATAGTGAG AAATGCCACT
ACCACGGCTC TCACACAATC CTATGGGTTC TTGAATGGAG TCTTGAATAC TGATAATACT
TCTATTCTTG GCTTGACAAT GGACTTTGGT CCTTTCTCTA TCATGGACAA GTACAGTCCA
ACGTACACTC CCAATTCAGA AGACCACGAA CAGAGATACG GGTACCGTAA TACTCCTACG
GCAATCTGGT GGAACTTAAC CAGATTAGGT GAAGACTTGG CTGAATTGAT AGGTGCCGGT
TCCAAATTGT TATCTGATCC TAAATTTGAA AGAGGCGAAA TAGATAAGGA TTGGGAAGAT
GCAATTATCA AGAGAGCTAC TAAAATAATA GAAATAGGTG GAGATGTATA CCAATACGCA
TTTACCAAGA AGTATGTGGA AACTTTCTTT GCCCGTTTGG GTATATCGCC AAAGATAATA
GACTACACAA ATATTGATAA GCACAACGTC GAGTTGATTG CACCCTTGCT TGAAGTGCTA
TACAAAGTCA AATGTGACTA CAATAAGTTT TTCTTAATAT TGCAGGACCA GAAATTTGAT
GCTGAGAACT ATAACCCCGA CGCAATTGCC GATAATATTT TGGCTCCTTC TTATGACGAG
AATGATAACA GATACTCTAA AAAGGAATTG ACCGATGAAA TTAAAAGCTG GTTAGGGGTA
TACCGTGCAC ATTTGGAAGA GTCTCGGGCA ATAGATCCTA CCTTCTCCCG CTTAGAAAGC
AAGAAGTATA ATCCTGTGTT CTTGCCCCGC AACTGGATTC TCGACCAGGT TATTGCCCAT
GTTCAAGATT CGGGTGCTTA CGACTTGTCC TACTTGAAAA AGTTAGAGAG GATGAGTTTC
TATCCATTTG ATTCCACTAA ATGGGGTGAT GACTTGAAAG AGTTGGAACA ATCATGGTTG
CTTCAGGGAG ACAAAGGAGA AGATTATTCC ATGCTACAAT GCAGTTGTGC CAGTTAG
 
Protein sequence
MSKLSELPKT SSFSSYIEPD GKIASTEVAA KNEDGIINKP RILSSGGFSY SLPELRKEYR 
FLTANEAALN DLGLDPEQVN DKEFQELVSG EFYLMYKDTF QDKGYPFPYS QAYAGWQFGQ
FAGQLGDGRV VNLFEVPKAK VQSNNRHKYE VQLKGSGKTP YSRFADGKAV LRSSIREYII
SEHLNAIGIP TTRALSLTYL PATYAQRHAA EKCAIVSRFA ESWIRLGTFD LYRWRGDRSG
IRKLSDYVID ELFTVEGTKF CNFENLLREK SDFFDNTTES LGELTDYDKM YYETIVRNAT
TTALTQSYGF LNGVLNTDNT SILGLTMDFG PFSIMDKYSP TYTPNSEDHE QRYGYRNTPT
AIWWNLTRLG EDLAELIGAG SKLLSDPKFE RGEIDKDWED AIIKRATKII EIGGDVYQYA
FTKKYVETFF ARLGISPKII DYTNIDKHNV ELIAPLLEVL YKVKCDYNKF FLILQDQKFD
AENYNPDAIA DNILAPSYDE NDNRYSKKEL TDEIKSWLGV YRAHLEESRA IDPTFSRLES
KKYNPVFLPR NWILDQVIAH VQDSGAYDLS YLKKLERMSF YPFDSTKWGD DLKELEQSWL
LQGDKGEDYS MLQCSCAS
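
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the gene sequence maps onto the protein sequence under that table. It builds the codon table by hand instead of relying on a library, models only the CTG reassignment (not any differences in start codons), and uses illustrative names (`build_table`, `translate`) that are assumptions of this example.

```python
BASES = "TCAG"
# NCBI standard code (table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def build_table(aas):
    """Map each of the 64 codons to its one-letter amino acid."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    return dict(zip(codons, aas))


TABLE_1 = build_table(STANDARD_AAS)
# Table 12 differs from the standard code in one sense codon: CTG -> Ser.
TABLE_12 = dict(TABLE_1, CTG="S")


def translate(dna, table):
    """Translate DNA in frame 1, ignoring whitespace and any trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the gene sequence.
print(translate("ATGAGTAAAT TATCGGAATT GCCAAAGACT", TABLE_12))  # MSKLSELPKT

# Bases 661-675 of the gene sequence, which contain a CTG codon.
print(translate("GAGCTGTGGA TCAGG", TABLE_12))  # ESWIR (as in the protein)
print(translate("GAGCTGTGGA TCAGG", TABLE_1))   # ELWIR (standard code)
```

The second fragment shows why the table matters here: the listed protein reads ESWIR at that position, which is what table 12 gives, whereas the standard code would give ELWIR.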