Gene PICST_34689 details

Gene Information

Locus tag: PICST_34689
Symbol:
ID: 4851933
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3228746
End bp: 3230689
Gene Length: 1944 bp
Protein Length: 574 aa
Translation table:
GC content: 40%
IMG OID: 640393641
Product: predicted protein
Protein accession: XP_001386942
Protein GI: 126276063
COG category:
COG ID:
TIGRFAM ID:
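
The coordinates and lengths in this record are mutually consistent, which a short check can make explicit. The sketch below assumes 1-based inclusive coordinates and that the gene span runs from the start codon through the stop codon with no untranslated regions; under those assumptions, the difference between the gene length and the coding length would be the total intron length. Variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3228746
end_bp = 3230689
protein_length_aa = 574

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene span is start codon through stop codon (no UTRs),
# so the coding sequence is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Whatever is left over would be intronic, consistent with the record
# marking the gene as spliced.
intron_total_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, intron_total_bp)  # 1944 1725 219
```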


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000996882
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATGTGA CTAGTAACGC CTACTACTCT CCACAATACC TAGATAAGCA GAAGCGATCC 
ACTCGAAGTC AATACAGTGG GTATTTTCGG GGCTCGCGGG ACAATTCGTA CAATGACTTT
ATTAATCAAG CCCAAATTGA TGATTGGAAT GACAGGGTTT CACTGGACAG AATTTCTGGC
TACAAACCAG TTTCTGAAGG CAGCTCTAGC AGAAACAGAA ATCGTCAAGC AACAGCAATA
GATTCCATTA GAGAACCCGA AGGCTCTGTC AAAAGTTCCA CTCGAGTTTC CAAAGCTCCT
GTGCTCAATA GAGCCAATTC TACTCCTCTT TCTGTAGTGC AATCTACAAT TGCGGCAGAC
GATTCCATAG AATCTGATCT TGAAGATGAA GAAAGCTATG ATCCAGACGC ATTGGTACAT
GAAATCACAC CGGCTCAGCT CATCATACAA CAACTGAATA CAGTAACTCC TCACAGAGTT
GAGAGAAGAG GTGCTTACAA ACCTCGAGAA CAAACCCCTG AGATTAAAGG CTATGATGGG
TACTACATAA ACCAGAAGAA AAGATATGAA GAGTCTCCCT ATAAGCCCAA ATTGTACACA
CACAAGACAT TTCGTGATGT GTTTAACGAT AAGGAGGAAA GTACAGATAA GTACAACCCT
ATGGAATTTG TATTTCCAGA AGGAGAGAAG AAGAGCAAAT TTGCCAAGAA CGTACAATTT
GTTCTAGGAA AGGACAACTA TGACGAATAT AATTATTACG ACCACAACAA GGGAGAAAAG
AAGAAAAAGA GGAAGAAGAA AATCAATGAA CCGACTGAAG TTTTTGTTAA AGAACTTAGC
GATGACGACG AAGAAGATTA TACTGAAAAT GGCCCTAAAA TATATTTAAC TGAGGAAGAA
CAGCAGAAAG CTAAAAAGAA CAGGAAATTC ACTAAGGTGT TCAAGTCTAA AATGAAGAGA
GCTAGAAAGG AATTGGGCAA AGATTTTGTG AACAATGCGA TAAAGCAGCA GGAACTTGAG
CTGAGACGAA AGGAAGAGAA ACTGGAAAAG AAAAGAGCTG AAGAAGAAGA CAAACAAATA
GCTTTGAAGG AAGAGGCCGA GAGAATACGT GCATTAGAAG AAGAGCAAAG GAGAATAGGT
CAAAATCCAG AGTTTCATCC AATTTGGAAT TATATATTGT CGTGGTTGGT ATACGATGCT
TCAATATCCA AGACACCAGC TGTTGATTCT CATATTGAAG AACTCGACTC TTATGAAATC
CACGAAAAGG CTGACGAAGA ACCTGTGGAG AACAAAGAAC AGCAGGAAAC ATCTAAGTCC
AAGAAGTTAA TTATTTCCTC TAAGAATTTT AAGAACATAA AGAAGAACTA CCTCAACTTG
GTGCACAAAT GGAACGAACC TGTCTCACAT GTTTTTAACG AGCCTCCTCC TCCATACCCA
ACGTCTCGAT CCATAAAAGC AAAAACGCTA AGGTCCTTCG AATCTTCAGC TTTTGATGAA
GGTGATGGTG ACTCAAAAGA GTTCGTTATT GAATACGACG ACGATGGAAC TGAAATCACA
CAAGAGTTAT ACTACAATCC TGTAACCAAA CAGCTCGAAG CCACACCACC AACGTCATAT
TCGTCATTGG ACCCTTCAGC CAAATCTGTA AGCTCCTCTA TGATGGGCTA TGGCATTGAT
ACTACAGGAA GCGCTGTAGC CATTATCTCC AACATCAATG CTTTGATCAA GAGCATCAAG
ATAATGAAAA TCCTTTTCGC ACCCATCGAT GTAGTCTCAG AATATTTCCC CAATCTCCAG
ACGATTGTCA TCTTGGTGGA GTTGGTGATT TTTGTGTGGA TCTTGTACGA AGTCAGCCTC
TTAATCGATG CCTTATGTAT GATGGTCAAG GCTGTATGTG CGCCCATGAT AGCTATGGGA
AGGTTTATGA ACAGAATAAT GTAA
 
Protein sequence
MNVTSNAYYS PQYLDKQKRS TRSQYSGEPE GSVKSSTRVS KAPVLNRANS TPLSVVQSTI 
AADDSIESDL EDEESYDPDA LVHEITPAQL IIQQLNTVTP HRVERRGAYK PREQTPEIKG
YDGYYINQKK RYEESPYKPK LYTHKTFRDV FNDKEESTDK YNPMEFVFPE GEKKSKFAKN
VQFVLGKDNY DEYNYYDHNK GEKKKKRKKK INEPTEVFVK ELSDDDEEDY TENGPKIYLT
EEEQQKAKKN RKFTKVFKSK MKRARKELGK DFVNNAIKQQ ELELRRKEEK LEKKRAEEED
KQIALKEEAE RIRALEEEQR RIGQNPEFHP IWNYILSWLV YDASISKTPA ADEEPVENKE
QQETSKSKKL IISSKNFKNI KKNYLNLVHK WNEPVSHVFN EPPPPYPTSR SIKAKTLRSF
ESSAFDEGDG DSKEFVIEYD DDGTEITQEL YYNPVTKQLE ATPPTSYSSL DPSAKSVSSS
MMGYGIDTTG SAVAIISNIN ALIKSIKIMK ILFAPIDVVS EYFPNLQTIV ILVELVIFVW
ILYEVSLLID ALCMMVKAVC APMIAMGRFM NRIM
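
The sequences above are printed in blocks of ten characters, so they need to be joined before any length or composition check. A minimal sketch follows, using hypothetical helper names `join_blocks` and `gc_fraction` and showing only the first two blocks of the gene sequence:

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, copied from the listing above.
fragment = join_blocks("ATGAATGTGA CTAGTAACGC")
print(len(fragment))          # 20
print(gc_fraction(fragment))  # 0.4
```

Applied to the full gene and protein listings, the same helpers would let a reader compare the joined lengths with the stated 1944 bp and 574 aa, and the GC fraction with the stated 40%.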