Gene PICST_41120 details

Gene Information

Locus tag: PICST_41120
Symbol:
ID: 4837176
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2091234
End bp: 2094425
Gene Length: 3192 bp
Protein Length: 1039 aa
Translation table: 12
GC content: 42%
IMG OID: 640388491
Product: predicted protein
Protein accession: XP_001382629
Protein GI: 150863968
COG category:
COG ID:
TIGRFAM ID:
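
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and that the gene length counts a single stop codon; the record states neither convention, so both are assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2091234
end_bp = 2094425
gene_length_bp = 3192
protein_length_aa = 1039

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the coding portion is one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Whatever remains is non-coding, which fits the "Is gene spliced: Yes" field.
non_coding_bp = gene_length_bp - coding_bp

print(span_bp, coding_bp, non_coding_bp)
```

Under these assumptions the span equals the reported gene length, and 72 bp of the gene is left over as non-coding sequence.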


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.599383
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAAGACTC TCACTGCCTC TAATGTATGG ACGCAGGACG AGATCGAGCA TTTCCATACG 
TGCATAGTTC ACAACGCTAC ACGGACTGGG ATGGGCCACC TTATCGAGCA AAAGAGCATA
GCTGAGCTCA ACAATGCTGT AACCCAGCTT ATGTCACGTT TCTACTGGAC GAAGGGCGAG
ATGACCTTTC TTGAAAACCA CTTGGGAGCA GGCGTAGCAG AACTTTGTCA CGAGATCCCA
CTTAGAGCTA AACATGGGAT CGTCCAGATG GTTTGGAAAG TGAAAAAGAG TCGTAATCAG
GTAGCACTGG CCAAAAAAGC CAAACCCAAC CCCTCCACCA AGAAGTCCGT TAGGGACTTG
GAAGTCACTA TAGCCGAAAA AAAAGTTGTG AATTTGACTG ACTCTTCTAT GGAGGACACG
ACAGAAGATG ACAATTTAGC GTATTGTATT CAACACGATT TGTCCAGAGA AACACTCGAG
GCTCTCTTCC CCGACTGGTC GTTGTCCGAG ACACTAGACA GAATCGTATT CCAGGCCGGT
CCATTGGATT CCATAGCTCT TACAACTGGT GAAAAACAGA TTATCAAAGA CTCCATAAAG
AAACAAAAGT CGTTGGAATC AGTACAGTTC AACTTCCCGT GCCGTTCCAA AGACTACCTA
GTAGAAAAGT TTAAGGAATT TGAATTTGTC TCTTCGCGTA AGACCAAGTT CAAGTCGTTG
TCAGAGAGAT TGCTTTACGA AGCAAAATGG GTTCTTTATT CTAGTGGGGA AACTTTTACA
ACTACTAGAC GGTCCCGTAA GCGGGCTCTT GAAGAGTCCT TTGAAACCAT GGAAAAGGAA
GCCCTAGTGT CGATCTACAC CAAGCCTGCT CCTAAGCAGG AAATGACTCC CGAAGAGTTG
GAAGAAAGAG AACGTCGTAG AGAAGCCTTG AAGGAACTGA GAAGACTCGC GAATGAAAGA
AAAGCTCTTC TCAGAAAACA AAGGAAGGAA GACTTGGAGA GACGTAAAGC GGCTGGTTTA
ATCAAAGCAA AACCAAAATC TGACATTACC CATTCGATTA AAGATCTCTT GGCTGGATCC
GAGCACTTTC AATCTGTTAT TGGCGATAAA AAGAAAGTTG AGGAGGGACA AAAAAGAAAA
CGTATACAAG CAGTTCACTA CCGTCCTGAA ATCGAACCCA AGAAGCCTTC TAAACTAAAG
GTCAGACACA GACAAGCCGA AAAAAGCAAA ATCAAACTAG CCTTAAAATT GAAAAAACAA
CAAGAACACA ACAAGAGAAA ACCCAAAAAA GAACCAAAGA AAAAGAAGAA GACTGTCGAA
GAAGAGTCCT TAGCTACACC GGAAACGGAG GAATTCATTA AAGAAGAGAT CGACGAAGAC
GAAGAGGAAG AGGAAGAAGA GGACACCTAT AGTCCATTCG ATCCTGTTGA TCTCAATTCA
GATTCGTTTG TGCCTCTTCA TGGAAGACAA TTCTACGCTG AAGAAATATA CGTAGATAAG
CCCCATGTCC CTGAGTTGAA GTTTGTTGAA GTTACAGAAG AGTCTGAAAA TCTGCTCATC
AGCAGTTCTA CTTTGACAGA AACAAAAAGA ATTATGACCA CAAACGACGA TGACATAGTC
TACGAAGACT GCTTGGCAGC AGATATCATC CGTTCTCACA TCAAGAACTA CCGTGATTTA
CCTATATCGT TCCCACCTTT GCTTGACCCT TCTAGCCTGG AAAGGAAGAT ATATCCTACG
AATAAGGTCA GAATCAGATT CTTATTGTAT CCTCAACACT GTGAACTGTT TATTCTTGCA
GCTCCTAAAA CCAACGAACT AGATCCTGTC TATGAGATTA TCAAATTATT CATGATCCAT
TACGCCTTGT TCTTTTCGCA TTCCTCCGAG ATCAGAAGAA TCATCACCGA AGAGTATTGT
CAGAAAATTG AGCATTCTAT CGAAGAGAAT GACTTTTCAG ATTTCATGTT CGTCGTTGAT
AAATGGAACG CTTTAATGCT TAAATTATCA CCCAATGAAG AGGCAGTCCA ATCTATTGTC
CAGAGTGGAA AAGAGGACAT AAATGCTGGT CTTAGATCGT ATCTTAGCGA ACAAGAGATT
CGCGTACCAA CTAGTGAAGA CTTGAAGTTG CAGGCTTTTC TTGAAGCTGT CATTCTTGAA
GATCTTAGTC CTACATTCCA ACTAATCAAA GAAGAACCGA GTGATGCCGA GAAACAGGAA
TTTGTTCCAA AACATCTTGG TGACGTTGAA GCCCCAAAGA ATGTGAGTGA CGAATTGAAG
GATATGAAAC CTGATGATTA TAATTTGATT TTCTTCACTA GGCTCAAAGA GAAGACTGAG
ATTTCTAGAT TTGCGACTCA ACAAATTCTT TTGCGGATTT ATTCGAGAAT CGTTTCGACG
GATTCACGAA AGTTGAGATC ATACAAGGCA TTCACAGCTG AAGTATATGG GGAACTTTTA
CCATCATTTA CCAGTGAAGT TCTTGAAAAG GTTAATTTAC TTCCAAACCA GAAGTTCTAC
GATTTAGGCT CTGGTGTCGG GAATACTACA TTTCAAGCAG CCTTAGAATT TGGTGCTTCT
ATGAGCGGAG GATGTGAGCT TATGGAACAT GCTTCAAAGT TAACTAAACT TCAAGAGGGG
TTATTGCAAA AGCATATGGC CGTGCTTGGT TTGAAGAAGT TAAACTTCAA CTTTGCCTTG
CTGCAAAGTT TTGTTGATAA CGATCCCGTA AGAGATGCTG CTCTGGACTG TGAAGTGCTC
ATCATTAACA ACTACCTCTT TGACGGCAAT TTGAATGCAG AAGTTGGGAG ACTTTTATGT
GGTCTCAAGC CAGGAACCAA GATCATTAGT TTGAGAAACT TTATCAGTCC TAGATACAGA
GCAACTGGGG ATACTATTTT TGATTTCTTC AAAGTTGAAA AGCACGAAAT GAGTGATTTC
TTGTCAGTCA GTTGGACGGC AAACAAGGTT CCATACTACA TTTCCACTGT CCAGGATAGA
ATCTTGCCAG AATACTTAGG CAAAGATGAA TCTCCTGATA GTGATAGATC CACACCTACA
CTCAAATCAG AAAATGGCAG TTCCGAAAAC CTCGCTGGTA GCCTAACCCC TTTTACTGCA
ACACCTGAAC CTGATATGTT CAAGGACCTA TCTTGTGTAT TAGGAGATGA AGACGACATT
CTTCTCCACT AA
 
Protein sequence
VKTLTASNVW TQDEIEHFHT CIVHNATRTG MGHLIEQKSI AELNNAVTQL MSRFYWTKGE 
MTFLENHLGA GVAELCHEIP LRAKHGIVQM KSVRDLEVTI AEKKVVNLTD SSMEDTTEDD
NLAYCIQHDL SRETLEALFP DWSLSETLDR IVFQAGPLDS IALTTGEKQI IKDSIKKQKS
LESVQFNFPC RSKDYLVEKF KEFEFVSSRK TKFKSLSERL LYEAKWVLYS SGETFTTTRR
SRKRALEESF ETMEKEALVS IYTKPAPKQE MTPEELEERE RRREALKESR RLANERKALL
RKQRKEDLER RKAAGLIKAK PKSDITHSIK DLLAGSEHFQ SVIGDKKKVE EGQKRKRIQA
VHYRPEIEPK KPSKLKVRHR QAEKSKIKLA LKLKKQQEHN KRKPKKEPKK KKKTVEEESL
ATPETEEFIK EEIDEDEEEE EEEDTYSPFD PVDLNSDSFV PLHGRQFYAE EIYVDKPHVP
ELKFVEVTEE SENSLISSST LTETKRIMTT NDDDIVYEDC LAADIIRSHI KNYRDLPISF
PPLLDPSSSE RKIYPTNKVR IRFLLYPQHC ESFILAAPKT NELDPVYEII KLFMIHYALF
FSHSSEIRRI ITEEYCQKIE HSIEENDFSD FMFVVDKWNA LMLKLSPNEE AVQSIVQSGK
EDINAGLRSY LSEQEIRVPT SEDLKLQAFL EAVILEDLSP TFQLIKEEPS DAEKQEFVPK
HLGDVEAPKN VSDELKDMKP DDYNLIFFTR LKEKTEISRF ATQQILLRIY SRIVSTDSRK
LRSYKAFTAE VYGELLPSFT SEVLEKVNLL PNQKFYDLGS GVGNTTFQAA LEFGASMSGG
CELMEHASKL TKLQEGLLQK HMAVLGLKKL NFNFALSQSF VDNDPVRDAA SDCEVLIINN
YLFDGNLNAE VGRLLCGLKP GTKIISLRNF ISPRYRATGD TIFDFFKVEK HEMSDFLSVS
WTANKVPYYI STVQDRILPE YLGKDESPDS DRSTPTLKSE NGSSENLAGS LTPFTATPEP
DMFKDLSCVL GDEDDILLH
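
The record lists translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. As a minimal sketch of how the protein sequence relates to the gene sequence, the snippet below builds the standard codon table in NCBI order, applies that single CTG change, and translates two short fragments copied from the gene sequence above. The helper name `translate` is hypothetical, the table is hand-built rather than taken from any library, and start codons are translated literally (which matches the leading V shown in the protein sequence).

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
STANDARD_AA = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
STANDARD = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), STANDARD_AA)}

# Translation table 12 differs from the standard table in one sense codon: CTG -> Ser.
TABLE_12 = dict(STANDARD)
TABLE_12["CTG"] = "S"


def translate(dna, table):
    """Translate an in-frame DNA string codon by codon (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    codons = (dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3))
    return "".join(table[c] for c in codons)


# First 30 bases of the gene sequence.
head = "GTGAAGACTC TCACTGCCTC TAATGTATGG"
# In-frame fragment of the gene sequence that contains a CTG codon.
ctg_fragment = "TCTAGCCTGGAAAGG"

print(translate(head, TABLE_12))
print(translate(ctg_fragment, TABLE_12))
print(translate(ctg_fragment, STANDARD))
```

The first fragment reproduces the start of the listed protein sequence. The second shows the effect of the table choice: the CTG codon yields S under table 12, as in the listed protein, and L under the standard code.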