Gene PICST_83076 details


Gene Information

Locus tag: PICST_83076
Symbol:
ID: 4839045
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 863611
End bp: 865467
Gene Length: 1857 bp
Protein Length: 470 aa
Translation table: 12
GC content: 40%
IMG OID: 640390360
Product: predicted protein
Protein accession: XP_001384467
Protein GI: 150865310
COG category:
COG ID:
TIGRFAM ID:
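
The Start bp, End bp, and Gene Length values above agree with each other if the coordinates are read as 1-based and inclusive. The following is a minimal sketch of that check, assuming that convention; the function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive on both ends.
START_BP = 863611
END_BP = 865467
PROTEIN_LENGTH_AA = 470


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


def min_coding_bases(protein_length: int) -> int:
    """Bases needed to encode the protein plus one stop codon."""
    return 3 * (protein_length + 1)


gene_length = span_length(START_BP, END_BP)
coding = min_coding_bases(PROTEIN_LENGTH_AA)

print(gene_length)           # 1857
print(coding)                # 1413
print(gene_length - coding)  # 444
```

Under these assumptions, 444 bp of the listed span do not code for the 470 aa protein. That would be consistent with the gene being marked as spliced, although the record does not say how the remainder divides between introns and flanking sequence.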


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.569663
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
TTTTCTTTTT ATATTTGATT TACTGAAGTG GTGGACATAC TATTTGTGAA CAAAATCTAC 
CAATTTAAAA ATCGGCTTTC CTTATAAATT TTTGTCTTGC CTATGACATT TGGCATTCAA
TAAAAGTGAA GAAGAGGAAG ACCAAGGAAT ATTGAAATAC AACAACTTAA ACAAGTAGAT
AAGAGATACG AAAAGTGCAA GGATAATAGC AAGATAACCT TCAGAGATGT CAGAACTATC
AAGCAGCAAC TCTTCGGATG GTTCTGAGTC ACAGAGCATA AATGATGTCA CCAGTGTCAA
TGTAAAGTAT GATGATTCTG TAGAAATCAA GTCTTATGCT ACTTCGTTTG ATTTATTGAC
TCTGGAGTCT AGACCTGATC TCAGAAATAC CAAATATAAA TCTTCTACCG ATCATTTGAA
TTGTCCCATT TGTCAGCAAC CTTTTATGGA GCCTTTGACT ACGATCTGTG GCCATACTTT
CTGTAAAGAA TGTATCTATG AATGTTTGAA AATGGCGAAA AGCAACCAGC AGAGCTCTGG
CAGCGATAGT TTGTCAGGGT ATTGTCCACT TGACCGGACG CCAATTGACT CCGCCAACAT
AAATGATTTG TTTCCTACTC CATTGTTGAT TTCCAACTTG ATTGACGACT TGAAAGTGTA
CTGCTTGAAT CACGAAAGAG GCTGCGAATG GGATGGAAGT CGCTGGGAGC TCGAGCGTCA
TGTTTTGATA GATTGTGGAT TTACAGGAGT GAAATGCGGA GGTGTCAGAT ACGAGAATAG
TGATGTTCGA CAAGAATCTA AAGAATCTGT GCCAGAAAAT GCTGTTTGCC AGTTGCTTAT
AGAGAGAAGG TTTGCTGACG AAAACCATGG CTGTTCCCAC CAGGTATTTC AGTGCAATTT
CTGTAATCAG GAACTCACCA AGATGACTGA AAGTGACCAT TTGGAAAATG AGTGTTTGTT
CAATTATCAG ACATGCGAAT TGTGTTCCAA CGATATGATT CCCTTGAAGA ACTTGAGCAA
ACACCAGGAG AATTGCTCCA AGATTGGTAT GGTCAAATGT CCTGCTCACG AGATAGGATG
CAAGTGGGTT GGATCCAATG AAACTTCGCT AGAGATTCAT CAACAGGGCA ACAACTGTCA
GCTTAGCCAT TTCTTACCTT ACTATCACAA GATAAACGAC AAGGTGGATC TGCTTACAGA
GGAGAACAGG TTCTTACAGA AACAAATCAA CAAGATCTTG GACTCAATCG TTCAAGGAAA
GATTACTAAT TTGGGCTACA ACGAGTCTAT CGAGGAGATC AACAAGTTCA AGACAATCGA
AGACCAGGAC AAGCTCTTGT ACCTCAACTT TGAGATTGAT AGGTTGAAAT TTGAGTTTAA
CGAGAAGATC ATGCCGTTCA TCAATAAGCA CACCATGAAT GAACAGGAAA CTGTGATCAA
CAATTTGACT CACGACAACT TCATGATGAA AGAAGACTTA AATTTGCAGA GGGTGTTAAT
CAACAGCTTG AGAAAACAGT TGCAATTCCT TTTGTTCTCG CGCAACAGTG CCAGAACCGG
GGCATTTGGT ACAGGCGGCA TGGTGGGCTC GATGGGAGCA GCTCCTAATG TTCTTCTTAT
GGATGACGTT GCCAACGAAC TTCTTGAAGC ATCTTCACGG AGCAGTTCCG AGGAGCGGTT
GAACTTGAAA TTGTAGCACT AGCAAACAGA TCAAGAGAAT GCAATCAGAA AAGGAAAGTG
TTGCGATTTG GATTTTTATG ATTTACGAGA GAAGATTACT AGCTGATAAC GAAAATGACA
TTACTGACGT TTCTGAGTTA GCGACGACAT ATAGATAAAA ATTTAATAGA ATGGATT
 
Protein sequence
MSELSSSNSS DGSESQSIND YDDSVEIKSY ATSFDLLTSE SRPDLRNTKY KSSTDHLNCP 
ICQQPFMEPL TTICGHTFCK ECIYECLKMA KSNQQSSGSD SLSGYCPLDR TPIDSANIND
LFPTPLLISN LIDDLKVYCL NHERGCEWDG SRWELERHVL IDCGFTGVKC GGVRYENKNA
VCQLLIERRF ADENHGCSHQ VFQCNFCNQE LTKMTESDHL ENECLFNYQT CELCSNDMIP
LKNLSKHQEN CSKIGMVKCP AHEIGCKWVG SNETSLEIHQ QGNNCQLSHF LPYYHKINDK
VDSLTEENRF LQKQINKILD SIVQGKITNL GYNESIEEIN KFKTIEDQDK LLYLNFEIDR
LKFEFNEKIM PFINKHTMNE QETVINNLTH DNFMMKEDLN LQRVLINSLR KQLQFLLFSR
NSARTGAFGT GGMVGSMGAA PNVLLMDDVA NELLEASSRS SSEERLNLKL
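
Both sequences are printed in space-separated blocks of 10 residues, 60 per line, so the whitespace has to be removed before they can be measured or compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch of that cleanup; the helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C among the bases of a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "TTTTCTTTTT ATATTTGATT TACTGAAGTG GTGGACATAC TATTTGTGAA CAAAATCTAC"
dna = clean_sequence(first_line)

print(len(dna))  # 60
print(round(gc_percent(dna), 1))
```

Passing all lines of the gene sequence through `clean_sequence` gives a string whose length and GC percentage can be checked against the values reported in the Gene Information section.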