Gene PICST_32968 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32968 
Symbol 
ID4840178 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1101545 
End bp1103935 
Gene Length2391 bp 
Protein Length796 aa 
Translation table12 
GC content38% 
IMG OID640391493 
Productpredicted protein 
Protein accessionXP_001385565 
Protein GI150866087 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACAT ATAATCCTAG GAAGAAAGCA GACAGTGCTT CCAACGACCT ACCCACGACA 
GGGCCACCAT CTCATCGTGC TATATACAAT GCAATTCATC TACCGACAAC ATTCAATCTA
TTGCGATACA AACATTTCAG TTCCAAGTCA TTTTCTAGTC GGCCGGAAAA CCCCACCAGA
TCAGGACTTC CGTCAGATCC AGCCTCAAAT TCATCGAATA CTTCACTGAA CTCAAATTCG
AGTCTGCAAT TTGGGTCCAG CAAGTATGCC CGATCGATGG ATCTAATGGA GAAGCAATTA
CTTAATCGGT TGAATTTCAG AACTTTTAAT TCAGACTCAA TTCTTGTAAT TAATGCTGTA
CTAGCCCTCA CAGCTTACTA CATGCGCTCA AACTGCTTAA AAGCATTCAG TAATTTTGGA
TCTAATGATC CACAACAAAT TCAAAATTAC CAGCAATTTA GAAACTTTCT CATCAAGATA
TCTGCAAAGT ACTACGGCAT TGCCATCCAT AAGCTAAGAA CAATGCTTTC CAGAAAGGAT
TATAATGTAA CCATCGCGAT CATTGTGCTG TCTTTTATGA ACAAAATATC CATCTATGAA
GATGCTGACT TGAACCAATC AGTTACATTT TCAAAAGGTG TCATAGGCAT ATTCAATGAC
ATCTTGTCTA ACTTCAAACA GTCAAACGCT CGATTTCTCA GATTGTTTAA TATATCAAAT
GAATCGTCAT ATACCAGTGC GAACGACGTA TCTTGGATTA TCAATTTCCT AGTGTTTGCT
TCGAAGTCGA TATTTTTCCC TACATATGAC CCAACCATCC TATTGGAATA TAACCAAACA
TTAATTGAAT ACAGAGACAT ACTTCATGAA CTAAATAGAC AGGGAATACA ACCGTCCCAA
CATGTCATAT TTAATTTCAA CCACCTCTTC AGTTACACCC AACAATTACT CTTGTTTATT
CAATCGCATA ACACTAATAA CATCAACCAA TATCCAATTT TACTCTACAA GTTGTTGCGG
CATTGGTTGA TTATTCTCCC ATCAAAAGTT CATGTGCTCG AGTATCTATC CGATCCGTTG
GAAAAGACCT TGATGTACTT ATTCCGTACT TTAACAAAAA TCCTTGATAA CCTTTTCCCG
TCTGTGATAT TCTACTTTTT ACATACATTC AGTGGAGGCT TGTCATTATG GTATGAACCT
AATTACGAAT TAGACCCAAA AATAGTAACC CCACAGGAAA TGTCATCATT TCCTACTCCC
ATTCAACATC ACTTGATTAG GATTACTATT TACAGTGTTC GAGTCTGTAC ATTTTTTCAG
AAGCGAAGTA ACATATTACT GATCTTTTTT GGGGACCAAA GATTGAAACA AGAATTGTCA
CCACAAATTA TCCAGGCTAA CATCAATGAA GTCATGATAA AAGCTTTCAA AAAAACAAAA
ATAAGGCTCT ATCATTACAT ACATTTGCCT AATGTAGTGC AGTTCACAAA GACGAATGTT
ATCTTTGATA ATATACAACA AAGATTCGGT GCCAAGATGC TCTACTACTA CAACCAAGGA
CATTACGACC ATTTCATGTC TAAAAATTCG GACAAATCGT ATAATAATGT GTTCAAGAAA
GACTCGATAG ATCCTATCAT AAACTTTCAT ATGAATAATA TATCTCAAAG CGGGTTTGAA
ACTGCCAACG ATAACAAGAG ATTTGGTAGT ATGGACGGAT CTAGCTCGTT CTCATCGTCC
ACTACTGCTT CACCGCCTGA TACTGGAAGA AGCGTGCCTG TTTCTCTTCC ATTTACAACC
ACTATGGGAA GTTCTGCACC GATAAACTAC TACACTATAT TCGAAGATAA TTTTTCTCTG
ATATTGACGA AGAAACTTAG TCAAGAATAT GAAGATTTAT TTCGTTCTGG CGATAATTCG
CTATTTGTGA ATCCAGACAG TGCACCATTC CTCGATTTGC AGCTCAATCC TCAAAACAGG
TTATTTTTCG GTGACAACGA TCCATTACGA ATTGCCGAGA TCAGCACTGA CATCATTTTC
AAAAGACGAC TTCAGAATTT GTTCACTAAT ACAGAAATTT ACAATCTCAT AGTCAACTTG
CACAACACGA GGAATAGAAG ACTAAGTGCA ATAATGAGCG AACCGGCTTC CACCATAAGT
ATCTCTCGAC AACCTTCTGT GACGTCCACC AATGTCCAAT TAATGGAATC CATTCATCAT
GAGATTGAGG TGGAACATAG AGAAAACATC AAAAATGTCA GGAAACGCAG TGGCTCCAAA
TCAAAAAAGA AGATATCTCC TGAAGAGAGT CCCGTCGAAA AAGTCCAAGA GAAGTTGAGC
GACGTGGACA ATTTCTACGA CGACTTGAAT GAAAAGTACT TTGCACTTTA G
 
Protein sequence
MSTYNPRKKA DSASNDLPTT GPPSHRAIYN AIHLPTTFNL LRYKHFSSKS FSSRPENPTR 
SGLPSDPASN SSNTSSNSNS SSQFGSSKYA RSMDLMEKQL LNRLNFRTFN SDSILVINAV
LALTAYYMRS NCLKAFSNFG SNDPQQIQNY QQFRNFLIKI SAKYYGIAIH KLRTMLSRKD
YNVTIAIIVS SFMNKISIYE DADLNQSVTF SKGVIGIFND ILSNFKQSNA RFLRLFNISN
ESSYTSANDV SWIINFLVFA SKSIFFPTYD PTILLEYNQT LIEYRDILHE LNRQGIQPSQ
HVIFNFNHLF SYTQQLLLFI QSHNTNNINQ YPILLYKLLR HWLIILPSKV HVLEYLSDPL
EKTLMYLFRT LTKILDNLFP SVIFYFLHTF SGGLSLWYEP NYELDPKIVT PQEMSSFPTP
IQHHLIRITI YSVRVCTFFQ KRSNILSIFF GDQRLKQELS PQIIQANINE VMIKAFKKTK
IRLYHYIHLP NVVQFTKTNV IFDNIQQRFG AKMLYYYNQG HYDHFMSKNS DKSYNNVFKK
DSIDPIINFH MNNISQSGFE TANDNKRFGS MDGSSSFSSS TTASPPDTGR SVPVSLPFTT
TMGSSAPINY YTIFEDNFSS ILTKKLSQEY EDLFRSGDNS LFVNPDSAPF LDLQLNPQNR
LFFGDNDPLR IAEISTDIIF KRRLQNLFTN TEIYNLIVNL HNTRNRRLSA IMSEPASTIS
ISRQPSVTST NVQLMESIHH EIEVEHRENI KNVRKRSGSK SKKKISPEES PVEKVQEKLS
DVDNFYDDLN EKYFAL