Gene PICST_47138 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47138 
Symbol 
ID4839430 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1450419 
End bp1451894 
Gene Length1476 bp 
Protein Length467 aa 
Translation table12 
GC content42% 
IMG OID640390745 
Productconserved hypothetical protein 
Protein accessionXP_001385280 
Protein GI150865886 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.440722 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.19959 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCTTC CGTCGAAATA TGCCCCGGAG GGTCTCAGTA GATATGACCA ACATCGACCA 
ATAACGTCCG GCAGAAAACG GCCTCAAAAG CCGATAAATT ATACGGTTCG TGTTATCGAA
GCCAGAATTA AAGATGCTGT GGAGAACCGC ACCTACAATA ACTCAGACTA TGGAGATGGC
GATATAGAAA TGATTCCTAT GGACGACTAT AAGGAAGTAG AGGTGATCTC CGAGTTTGTA
GAATTCTGTC GAGATGCAAA GTTAATATCT GGCAACAGAA CGCTGGAGAC AGACGCGATG
GAAGATATTG TAGAGAATAC TAGTGGCTTT GAGCTTGCAG AAGGTAACAT TTCATATCTT
GTAATAGATA CCAATTTTCT TTTGTCCCAT TTAAACATCT TGGATGAGAT TAAAAACATC
GCTGACAAAT ATGAGCTCAA GCTAGTAGTG CCTATCACTG TGATTCAGGA ATTGGATGGA
CTCAAGAACT CCAATAGAAC AAGTCTCGTG AGCAGTAGCA CTAGCGGTGA GCTCGAAGAC
AGAATATCGG GTAAGTCTAT AGGACATTTG GCTCGATGGG CTACTGACTG GATCTACTCG
TGTCTTTCCA AGAACAGCGG TGTAGTCAAG GGCCAGAAAT TGAGAGAACG GCTCAATAAA
GATGCTGTGA AAGACGATGC CATCCTCGAT TGTGCCTTGT ATTTGAAGGA ATGCCATGCC
AATTCATTAA TCGTGCTCTT TTCCAACGAC AAGAACTTGT GCACCAAAGC TCTTGCAAAC
GGAGTTCTTA CTGTAAGCTA TAAGAAACAC ATGACTTCAG AACTTATAGC GAATGTGGTG
CATACCGAGA ATGTTAGTCG TTTTGGGAAA ATCGAAAAGC GTATCGTGGA AGTGGCTCCT
GCTATACAGT CAATGTCTTA CTCTAATTCA AATTTGCAGT CTAATCCGCA GTATTTGCCA
TCCCAATCGC TTTCGCAGTC TGATTCGCAA TCACATCTGC TTTCACATTC TCGTAAAAAT
AGCCATGTGC TTGTGGAGCA GAATGTCCGC CAGTTTTCCA GCTTTCATCA GATCGCAGAA
AAAGTATATA CCGAGATTCA GATGATAGCC CTATCTGCTA TCCATCAATG CATGGAGTCA
GTATTTAACG AAGATCTAGA TCTTCTTCAG GACTACGAAA AAGAAAAAGT GATCACCCTC
ACCGATTGTT CTAACGTTAT GATTCGGTTT TGGTTCACTG TATTTCAACC GTATTTCAAA
AAACTCCCAA ACAAGTTCAC TCCGTTCGAT GAATCAGGAA GGAATAAAAC TCCACTATAT
GTAGATTTAC CTAGAGATTC CTATGAATTG TTGCAATTCG TTCACTTCTG GACGAAGACC
TTGTCTACCA TATACGCTGC CGAAATGGAC GACTCCAAAA ATGAAGCATT GGACATCCTC
GTCCAACGAT GGGAAAGCAT GGCAAGCCTT TACTAA
 
Protein sequence
MSLPSKYAPE GLSRYDQHRP ITSGRKRPQK PINYTVRVIE ARIKDAVENR TYNNSDYGDG 
DIEMIPMDDY KEVEVISEFV EFCRDAKLIS GNRTSETDAM EDIVENTSGF ELAEGNISYL
VIDTNFLLSH LNILDEIKNI ADKYELKLVV PITVIQELDG LKNSNRTSLV SSSTSGELED
RISGKSIGHL ARWATDWIYS CLSKNSGVVK GQKLRERLNK DAVKDDAILD CALYLKECHA
NSLIVLFSND KNLCTKALAN GVLTVSYKKH MTSELIANVV HTENVSRFGK IEKRIVEVAP
AIQSMSYSNS NLQSNPHHVL VEQNVRQFSS FHQIAEKVYT EIQMIALSAI HQCMESVFNE
DLDLLQDYEK EKVITLTDCS NVMIRFWFTV FQPYFKKLPN KFTPFDESGR NKTPLYVDLP
RDSYELLQFV HFWTKTLSTI YAAEMDDSKN EALDILVQRW ESMASLY