Gene PICST_36445 details


Gene Information

Locus tag: PICST_36445
Symbol:
ID: 4839731
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 1510779
End bp: 1512203
Gene Length: 1425 bp
Protein Length: 474 aa
Translation table: 12
GC content: 44%
IMG OID: 640391046
Product: predicted protein
Protein accession: XP_001384979
Protein GI: 150865666
COG category:
COG ID:
TIGRFAM ID:
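
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes inclusive 1-based coordinates and an unspliced CDS whose final codon is a stop codon (the record lists the gene as not spliced).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS: codons minus the stop codon."""
    return gene_length // 3 - 1


# Values taken from the Gene Information fields of this record.
length = gene_length_bp(1510779, 1512203)
print(length)                     # gene length in bp
print(protein_length_aa(length))  # protein length in aa
```

With the values from this record, the two results agree with the listed Gene Length (1425 bp) and Protein Length (474 aa).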


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0311033
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.112227
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATTTGG TCAACGATGC CATTGACGAA ATTGGTTTCA CTCCATATCA TTTGAAACTC 
TTCTTCCTTA ATGGTATGGG TTACTGGACC GATACTCAAT TGACATACCT TGAAAGTTCA
GTGAGAACCT TTGTCAATTA CCAATTTGGC TACACCTATG CTGTGTCCAA CGAGATGTTG
GCTGCTGGTC TTTTGGTCGG TGCCATTTTC TGGGGGTTTT CTGCTGATTT GATCGGAAGA
AAGATAGCTT TCAACCTTTC GTTGTTACTT TCGGCTGTCT TCACAATCAT CACTGGTACC
ATGGGAACCA TGGCTTCATA CTGTATCTTT GTTTTCTTGC TGTGTTTCGC TGCTGGAGGT
AACTTGGTAC TTGACACTTG TGTTTTCCTT GAATACTTGC CTCACAAACA CCAATGGCTT
TTGACATTTT TCGCCTTTTT CTGGGGTATT GGTCAAACCA TTGCTGTTTT GCTTGCATAC
GCTTTCTTGC CTAACAACTC ATGTTCATCC GCTGACGACT GTCCTTCTCA CAAGAACAGG
GGCTGGAGAT ATGTCTACTA TGTCAATGGA GCCATTGTGC TTGTCATGGC TATTTTGCGT
ATCACTGTCA TCAGATTAAA GGAGACGCCT AAGTTCTTGG TTTCCAATAA CAGAGATGCT
GAAGCAGTAG AAGTTTTGCA ATCGATTGCC CGTAAATACA ACCGTCAATG TTCTTTAACT
CTCGAACAAT TGAATGCTAT TGGAGAAGTC AAGTCCAGTG ACGATTACAG AAAGCACTTA
AACGTCAAGG GCACTTACAC TTTGGTTAAG CACCATCTCA CCATCTTGTT TGCCAACAGA
AAGACTGCCA GACTGACGAT CTTATTGTTT CTCTCTTGGT TCCTTCTTGG GTTTGCTTAT
CCCTTATACT CGTCTTTCTT GCCGGTATAC TTGGCTACGA GAGGTAATAA TATTTCTGCG
CCAGATGTAC ATGGAGTTTA CCGTGACAAT TTGATTAGTA ACGTGTCTTC CATGGGTGGT
CCATTCATTG CTGGAGCTTT GTTGTATTTC TTTCCGGCCT TGGGAAGAAG AGGAGTCTTG
TGTATAGGTG GTCTCGTCAG TATGGCCTTC CTCTTTGGCT ACACCCAGAT CAAGAACAGA
GCCCAAAATG TGGCTCTTTC GTCGACTTCA TTCCTTGCCA TCTATATCTA CTATGCTGTG
TTGTACGCTT ACACTCCGGA AGTGTTGCCC TCAGCAGCAA GAGGTACAGG TAATGCTCTC
AGTATTGCTT GTACTCGTGT AGCCAGTTTG GTTGTGCCAG TCATTGCTTA CTTCTCTGAC
ACTAGTTCTG CAGTTCCGAT CTGGATCTGT GGTGCGTTTG TTGGAGTGAT TGGTTTGATG
GCATTGTTGT TCCCATTCGA ACCAAGTAAG CACAGAGTTG TATAA
 
Protein sequence
MHLVNDAIDE IGFTPYHLKL FFLNGMGYWT DTQLTYLESS VRTFVNYQFG YTYAVSNEML 
AAGLLVGAIF WGFSADLIGR KIAFNLSLLL SAVFTIITGT MGTMASYCIF VFLSCFAAGG
NLVLDTCVFL EYLPHKHQWL LTFFAFFWGI GQTIAVLLAY AFLPNNSCSS ADDCPSHKNR
GWRYVYYVNG AIVLVMAILR ITVIRLKETP KFLVSNNRDA EAVEVLQSIA RKYNRQCSLT
LEQLNAIGEV KSSDDYRKHL NVKGTYTLVK HHLTILFANR KTARSTILLF LSWFLLGFAY
PLYSSFLPVY LATRGNNISA PDVHGVYRDN LISNVSSMGG PFIAGALLYF FPALGRRGVL
CIGGLVSMAF LFGYTQIKNR AQNVALSSTS FLAIYIYYAV LYAYTPEVLP SAARGTGNAL
SIACTRVASL VVPVIAYFSD TSSAVPIWIC GAFVGVIGLM ALLFPFEPSK HRVV
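
The record lists translation table 12 (the alternative yeast nuclear code), in which the codon CTG is read as serine rather than leucine. The following is a minimal sketch of how the protein sequence can be derived from the gene sequence under that table. The names `CODON_TABLE_12` and `translate_table12` are hypothetical helpers written for this illustration, not part of any database tooling; the amino-acid string follows the usual TCAG codon ordering.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for translation table 12 in TCAG codon order.
# It matches the standard code except that CTG encodes S (Ser), not L (Leu).
AAS_TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE_12 = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AAS_TABLE_12)
}


def translate_table12(dna: str) -> str:
    """Translate a DNA string with table 12, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE_12[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_table12("ATGCATTTGG TCAACGATGC CATTGACGAA"))  # MHLVNDAIDE
# A codon triplet from the gene sequence that contains CTG.
print(translate_table12("TTGCTGTGT"))  # LSC
```

The second call shows why the table matters here: under the standard code the same bases would read LLC, whereas the protein sequence above contains LSC at that position.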