Gene PICST_29495 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_29495 
Symbol 
ID4836850 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009042 
Strand
Start bp406979 
End bp408199 
Gene Length1221 bp 
Protein Length406 aa 
Translation table12 
GC content42% 
IMG OID640388165 
Productpredicted protein 
Protein accessionXP_001382839 
Protein GI126132628 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGCCGA GACTCTGTCG CTTTTCTATA GGGCTCACGG GACGCTCTAT CGTATTAAAT 
AGATCTACGT CTGTTTTTCC CAAACACTTT CATTCCAATC CGGTGGTTTT ATCGTTTACC
TCCAGTTCTG GCTTTGAATA TGTGGAAAGC TACAAGAGCT TGAACGATAA GATCGAAGCT
GTTTTTGGGC AGAAGGACGT TTCTGACGAC GACATAATAG CAGCTCTTGT TGCCTGTCGT
AATCTAGAGC GAAACTATCC AGTCAATCAA CAGTTGCACA CCAATTCCAG ACTCATCCAA
GAAGCTTCCC ATTCTATAGA GCTAATCTTC AAGAACGACA CCAAGTTCTC AGCTGAATTG
TTGAAGAAGA TCTTTCTCTT GAAGTTGGCT ACTCCGTTGA ACTTGAAGAT TATCAACACT
TTCTACGAAA AGAATCCAGG AGCCAATACC ATTATCGATA AGAGTACTGC ACTTGTAGCT
TTGAGAAATG CCTTGGCCAA TGCTGACTTC CTCAATGCCA TCAAGTTGAC CGATGTGACT
GTAGGCCATC CCAATTATAT CGAGCACAAC AATAGGATCC TCAGGAAGGG TTTTTCACAG
TTGGTGGGTA CATCTTTGGT AATAACGTTC TTGACCAAGT ATGGAGTTAA TGAAATCATC
GATATGGGGG CTTTGAACGA AGGCTGGAAA CATTTGGGAG CCATTAACTC GTTGATTTTA
ACCTATTTGT TCAACTCCAG TTTCTTCTTG ACAATTGTCA GAGTCGGGCG ACAGTTAATC
AGCTCCGGTG GTGACTATTT GACCTGGCAA AAAGGAACAT TCTATACCCA TTGGTTCAAA
CATGCTGATG AGATGTTATT TTCAGCCAAA ATTGTAGAAG CTGATCGTCA GTTGAATGGT
GGAGAGTCAA ACCCTGAGAT CATCAACGAG TTATGTAGAA CCAGTGACGA TATGTTCAAT
ACCCAACGTA CATTACAGCC CGGATATAAT CGTGAAGGTG AAAAGATCAG ATTGTTGGAA
GCCAAAGACA ATATGGAAGA CCTCAAAATG CAAGCATATT GGATGAGTGG AGGTGATGGC
TTCGAATGGG TTGAACCAGA TCAGGATCCT GCCGATTTGA TCTGGAAACA ACATCTCGAT
AGTTTTAATA AACCTACTCT AGACAATAAT AGCAAGGCTA AGAACTTGAA ATGGGCTGAA
GAGTTGATTG GGGACAAGTA G
 
Protein sequence
MLPRLCRFSI GLTGRSIVLN RSTSVFPKHF HSNPVVLSFT SSSGFEYVES YKSLNDKIEA 
VFGQKDVSDD DIIAALVACR NLERNYPVNQ QLHTNSRLIQ EASHSIELIF KNDTKFSAEL
LKKIFLLKLA TPLNLKIINT FYEKNPGANT IIDKSTALVA LRNALANADF LNAIKLTDVT
VGHPNYIEHN NRILRKGFSQ LVGTSLVITF LTKYGVNEII DMGALNEGWK HLGAINSLIL
TYLFNSSFFL TIVRVGRQLI SSGGDYLTWQ KGTFYTHWFK HADEMLFSAK IVEADRQLNG
GESNPEIINE LCRTSDDMFN TQRTLQPGYN REGEKIRLLE AKDNMEDLKM QAYWMSGGDG
FEWVEPDQDP ADLIWKQHLD SFNKPTLDNN SKAKNLKWAE ELIGDK