Gene PICST_33787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33787 
Symbol 
ID4840941 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp489732 
End bp491492 
Gene Length1761 bp 
Protein Length586 aa 
Translation table12 
GC content42% 
IMG OID640392256 
Productpredicted protein 
Protein accessionXP_001386684 
Protein GI150866925 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.715519 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.121325 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCCTG ATCCAAATAC AAAATACGAC GATCAGCAAC TGAAAAGGCG CAAACTCGAA 
CAACAATCGT ACCCTGATGT ATCTAGAAAT ATTTCAATCT ACAGCATCTC CAGTAAGCAC
CCTTTGAACG TAAGACCAGC AGGAAACTCA TATTTGACTC TGGAAGATGT TGCATTGAAA
GAAGCAAAAC GGAACCTGTT GGGACATCTC AATATGTTCC CAGAAGAGTT ACTTATGGAA
CTCTTGACAT ACATAGACGA TAAAGAGACC TTGCGGAACC TTTCTCATAC TTCCAGAATC
CTATACGCAT ATCTTTATGA TGAAGAGATC TGGAAGAAGC TCTTTGTGAA AAGCATAGAA
GACTCGACTC AGAATCTGCC TCAAAAGTGG AACGGTTCAT GGAGATGTAC CGTACTTGGA
ATTGATAAGA AGCATCTGGC CAATATCATA CTACCAGATA ACCTTGTCTG CTCCGATATC
TTATACAGAC CTTTCCAATG TTCACAGATC AACTATGAAA AGTTATTCCG TAAAATCATA
CAGGAAGAAG AAACCTACCA TCTCGATGCC TTGTCAGATA ACCTCAAGCA ATTACCACCT
GGCCGAATTC AGAGAATACC AGAATCAGAA TTGTCTCTCG AGCAATTCAA TACAGAATAT
CATGATGTGC CCTTCATATT AACCAATAAA GACAAGACCA GGTGGCCACG CTGGGATTTT
CCAACTTTGT TAAGTCGGTT TCCGAATGTA AAATTCCGTC AAGAGGCCGT TCAGTGGGAT
TTGGCACTTT ATTCTGAGTA TTTGAAGTCT AACCTTGATG AAAACCCATT GTACTTATTC
GATTGTAGCA GTGAAGCTAT GACTACTTTA CGTAAGGAGT ATGACTCTCC TCTGATATTC
AAAGAAGACT TGTTTACTCT TTTTAACTTG AATAATGGAC AACTGAACTG CCGTCCAGAC
CATGCTTGGT TGATAGTAGG ACCAGAAAGA TCTGGTTCTA CCTTCCACAA GGATCCCAAT
TATACATCTG CATGGAATGC AGCTTTGAAG GGCAGAAAGC TTTGGGTGAT GTTACCTCCT
GGAATCACTC CACCTGGTGT AGGCACTGAT GAAGAAGAAA GCGAAGTGAC TTCACCTGTA
GGAATTGCTG AATGGGTTAT CTCAGGTTTC TTTAACGATT CGTTGAAGAT CAAGGAATGC
TTAGTGGGAA TCACATTCCC AGGTGAATGT ATGTACGTTC CATCAGGTTG GTGGCATTCG
GTTATAAACT TGGACGACTC GGTTGCGTTG ACTCAGAACT TTGTACCGTT TTCCAAATTG
ACCAATGCCA TGAACTTCTT GAAAAATAGA AGGGACCAAA TCAGTGGGTT CCGCCCCTAT
CCAGTCAAAG AATCAATTGA CTATGCGGTA GAGACGCTTC TTAAAGGAAA GAATAACGAG
GATATAGAGA AGATGAGGGA GTACAGTGAA AAATTCAATT CCTTGAACTT GGGAGAGAAG
TTAATTAATG AAGACTGCGG TGAAATCAGT GAACTACCAC CCATGCCTGT TTACGAGCTT
TTCAAGCAGT TGTTGATACT TAACGGAAAA GAAGATGAGT TGGCTACAGC TTTGGAAGAG
TTGAAGAAGC TAGAATCGAG AAACAGAGCA AAAACTTCAG GTAGGAGTGA AGCATGGGAG
AAATTGACTA CTCCGGCACT TGAAGAGCAA CAGGGATTCA GTTTCGGGTT CAACCTCGAT
GAAAGCAGCG ATGAGGAATG A
 
Protein sequence
MSPDPNTKYD DQQSKRRKLE QQSYPDVSRN ISIYSISSKH PLNVRPAGNS YLTSEDVALK 
EAKRNSLGHL NMFPEELLME LLTYIDDKET LRNLSHTSRI LYAYLYDEEI WKKLFVKSIE
DSTQNSPQKW NGSWRCTVLG IDKKHSANII LPDNLVCSDI LYRPFQCSQI NYEKLFRKII
QEEETYHLDA LSDNLKQLPP GRIQRIPESE LSLEQFNTEY HDVPFILTNK DKTRWPRWDF
PTLLSRFPNV KFRQEAVQWD LALYSEYLKS NLDENPLYLF DCSSEAMTTL RKEYDSPSIF
KEDLFTLFNL NNGQSNCRPD HAWLIVGPER SGSTFHKDPN YTSAWNAALK GRKLWVMLPP
GITPPGVGTD EEESEVTSPV GIAEWVISGF FNDSLKIKEC LVGITFPGEC MYVPSGWWHS
VINLDDSVAL TQNFVPFSKL TNAMNFLKNR RDQISGFRPY PVKESIDYAV ETLLKGKNNE
DIEKMREYSE KFNSLNLGEK LINEDCGEIS ELPPMPVYEL FKQLLILNGK EDELATALEE
LKKLESRNRA KTSGRSEAWE KLTTPALEEQ QGFSFGFNLD ESSDEE