Gene PICST_47685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_47685 
Symbol 
ID4840283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp671593 
End bp673158 
Gene Length1566 bp 
Protein Length521 aa 
Translation table12 
GC content40% 
IMG OID640391598 
Productpredicted protein 
Protein accessionXP_001385832 
Protein GI150866286 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.773754 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACC AACTCACCTT GGAAGAACAG CTGGAGGGCC TCCCTCTTTT CCAGTTGGTA 
ATTATTGGAT GTCTCCAAAT TGGTCAACTG ATAGCTTTTT CTTCCATGTA CCCCTATATT
TACTCCATGG TTAAGTATTT TGATATTGCT GACAACGATT CACAAATCGC CACATACAGT
GGATATTTGG CAGCTGCTTT TTCACTCGGC GAGTACATGA GCTTGCTGTA TTGGTCCAGT
GCCTCAAATA TATATGGCAG AAAAACCATA CTCTTGTGTG GCTGTGCTGG TACAGCATTT
TCGATAGTTC TCTACGGCTT TAGCACGAAC TTTTACATGG CGCTTTTTGC GAGATTGTTG
ATGGGAATAT GTAGTGGTAA TACTGAAGTC TTGAGAATTA CAATAGATGA AATTGCCCCC
GAAGATAGAC ACAAAGGTTT CGCTTTCGGC AATATCTTTC TGATCTCAAA CAAGTATAAA
TTCATTGGGT ACTCCTTGGG AGTTCTCGGT GAATCTAGCG TATCTAAGTC TGCTTCTAAG
AGCCGGAGAG AAGATGGATT TTCAATTCCA AGCTATCCAT TTCTTCTTCC CAGCCTCATA
GCAGGAAGCT TTGTTGTATT TTTCATCAAC ATCGGTTGGC TCTTTTTGGA AGAAACACAT
GAGCGAATAA AGTATGAGCG TGACATTGGC ATAAATGTTG GAGATTCTAT TAGGCGTCTA
TTGAGAATAC GAGTACCGGA AAGGCCATGG AATCTGAGAG AACAATACCT AAAAGTTGAC
CATCAACTAT TGGAGGGAAA AATTGATTCT TCAGAATTGC CCTACTACCC CACCAAAAGC
AGATCTCTTT CTGTAGAAGT TGCAGATTTT GAAGAACCCT CGCAAAGCGA AACAGGCACA
GAAACAAGCG ATCCTATAGC ATTACCTGCC GTAAGAAATC GCATGATTAA CAATTTTATG
TTCTGCTTTC ACGGTGTATT CTACTTCGAG TTTCTCCCAA TTTTACTTGC CACTAAACTT
AGAATAGAGG ACATGAAGTT CCCATTTCAT GTTAGAGGAG GGTTCGGTTA CAGTTCAATA
GGAATTGGAA TTCTTGTAAA TAGTTCGGCA GGTATTGGGT CATGTGTTGC TATGTGGCTT
ATTGTTTTTG TTAAATATTG TGGTATAAAG CCTGTGTCTC TTGGCTTGAT CGTATACCCC
ATTGTTTACT TTTTATTGCC CTTACTTCTT TTCACACTGC ACCAGTACAA TAATGGAATA
CCAGAATACG TACCGGTATT ATTGCTTTTC ATTATAATAC TTGTTGATTT GTCAGCTGAC
TTTCTTACTA TTTCCCGATT CCAAATTTTC TTCGACACTA CGTCTTCCAA GGAGGAGAAA
CAGCTAATTA GTAGATATTC AATCAGAGTC ATCAGCTTAG CAAAGTGTTT AGCCCCAATT
ATTGGAGGTT GGATGATATC GAAGTCCGAG ACACACGGTT ACAGCGAATT GCCTTGGTGG
GCTCTTTCAG TTTGGTCAAC GATAACACTA TTACATTCTA ATTACATCGA TAAGAGTGCC
TGGTGA
 
Protein sequence
MTNQLTLEEQ SEGLPLFQLV IIGCLQIGQS IAFSSMYPYI YSMVKYFDIA DNDSQIATYS 
GYLAAAFSLG EYMSLSYWSS ASNIYGRKTI LLCGCAGTAF SIVLYGFSTN FYMALFARLL
MGICSGNTEV LRITIDEIAP EDRHKGFAFG NIFSISNKYK FIGYSLGVLG ESSVSKSASK
SRREDGFSIP SYPFLLPSLI AGSFVVFFIN IGWLFLEETH ERIKYERDIG INVGDSIRRL
LRIRVPERPW NSREQYLKVD HQLLEGKIDS SELPYYPTKS RSLSVEVADF EEPSQSETGT
ETSDPIALPA VRNRMINNFM FCFHGVFYFE FLPILLATKL RIEDMKFPFH VRGGFGYSSI
GIGILVNSSA GIGSCVAMWL IVFVKYCGIK PVSLGLIVYP IVYFLLPLLL FTSHQYNNGI
PEYVPVLLLF IIILVDLSAD FLTISRFQIF FDTTSSKEEK QLISRYSIRV ISLAKCLAPI
IGGWMISKSE THGYSELPWW ALSVWSTITL LHSNYIDKSA W