Gene PICST_59246 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_59246 
Symbol 
ID4838586 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp301495 
End bp302751 
Gene Length1257 bp 
Protein Length399 aa 
Translation table12 
GC content48% 
IMG OID640389901 
Productpredicted protein 
Protein accessionXP_001384014 
Protein GI126134980 
COG category[S] Function unknown 
COG ID[COG2930] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.518781 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
AGAAAAGCGG CTAAGGTACT CTCGTCGTTT ATAAAGCCTA ACCAATTGGC TGGGCCCGAA 
AATATCATTC CTCCCAATGT ATTGAAAAAT GCTAAAGGTT TGGCTGTCAT CACAGTTTTG
AAAGCCGGCT TTTTGTTTTC AGGGAGAGCC GGTTCCGGTG TCATCGTAGC CCGTTTGCCG
GACGGATCCT GGTCGCCTCC TTCAGCCATT GTTACTGCCG GTGCCGGTGT TGGTGGCCAG
ATTGGAGCTG AGTTGACCGA TTTCGTCTTC ATCTTAAACA CCAAGTCAGC TGTGGACTCG
TTTGCTCAGT TTGGTTCTGT GACTTTGGGT GGTAACGTTT CTGTTGCTGC TGGTCCTTTA
GGTAGAAACG CTGAGGCTGC GGGTACTGCC TCGTTAAAGA GTGCCTCTGC TGTATTCTCA
TATTCCAAGA CCAAGGGTTT GTTTGCTGGT ATATCTTTGG AGGGTTCCGC TATCGTTGAA
AGAAGAGAAG CCAACAGAAA GTTTTACGGT AGCAATTGTA AGGCCAGAAA CATCTTGGCT
GGTGAAGTAG AGGCTCCTCC AGCTTGTTCG TCTTTGATGA GAGTGTTAGA GTCACGTGTG
TTCAACAACA GACAACCCTA CGATGACGAC GATTTCTATA ACGACGATTA CTACGACGAC
ATTCCAGACG ACTTTTCCGG CTCAACCTCG CCATCTTCTA CCAGAAGAGG CAGTACCAGA
ACCGGAGGAC GTGGCGGAAG CCGTAGAGGT AGCAGATACT CGGAAGACGA AGAAGATTAT
TCCGACGACG ACGATGATGA CTACAATTAC TCCAATAGAC GAAGAACTAA CTCGCGTGCT
AGTCCCAGCC ATACGAGCCA TGGTTACAAT GGTTCTGGTT CTGGGTCCGG AGCTGGAGCT
CGTAGATCAG GATGGGAAGA CGATGTCTAT GACAAAAATA GTAGCGATAG AAGAAGAAAC
CAAGGCAGTA GTGATGTAGA CAACTTGGGT TCTAGGCTCA ACAATACCCG TTTGAATAGC
GGCCCTACCA GACCTCCTAC TGGCTCCAAG CCCAACTTTG GTGGAACTCC AAAGACAAAC
ACAAACCAAG CCATAGCCTT GTACACTTTC AAGGGAGAAC AAAGTGGAGA CTTGCCATTC
AAGAAGGGCG ATGTCATTGA CATTATCAGA AAGACCGAAA CCGTAGACGA CTGGTGGACC
GGCAGAAACA ACGGGGTCAC AGGTATCTTC CCTGCCAACT ATGTGGAGTT GATCTAA
 
Protein sequence
RKAAKVLSSF IKPNQLAGPE NIIPPNVLKN AKGLAVITVL KAGFLFSGRA GSGVIVARLP 
DGSWSPPSAI VTAGAGVGGQ IGAELTDFVF ILNTKSAVDS FAQFGSVTLG GNVSVAAGPL
GRNAEAAGTA SLKSASAVFS YSKTKGLFAG ISLEGSAIVE RREANRKFYG SNCKARNILA
GEVEAPPACS SLMRVLESRV FNNRQPYDDD DFYNDDYYDD IPDDFSGSTS PSSTRRGSTR
TGGRGGSRRG SRYSEDEEDY SDDDDDDYNY SNRRRTNSPG ARRSGWEDDV YDKNSSDRRR
NQGSSDVDNL GSRLNNTRLN SGPTRPPTGS KPNFGGTPKT NTNQAIALYT FKGEQSGDLP
FKKGDVIDII RKTETVDDWW TGRNNGVTGI FPANYVELI