Gene PICST_57105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_57105 
Symbol 
ID4838166 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp55626 
End bp56706 
Gene Length1081 bp 
Protein Length339 aa 
Translation table12 
GC content41% 
IMG OID640389481 
Productpredicted protein 
Protein accessionXP_001383289 
Protein GI150864464 
COG category[S] Function unknown 
COG ID[COG3332] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.669873 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGCATTT TGTTATCTAC CACCGAACAT CCAGACTATC CGTTCATCCT TTTGTCCAAC 
AGAGATGAGT ACTTCACGAG ACCTACTCAA GCTGCCCATT TTAGATCGTT TGATGGTACC
ATGAAGATTT TATCTCCTCT TGATATGGCT AGACCTGAAC ATGGCACTTG GATAGGAGTA
ACCACATCGG GAAAGGTTGC TGTTCTAGTC AATTATCGTG AGATAGATCA CGCTCGTATG
TATTGATTCT ATTTATGATG GCAAATTTTG CGATATGTGG CACTAACAAT TTGTAGACCT
GCTAAGTGAG GTATCCAGAG GCATATTACC ATTAGATTAT CTTTGCACGA ATAAACTGGC
CGATAAATGG CATAGAACCC TCGAATCGTC GTTGAGCCAC GTTACCAGGG GCAAAGTAGA
GTTGAGTCAG ATCGGCGGGT TTTCACTTGT GTATGGACAA TTGTCCATTG ATCCCAAGAC
AGGCAAGCTT AACCACTTGA ATATACTAAG CAACAGAGGA GACCATGGCA AGATTCATGC
ATCTGCGAAA GATAATTCCA ACGAAGAAAG AGAAGAAAAA GAAGAAGCAG ATGAAGAAGA
AGAAGATGAT GATGAAGAGG ACGACTTGCA CGGTGATATA AGTAACAAGA CCACATTTGG
CTTATCCAAT TCATTGTACT ACGAACCTTG GAAGAAGGTC AAGCTTGGTG AGGAGCTCTT
ACATGAGTTG GTAGAGAAGT CTAAAGAAAT GAAGCTTTCG CAGGAAGCTC TAGTTTCTGA
ATGTTTCAAG TTGCTTAGTC ATAATACTTA CGACAAGGAA GTAGCCAAGC AGAAGGATTT
TTCCAAAAAG ATTACAGAAC TTAGAAACTC GATATACATC CCCCCATTAG AAACCTACAT
TAGCCCCAGT GCCAGATTGC TTACTGCTGG AAAATACTAC GGTACAAGAA CCCAAACCAT
CTTGTTGCTT GACAGGTTTG GGTACCTTAA CTACTATGAG AAAAACATCC ATAACAGCGA
TGACGTCGAC GACGACAACA TTGTCTTCAA CCACTACAGG TTTAATATCG ATCAACAATA
G
 
Protein sequence
MCILLSTTEH PDYPFILLSN RDEYFTRPTQ AAHFRSFDGT MKILSPLDMA RPEHGTWIGV 
TTSGKVAVLV NYREIDHAHS LSEVSRGILP LDYLCTNKSA DKWHRTLESS LSHVTRGKVE
LSQIGGFSLV YGQLSIDPKT GKLNHLNILS NRGDHGKIHA SAKDNSNEER EEKEEADEEE
EDDDEEDDLH GDISNKTTFG LSNSLYYEPW KKVKLGEELL HELVEKSKEM KLSQEALVSE
CFKLLSHNTY DKEVAKQKDF SKKITELRNS IYIPPLETYI SPSARLLTAG KYYGTRTQTI
LLLDRFGYLN YYEKNIHNSD DVDDDNIVFN HYRFNIDQQ