Gene PICST_39662 details


Gene Information

Locus tag: PICST_39662
Symbol:
ID: 4852050
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 3505939
End bp: 3508131
Gene Length: 2193 bp
Protein Length: 592 aa
Translation table:
GC content: 39%
IMG OID: 640393758
Product: predicted protein
Protein accession: XP_001387003
Protein GI: 126276440
COG category:
COG ID:
TIGRFAM ID:
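
As a quick consistency check on the coordinates above, the following sketch recomputes the gene length from the start and end positions. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly but which agrees with the listed 2193 bp. The function name gene_length and the constant names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span length, assuming 1-based inclusive coordinates (an assumption)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 3505939
END_BP = 3508131
LISTED_GENE_LENGTH = 2193
LISTED_PROTEIN_LENGTH = 592

span = gene_length(START_BP, END_BP)
print(span == LISTED_GENE_LENGTH)

# Coding bases implied by the protein length, assuming one stop codon
# is included in the gene span.
coding_bp = LISTED_PROTEIN_LENGTH * 3 + 3
print(span - coding_bp)  # bases not accounted for by the coding sequence
```

Under these assumptions the span exceeds the implied coding length by 414 bp, which is consistent with the record marking the gene as spliced. The arithmetic alone does not say where any introns lie.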


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAACACTA GCACTGAAAG GAGTTTCAAG CGAGTAAAAA CTGGCTGTTT GAAGTGTCGA 
AAAAGACATA AGAAGTGTGA TGAAGTTAGA CCTAATTGTC TGTCGTGTAC AAAGAAAAAA
GAAGTCTGTG AATGGCCAGT AAGCTATGGA AAGTTCCACA AGAATTCAAC GTTCCAATTA
CCAGCCAACA AAGCTGTTCA CAAAACAGCA ACTACAAAGG AACTGAAAAA GTATTCCGAA
CATCTTGCTA ACAATTTCAA TCGGATGTCT CATTTAGAAG CAGAAGGAAA CTTATCTAGT
GAAGCAGAGA CGTTGCTCTC GATGCTGGTA CCTAGATTAA ATAAGTCCAA AAGCAATGGT
GATATTCAAA TTGAAAAGTC CATGTCAATA AGTAAGATTT CAACCCCTCC TCAAACTTCA
AGCTTTTCAC AATTACAAAT TCTGCCAAAG TCTACTAGTA ATCCTCTGGA ATTCAATGCT
TTATCGTTGA ATAGAATATT GAATACTGAC CCAGATTCTG AGCAATTAGA TGGATCAATA
ATACCAATAG ACCCCAACAT GGATGGATTG ATAGATAGTC GCAATGGCTC CACAACCATA
TCCAATACTG ATAAAAGTGA ACCAAGCATA TTTGAAAATA TTTCCCCGAG TTCAGCTTCA
TATGATGACA CATACTCATT TAGTCCAGAG TCAATAATGG AACCGTTCTT GCTTTACAAC
GAGTTGCACA ATACTTTGAG AGAGTATATG TTTTCAAATG TTTCTTCTAC AGCAGAAGTT
TCAGAGAAAA TAAATGGCTC CGATAACATC ATTTTTTGTC CCACTACTTC AAATAATATC
GATAAAGTTG GCGAGCAAAT TGAAGGAGAT TCTTCACTGT CAAGGGCCAA TAACAAACAA
GTTGAGGATG ATTTAGATGC AACCAATGAG CGAGAACCCA TTCCAAATAA ATCTAATGAA
TTATTTAATG TACTTATCCA CGAGCCCCCC GCGTTGACAG AAATGGAAAA GTTGTTCCTC
TACAAGAACT ACTTATATGA AGTTGCCCCA TGGTTGGATA TGTTTGATTA TTCACAGCAA
TTTGGCATCA CTATCCCCCA GCAAGCAAAT CTGAATGCAG CTCTCATGTT TGCCATTTAT
GCCGTTTCTT CAAGACAGAT CGAGCTTACC AATCCAGATT ATGACAAAGA TAAAACAATC
AAGATTTATC AAGAATCTCT CAAATATTTG ATTCCTACAG TAGAGCAAAC AATGGATAGA
GCTATCATTT CTTCTTGTGT TATTCTTTGT GTTCTTGAGA TGATGTCATC GTCTCCAAAA
GAATGGCGAC ACCATTTGGA AGGTTGTTCT GCACTATTTA GGACCAATAA CATTCATGGG
TTCAGTGACG ATTTGGAAAG AGGACTTTTT TGGTGCTATG CTAGAATGGA TGTAAGTTCA
GCAGTTATTG GAGAGCAGTC GACTGTACTT CCCACAGAGT ACTGGTTACC TAAAGAATTC
AAAATTAAAG ATCTGAAAGA GTACTTTCAA AAAGAAAATA AACCTGATAT GTATGCAAAT
TATATTGTCT TTCTTTGTTC CAGGGTTTTA AATTTGATCT CAAATGAGAC ACCTAATTTT
AGAGCTGAAT GGGAAAGCTT ATTTAGCGAG GTGGTTGCTT GGCACGTGAA CCGACCTCCT
GAATTGCAAC CGTTCATGGA GTTTGAGCAC TTTCCCTTCC CAGGGCTTCT CTTTTTGAAC
GGGCCCGCCA TTTCATCTAA CCAATTATAT CACATGGCGA TAATACTCTT ATCGCAGAAC
AAGCCAAGAC TTCTCAAAGT TAAGCCATCG AGGAGTGTAG TAAGTATAGC ATTAATGTAG
CCGATTTCAA TCTCTCTACT AACGCAGACT CTAGAAATCC AACATCTGGC ATGCCAAGCA
GATTTGTGCC ATCAGTTTGC ACAATACCCA CCAGTACGTA TAAATTCCTT TTGTCTAACT
ACAATTGAAC TACAGGATTG CTAACATTTT TTCAGTGGAT GCTGGAACAA CGCCCTACAG
CCACTATGGA TCGCCGGGAA GCTCCTCAGT AGCGAAGAAG AGCACACCAT CATACTCAAC
CTCCTCGACA AAATAGAATC TACAACGGGT TGGCAAATGA ATTTCAGGGC GCGGGACTTA
AAGAGGTTCT GGGATGGAAA GTTGGTTGAG TAA
 
Protein sequence
MNTSTERSFK RVKTGCLKCR KRHKKCDEVR PNCLSCTKKK EVCEWPVSYG KFHKNSTFQL 
PANKAVHKTA TTKELKKYSE HLANNFNRMS HLEAEGNLSS EAETLLSMLV PRLNKSKSNA
SYDDTYSFSP ESIMEPFLLY NELHNTLREY MFSNVSSTAE VSEKINGSDN IIFCPTTSNN
IDKVGEQIEG DSSLSRANNK QVEDDLDATN EREPIPNKSN ELFNVLIHEP PALTEMEKLF
LYKNYLYEVA PWLDMFDYSQ QFGITIPQQA NLNAALMFAI YAVSSRQIEL TNPDYDKDKT
IKIYQESLKY LIPTVEQTMD RAIISSCVIL CVLEMMSSSP KEWRHHLEGC SALFRTNNIH
GFSDDLERGL FWCYARMDVS SAVIGEQSTV LPTEYWLPKE FKIKDLKEYF QKENKPDMYA
NYIVFLCSRV LNLISNETPN FRAEWESLFS EVVAWHVNRP PELQPFMEFE HFPFPGLLFL
NGPAISSNQL YHMAIILLSQ NKPRLLKVKP SRSVKSNIWH AKQICAISLH NTHHGCWNNA
LQPLWIAGKL LSSEEEHTII LNLLDKIEST TGWQMNFRAR DLKRFWDGKL VE
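
The sequences above are shown in space-separated blocks of ten residues. The sketch below is one minimal way to turn that layout back into a contiguous string and to compute GC content for DNA. The function names clean_blocks and gc_fraction are illustrative. The example uses only the first line of the gene sequence, so its GC value describes that line alone and is not expected to match the 39% reported for the whole gene.

```python
def clean_blocks(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    gc = sum(1 for base in dna if base in "GC")
    return gc / len(dna)


# First line of the gene sequence from the record above.
first_line = (
    "ATGAACACTA GCACTGAAAG GAGTTTCAAG "
    "CGAGTAAAAA CTGGCTGTTT GAAGTGTCGA "
)

dna = clean_blocks(first_line)
print(len(dna))
print(f"{gc_fraction(dna):.1%}")
```

Applying clean_blocks to the complete gene and protein blocks gives strings whose lengths can be compared with the listed 2193 bp and 592 aa.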