Gene PICST_32703 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32703 
Symbol 
ID4840095 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp447250 
End bp449281 
Gene Length2032 bp 
Protein Length636 aa 
Translation table12 
GC content45% 
IMG OID640391410 
Productpredicted protein 
Protein accessionXP_001385771 
Protein GI150866242 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACG AAGTAAGTAT ACTGTGCCAA TGACAGGAAA AAGACTCGCA CCAACCGTTT 
CATTACGTCA CTGTACTGAT GCGATTAAGC GACTCACCTT AATATCTATG GCGGTACTAA
CGACACCCTA TAGCAACAGA AGACTACTTT TGATTTAGAG CCCAATCCTT TTGAACGCTC
GTTCGCCTCC AAGGACTCGC TGGTACTGAA CGCTTCACTG GTTCTGGAAC ATCATAACAG
AGACTCCGAG AACTCATCAG CCACTTCGTC TACCAAGTCC TCTAACAAAC ACAACCTCCA
CATCCCCAAT CTTTCCACAC TCAACGGTGC CAACAACAAC ATTAATAACA ACATCAACAT
CAACAGCAAC AATAATAGTA TCAATAATAG TAGTAATAGT ACCAGTAACA AACTTCCGGG
AATCACTCCT CCCCTTTTTA CACCCGGTGG AAGAAGATTA CATCCTTTGG GACTTTCTCC
TCCTGTCCCC GGATCTAATG GTGCTGCTGT CACAGCTAAC GGAACGGCCT TGCTGAATCC
TGGCACTCCA GGTTCTAACT TATGGAACAG TTTGTTGAGT GCTACCAACA ACCACAATAA
TAATGGTTCC AACGTCAATA CTGCCAATGT TGCTACTAAC GGTGCCAATG CCAATGTGAT
AGCTGCAGGC AATGGTCCCA ATTCTCAAGC CAATTTCAAC CAGTTTGTGA ATACTCTCAG
AAAGACTGGA TTGACTCCTA ACGAGTCGAA TTTGCGTTCT GGCTTGACGC CTGGAATTCT
CTCCCATCAG TTTTCATTTG GAGCACAGGT TCCGGGCTTG ACTACTCCTA GCGCCTTGCT
TAATAGTCCT ATGACTCCTG GTTTGTCTTC CTTGTTGGGC TTGACGTCTA ACAATTCTGC
CAATAACGTC GCTACCATTA ACTCGTTACA GCCAACACAT CAACAGACAT ACGATACCCT
TCCCCTGATC CCGCAAGAAC CTTCTGAAAG TTTGCCTACT TCAGAACCCC TTAGACAGCC
AATGGCTGCT CCAGTTGTGA AACAGGAAAT TAAGAAACAA GAAACGAGTA AACGTAGTCA
AAGCAAGAAG AGAAAGGCAG ATACCGCAGA TTCTAGTAAG GGAAAGAGGC AAAAGGCAGA
TTCCGCTGCA GCTAAGAAGG CTGCCGCCAC CAGAGCCAAC CTGGAATCCG ATTCGGACAA
AGAGTCCTCT CCACCTAGAA ACTCGAACAA TCCCAAATCT GAAGACGAGA AAAGAAAGAG
CTTTCTCGAG AGAAACAGAG TGGCTGCGTC TAAGTGTAGA TTACGTAAAA AGCAATTGGT
TCAGAAGATG GAGGACGAAT TGGCGTTTTA TTCTACAGGC TACAGAGAGT TGTCTGCTGA
AGTCAACCAA TTGCGCGATC TGTTGATTAC ACTCAAAAGT ATTATAGAAA ATCACAAGGG
CTGCGCTTTG TTGGCACAGA ATGTCGGAGG TTTTGATCAG ATAGAGAGAA TAATACAACA
AGCCAACTAC ATCGCTGAGA TGAGTAACAA CAGTCTGAAC GATGTTACCT CTATCCCACT
GACTATTCCA ACAACTCTTC ACAGCACAAA TTCCGTCAGT GCCATTCCTG CTCGTGGAAA
TGATTCTCAG TTTCAGGCGA TGTCCAACAC CTTAGTTACT AAGACCATCG GCACTCCTAA
CAGCAACGAT GTTCAAGCCA ACTCTAGCAC GAATACCACT ACTGTGGTGA CACCTGAATT
GGCAGGAGCT TATTCTCATG CCACTATCAA CCATGCTCAC GGCATGTCAG ACATGCCACA
ATCTAATCCT GATGGTCCTG TTGCTATGAA TGGAGGCAAT GGTGAGTTGA GGGCCATAAA
CAGCATGTCA AACTTATCCG CTTTGAACAC AGGTGCTCAA GCTCAAATGC AGCAACTTCC
ACATCCTCAA CAGGCCTTGC AGAACTATAG TCTCCGTCCT GTTAGCAGCA TGGTTGAGTT
ACAACAAGCC ATGCATGCAC ATGGCAATCT TGGAAGCGAG TTGAACGTAT AG
 
Protein sequence
MTNEQQKTTF DLEPNPFERS FASKDSSVSN ASSVSEHHNR DSENSSATSS TKSSNKHNLH 
IPNLSTLNGA NNNINNNINI NSNNNSINNS SNSTSNKLPG ITPPLFTPGG RRLHPLGLSP
PVPGSNGAAV TANGTALSNP GTPGSNLWNS LLSATNNHNN NGSNVNTANV ATNGANANVI
AAGNGPNSQA NFNQFVNTLR KTGLTPNESN LRSGLTPGIL SHQFSFGAQV PGLTTPSALL
NSPMTPGLSS LLGLTSNNSA NNVATINSLQ PTHQQTYDTL PSIPQEPSES LPTSEPLRQP
MAAPVVKQEI KKQETSKRSQ SKKRKADTAD SSKGKRQKAD SAAAKKAAAT RANSESDSDK
ESSPPRNSNN PKSEDEKRKS FLERNRVAAS KCRLRKKQLV QKMEDELAFY STGYRELSAE
VNQLRDSLIT LKSIIENHKG CALLAQNVGG FDQIERIIQQ ANYIAEMSNN SSNDVTSIPS
TIPTTLHSTN SVSAIPARGN DSQFQAMSNT LVTKTIGTPN SNDVQANSST NTTTVVTPEL
AGAYSHATIN HAHGMSDMPQ SNPDGPVAMN GGNGELRAIN SMSNLSALNT GAQAQMQQLP
HPQQALQNYS LRPVSSMVEL QQAMHAHGNL GSELNV