Gene PICST_31606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31606 
Symbol 
ID4838604 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1238795 
End bp1240237 
Gene Length1443 bp 
Protein Length480 aa 
Translation table12 
GC content39% 
IMG OID640389919 
Productpredicted protein 
Protein accessionXP_001384192 
Protein GI126135336 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTCGC TTTCGTTCAA TGTCTCGTAC AATTCCATTA CGAAAAAAGT GACTGTTCCG 
AGATCCAATA CTGTTCAACA GCTCATTGCC GTGAGTTTGG ACAAGTTTTC TATCAATTCA
GGCAAGTATG GGGGCCAATT ATACCACAAC AATAAATTGC TAGAATCGTC TTTATCGCTT
CGTCTCGCAA ATTTGATTAA CAACTCCAAA TTGACGCTTA AAACGACGAA TTTGGCTGCA
TCTGCTCAGC AAATCAATGT AAAATTGATG ATTTCCAGCG ATTCTGAAGG TACAAAACAG
ATTATCAATA AAGTGGACAG TAATGCAACT CTTCTTGAGT TATTGCAGCA ATTTGAAACG
AGCGAAAATA TCCAATTGTT GACGAAACCA AGTCAATTGG GTATTTTGAG TGTGACGTAT
CCTTCTGATA GCTACTCTTC TACCAGATTA GGTTCGCTTG TAGGTAATGT GTCGAATGTA
GTAATCAGAT TCAACTACAC TATGGGTGTA GATACAGCAA AATTGAAACA GCAGGAACAG
CAGGAATCAG TAAAGTTGCA ATTGAAACAA CAGCAAGAAA GGATCGCTAG ACAGAGAGAG
GAAGAAAGAG CCAAGGCACA AAAAGAATTA GAATTACAGA AACAGCAAGA GCAAGATCAG
GCATTGAAGG AAGAGGAGGA AGAGGAAGAG ACTCCGGAAC CTACTGAAAG TATCATAGAA
ACAAAAGAGA AGCCTTCAAT TCCTTCTTCT ACTATTAATG CTGATTCTAT ATTAGAAAAA
GAATCGTACC AATTTCAGAC TCCACAAATT GAGGAAACTC CGCTACTCTT TGTACCCGGC
AACTCCAATT CTGCCTTATA TGAGAATCCA GATGAAGATT ACGAGATGAC GGTATCTCAG
GCAAAGACGT ACCAGCAGCT AATCCAAAAT TCTGGAAAGA AGAGAAAAGC CAAGCAAATC
AATAAACCTG TGAGGCTCTT AATTAGAGTC AAATTTCCAG ATCGGTCTAT TTTACAGATT
AATTTTGTAA ATGATGTCGA CACCATAAAG TTGGGACATT TGGTTAAGAA AATCGATGGC
TTGTTGAAAC CAGAATATAT CAATCATTAT AATATTAAGG CGGGATACCC CCCACAAACG
ATTCCATTGA ACTTTGAAAA CAACAATACG TTTTTGGTAG ATATTCCCGA TTTTCAGAGC
GAGAGAATCG TGTTAATCTG GGAGCTCTCG GACGGTGCAC CTAGTAAGAA TGGACCGTTC
TTGAATGAGC AGCTTATTGA GGATGTTAAG ACATCAACGG ATTTACCTGA AGTAGTTCTT
GAAAGTCATA GGGGAGAATT ACCTGATGAT GCGCATACTA AAACAAGAAC TTCTGGACAA
GGATCTGAGG CCAAATCAGA GAGTAAGGGT AAACTAGTTC CTAAGTGGTT GAAGTTTAAG
TGA
 
Protein sequence
MSSLSFNVSY NSITKKVTVP RSNTVQQLIA VSLDKFSINS GKYGGQLYHN NKLLESSLSL 
RLANLINNSK LTLKTTNLAA SAQQINVKLM ISSDSEGTKQ IINKVDSNAT LLELLQQFET
SENIQLLTKP SQLGILSVTY PSDSYSSTRL GSLVGNVSNV VIRFNYTMGV DTAKLKQQEQ
QESVKLQLKQ QQERIARQRE EERAKAQKEL ELQKQQEQDQ ALKEEEEEEE TPEPTESIIE
TKEKPSIPSS TINADSILEK ESYQFQTPQI EETPLLFVPG NSNSALYENP DEDYEMTVSQ
AKTYQQLIQN SGKKRKAKQI NKPVRLLIRV KFPDRSILQI NFVNDVDTIK LGHLVKKIDG
LLKPEYINHY NIKAGYPPQT IPLNFENNNT FLVDIPDFQS ERIVLIWELS DGAPSKNGPF
LNEQLIEDVK TSTDLPEVVL ESHRGELPDD AHTKTRTSGQ GSEAKSESKG KLVPKWLKFK