Gene PICST_16361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_16361 
Symbol 
ID4837805 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009043 
Strand
Start bp468409 
End bp469707 
Gene Length1299 bp 
Protein Length433 aa 
Translation table12 
GC content41% 
IMG OID640389120 
Productpredicted protein 
Protein accessionXP_001383723 
Protein GI150864755 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.898174 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.978716 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATCAGTACTA TAGATATTGT GGGAAACAAG TTCTTCTACA ATGATACAGG AGAACAATTC 
TTCCTCAAGG GCATTGCCTA TCAAAGATCC AGAAGAGAAG GGGAGGACTT TGATAGAAGT
AAAGAAGTTC CCTACATTGA CTCTTTAGCC AATATTACTA CTTGTTTAAG GGATTTGCCT
CTTTTAGTTG AACTTGGCGT CAACGTTATC AGAGTTTACC AGATTGATCC AAATGCTGAC
CACGACATTT GTATGAATGC TTTGGCAAGT AAGGGAATCT ACGTCTTGGC TGATATGGCA
GAATCTGAAA TGTCTATTGT CAGAAACAAC CCAAATTGGG ATACTACTCT CTATACACGT
TATACTTCTG TTGTCGATGC TTTGCACAAA TACAACAACT TGTTGGGATT GATTGCTGGT
AACGAAGTCA CCAACACAAA GTACAATACC GATGCTTCTC CCTTTGTGAG AGCTGCAATC
AGGGATGTCA GGAAGTACAT TAAGAACAGC GGCTACAGAA AGATTCCTCT TGGTTATGCT
TCTAACGACG ATCCCGAGAT TAGAAAACTG CTTTCCAATT ACTTTGTCTG TGAAGACCCA
GCTACGAGAT GTAGCACTGC TGACTTTTTT GGGCTTAACA CTTATGAATG GTGTGGCTAC
TCCACTTACG CCACTTCTGG CTATCGTGAT CTCACCATTG AATATTCCAA GTTTCCAGTT
CCAATTTTCT TTAGTGAGTT CGGCTGTAAT GCTGTGTCTC CGCGTCCCTT TACAGAAGTT
GAAGCTCTCT ACGGAAGTAC AATGGCAAAG GTATGGTCTG GAGGAATTGT TTATGAATAC
TTTCAAGAAG TTAACAACTA CGGTGTGGTC AAGGAAAATG ATGATGGTTC AGTTGCCAAA
TTGGAAGACT TTGAGTTTCT TAAGCAGAGA TATAATAGTG TGATTCCTCA AGGCATAACC
GCTGCTGAAG AATCGGCAAG AGAAATTCCA AAACTCAATT GTCTGGGACA AAATGACATT
TGGAAAGCAA ACACCAGTCT TCCTCTAATA CCCAGTAGTG GCAAGTGTGA TTGTATCTAT
GAACTGTTCA AGTGTACCGC TAATCCTGCC AATCTTGACA ACGACTTGTT GAAGGAGATA
TGTTCCAATA TTGACTGCTC TGAAATTAGT GCCAATGGAA CTACAGGCGT CTATGGTCCT
TATTCTGATT GCCTGCTCCA ACAGAAACTT TCCTATGTTT TGAACAAACA ATACATTGAA
GGAGGAAACA AGACAGAATT GTGCGATTAT GATGGAAAA
 
Protein sequence
ISTIDIVGNK FFYNDTGEQF FLKGIAYQRS RREGEDFDRS KEVPYIDSLA NITTCLRDLP 
LLVELGVNVI RVYQIDPNAD HDICMNALAS KGIYVLADMA ESEMSIVRNN PNWDTTLYTR
YTSVVDALHK YNNLLGLIAG NEVTNTKYNT DASPFVRAAI RDVRKYIKNS GYRKIPLGYA
SNDDPEIRKS LSNYFVCEDP ATRCSTADFF GLNTYEWCGY STYATSGYRD LTIEYSKFPV
PIFFSEFGCN AVSPRPFTEV EALYGSTMAK VWSGGIVYEY FQEVNNYGVV KENDDGSVAK
LEDFEFLKQR YNSVIPQGIT AAEESAREIP KLNCSGQNDI WKANTSLPLI PSSGKCDCIY
ESFKCTANPA NLDNDLLKEI CSNIDCSEIS ANGTTGVYGP YSDCSLQQKL SYVLNKQYIE
GGNKTELCDY DGK