Gene PICST_33379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33379 
Symbol 
ID4840688 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp462546 
End bp463814 
Gene Length1269 bp 
Protein Length422 aa 
Translation table12 
GC content42% 
IMG OID640392003 
Productpredicted protein 
Protein accessionXP_001386290 
Protein GI150866626 
COG category[T] Signal transduction mechanisms 
COG ID[COG0478] RIO-like serine/threonine protein kinase fused to N-terminal HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00101834 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCTTG ATACTTCTCA TATGAGGTAT TTGACCTCTG ACGATTTCAA AGTTCTTCAA 
GCTGTGGAAA TTGGCTCGAG AAATCACGAG TTGGTTCCTA CCACCTTGAT TCATTCTATA
GGAGGGTTGA AGTCTCCCTC AGGCACCAAC AGGGCCATTG GGGACCTAGC CAAATTGAAA
TTAGTCAACC GGTTAAGGAA CGCCAAATAC GACGGGTTCC GATTGACTTA CTCTGGTTAC
GACTACTTGG CTCTCAAATC CATGCTTAAC AGACAAACTG TGTACTCTGT AGGAACTACT
ATTGGTGTAG GTAAAGAATC GGATATCTAT TCTGTCAGTG ATCCACAAGG AGTTCAGAAG
GTGATGAAGA TTCACCGTTT GGGTAGAACA TCCTTCAAAA CTGTCAAAAA CAACCGTGAC
TACTTGAAAA ATAAGCTGAC TTCCAACTGG ATGTACTTGT CTCGTCTTGC TGCCGAGAAG
GAACACGAAT TTATGGTAGT ATTGTATAAC AACGGGTTCA ATGTTCCCGA GCCGTTTGAT
TCGTCCAGAC ACTGTGTGTT AATGGAGTGG ATCAAGGGAA TTCCTATGAA ACACTTGCGA
AAACATAGAG ACTACAGGAA GTTGTACTCT GAGTTGATGA ACTTCATCGT CAAGTTGGCT
AACCATGGGT TGATCCACTG TGACTTCAAT GAGTTCAACA TAATCATCCG AGACGACTCT
GAAGCTTCCA AGCACGAGTT CGACTTTGTA GTCATCGATT TCCCTCAGTG TGTCTCCATA
GAACATCCTG ACGCTAAGCA GTACTTTGAC AGAGACGTGG AAGGTATACG ATCTTTTTTT
GAAAAGAAGT TTAGATACGC TCCTAGCCAC GATGCTACCA TGTTCGACAC TGAAGGATAC
GGTGATGGTT ACAAGTATGC TTATCCTAAC TTCAAACGTG ATGTTATCCG TGAAAAGAGT
CTAGATGTAG AGGTGAAGGC ATCGGGATAT GCTAAGAAAA CGACTGGGGT CAAGGAGGAC
AAAGACTTGG AAAAGGCAGT TTTGGGAATG AGAATAAATC GATATGAGGA CGAAGATGAC
CTTTCGGAAT TCGATGATGA AGATGTAGAC GGTGAAGACG ATGAAGACGG TGATGATTAC
GAAGAAGAAG AGATTGACAG TGACGATGAC AACCAAGAGG AAGAAAACGA AAGAATCGTA
GAGATGTTGT CTAGTGGAGT CAAGAACCTA AAGATGGACA AGTTGGGAAA TTATATTATA
GAAGAATAA
 
Protein sequence
MKLDTSHMRY LTSDDFKVLQ AVEIGSRNHE LVPTTLIHSI GGLKSPSGTN RAIGDLAKLK 
LVNRLRNAKY DGFRLTYSGY DYLALKSMLN RQTVYSVGTT IGVGKESDIY SVSDPQGVQK
VMKIHRLGRT SFKTVKNNRD YLKNKSTSNW MYLSRLAAEK EHEFMVVLYN NGFNVPEPFD
SSRHCVLMEW IKGIPMKHLR KHRDYRKLYS ELMNFIVKLA NHGLIHCDFN EFNIIIRDDS
EASKHEFDFV VIDFPQCVSI EHPDAKQYFD RDVEGIRSFF EKKFRYAPSH DATMFDTEGY
GDGYKYAYPN FKRDVIREKS LDVEVKASGY AKKTTGVKED KDLEKAVLGM RINRYEDEDD
LSEFDDEDVD GEDDEDGDDY EEEEIDSDDD NQEEENERIV EMLSSGVKNL KMDKLGNYII
EE