Gene PICST_31223 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31223 
Symbol 
ID4838916 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp246344 
End bp247468 
Gene Length1125 bp 
Protein Length374 aa 
Translation table12 
GC content48% 
IMG OID640390231 
Productpredicted protein 
Protein accessionXP_001384009 
Protein GI150864975 
COG category[T] Signal transduction mechanisms 
COG ID[COG0631] Serine/threonine protein phosphatase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.611184 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGCCT CCTCCCTCGC ATCTCGTGCC GTTGCTGCAA AAAGATTCAA GGGCTCATCC 
TCCCTTTTCC AAGGAGCCAG ATCGTACTTC CTTTCCTTCG GCAAGGAGCG AAACCAGGAA
ATGCTCGTCG ACGACATTAA CCCCTTCATC CGGTTCCCTG TCGTCACACC AGAACAGATG
AGGAAGATCT CCAACGACAA GTTCAAGTTC GACTTTGCGT ATACGTCGTT TGCATTTCAC
AGCTCGAGCT CAGTGCCTCA GATCCACTCG TTGTCGGACT TGACCGATCC CACGCTGTTG
AACTCGTTGC TTCCCAAAAG AAGACCCCAG GGTTCGCCTG CGGATACGCT TTCCATCAAA
GCTGGTGACG ATACCATGTT GGTGTCGCCG ACTGTACTTG CTGTGGCCGA TGGAGTCTCT
GGATGGGAGG ACAAGTCCGA TGCGGACGCC GGCATCTGGT CCCGTTCGAT GTTGGAGACG
TTCTCGAGAC TTATGACCGA ATACAAGATA AGCCATTCTC CTCATCACTT GAACAAGAGA
GACATCTCCG AGATTCTCGA CGACTCCTTC TTGCACACGT CACATTTGAT GGATTTGCAA
AGGTTAGAAG GCTCTTCAAC CTTGATACTA GGCATGCTAA GTGGAGACTT GCTTCAGATG
GTCAGCATTG GAGACTCCAA ACTCTACATC ATCAGAGACG GCGAGATCAT CAAAACAAAC
GAAGAGCAAA TGGTAACAGA TTTGTGTCCC CAGCAGATCG GAACTCACAC CTTGACTCAG
TTGCCATCGG AAGTTGCCTG GGTCGAATCG ATTGAATTGA AAGAAAACGA TCTTATCGTC
GTGTGTTCAG ACGGTATTTC CGACAATCTT TATGAATGGG AGATTGTGGA TTATCTCGAC
CAGTTCTTGA ACGGCAAAAA AGACAGCTTG AAGCGTGCAG TGAACAAACT TCTCTTGAAA
GCCAAGGAAG TGAGTTTTGA TGACTACGCT TGCACTCCTT ACAACCAGAA GGTGAATTCT
ATGTCTGGTA AACACGGCAA GCAGAACAGC GTAGGCGGAA AACTCGACGA TATGTCGCTC
TGTATTGCCA GAGTGGTGCT CAATAAAAGG GGAAAGTTAA ATTAA
 
Protein sequence
MFASSLASRA VAAKRFKGSS SLFQGARSYF LSFGKERNQE MLVDDINPFI RFPVVTPEQM 
RKISNDKFKF DFAYTSFAFH SSSSVPQIHS LSDLTDPTSL NSLLPKRRPQ GSPADTLSIK
AGDDTMLVSP TVLAVADGVS GWEDKSDADA GIWSRSMLET FSRLMTEYKI SHSPHHLNKR
DISEILDDSF LHTSHLMDLQ RLEGSSTLIL GMLSGDLLQM VSIGDSKLYI IRDGEIIKTN
EEQMVTDLCP QQIGTHTLTQ LPSEVAWVES IELKENDLIV VCSDGISDNL YEWEIVDYLD
QFLNGKKDSL KRAVNKLLLK AKEVSFDDYA CTPYNQKVNS MSGKHGKQNS VGGKLDDMSL
CIARVVLNKR GKLN