Gene PICST_44442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_44442 
Symbol 
ID4838729 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1235325 
End bp1236737 
Gene Length1413 bp 
Protein Length443 aa 
Translation table12 
GC content39% 
IMG OID640390044 
Productpredicted protein 
Protein accessionXP_001384532 
Protein GI150865352 
COG category[R] General function prediction only 
COG ID[COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily)  
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGAAG CTGTTCGAAA GGCACGCTTC AAGAATACTG TCATCGAGTA CGATCTTTTT 
CGGGCCATTC TTCTGGACGA CATCTACATA GAAGAGCCTA TAGAGGAAAA GAATGCAGAA
GAAAAAGGAC GACAGCCAGA AAATGAAGTG CAAGATGAGG AACCACATAC CAGGAACGTT
CCGCTTTATG CTGAACTTCT TGATATTGAA ATCGACGAAG ATGTCTCCAT TCATGAGCTC
TATCTAGTTA ATAATACAGA CCCATCTGTG ACGGAAACTC ATCTCGTAAT TATCCATGGG
TATATGGCAG CCTTGGGGTA TTTTGTCAAG AATGTCGAAC CGTTGCTCAA ATCTCAACCA
GGGCTTCGTC TACACGTCAT CGACTTGCCT GGCTTTGGAA ATTCCTCTAG ACCTACTTTC
CCTAAAGAGT TCTTGACAGA GCCAAAAACT AAAGCAGAAA AGATTAAACA GATTGTAGAT
ATCGAAAACT GGTTTATTGA TAAACTAGAG TGCTGGAGAA AGAAACGTGG AATCAAAAAA
TTCAAACTAG TGGGCCATTC CATGGGTGCG TACTTGTCCT GCTGCTACTT GATGAAATAT
AATAAGGCAC TGGATAAAGT AGTAGACGAA TTTATAGCTG TCAGTCCTAT GGGCACGGAG
TCAAGTTATA ATAGTTTGCT TAATAACAAA AAGCACCAGA AGAATTATCA TGGAGACGAA
ATCGATCCTT TCAGAGAAAT TGTCGATTTT GACGAAGAAG ATGTAGTGAC AGATGAACTT
AGGTCACTTT GGGAGAGCTT GGGACAGCCT AAGTTTCCCA AAAATTACAT TCTCGAGAAG
ATATGGAAAT ACAACAAATC ACCGTTCCAA ATTCTCCAAA ATTTCGGACC CTTCTACTCA
AAAATATTAT CCTATTGGTC CTATAAACGA TTTAGAAACC TTAAGACTAA TGAGGATGAG
GCTGTAGACG AGGATGTCGA TACTCTAGTC ACTAATGCGA ATATAAAACG CTCGTATCTG
GAGTCCTCAG CAGCTTCGGT AAAGTCTAAT TCTTCCCTAG ACTTAATTCT TAAACTTCAC
AGTTATTCAT ATTCCATTTT CAACCAATAC CAGGGCTCAG GAGAAATAGC CATTACCAAA
TTCATTAACC ATGAGATCTT GGCTCGTTTA CCTCTCTGTG ATCGTGGTCT AGTCGAGTAT
TTGAACGAAA CATTAGTTAA AAGTTTATGG GTATATGGCG ACAAGGATTG GATGAACCAA
AAGGGTGGGT TATATATTCA TGATAAACTA AAGGAAATAA ACCCAGAGAT TTCAGACTTT
AAAATTATCG AAGATGCGGG GCATCACATT TACTTGGATA ATCCTGAAAG TTTTAACAAG
ACGTGTATAG ATTTTTTTCA CTTGTGTGCT TAA
 
Protein sequence
MSEAVRKARF KNTVIEYDLF RAILSDDIYI EEPIEEKNAE EKGRQPENEV QDEEPHTRNV 
PLYAELLDIE IDEDVSIHEL YLVNNTDPSV TETHLVIIHG YMAALGYFVK NVEPLLKSQP
GLRLHVIDLP GFGNSSRPTF PKEFLTEPKT KAEKIKQIVD IENWFIDKLE CWRKKRGIKK
FKLVGHSMGA YLSCCYLMKY NKASDKVVDE FIAVSPMGTE SSYNSLLNNK KHQKNYHGDE
IDPFREIVDF DEEDVVTDEL RSLWESLGQP KFPKNYILEK IWKYNKSPFQ ILQNFGPFYS
KILSYWSYKR FRNLKTNEDE ASNSSLDLIL KLHSYSYSIF NQYQGSGEIA ITKFINHEIL
ARLPLCDRGL VEYLNETLVK SLWVYGDKDW MNQKGGLYIH DKLKEINPEI SDFKIIEDAG
HHIYLDNPES FNKTCIDFFH LCA