Gene PICST_32420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_32420 
Symbol 
ID4839436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009045 
Strand
Start bp1477701 
End bp1478702 
Gene Length1002 bp 
Protein Length333 aa 
Translation table12 
GC content45% 
IMG OID640390751 
Productpredicted protein 
Protein accessionXP_001385285 
Protein GI150865891 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.303596 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCCCT TATTTGTTCT GTTGTTGTTC TATCACTTGA GCAAAAAGAT TCGTGATATC 
TCCAACCGAC TAGTCGGTAA CTACTTCGAG CCAGAAAAAG ACACCGTACT CATTACAGGG
GGATGCTCCG GTTTAGGGAA AGAGCTCGTC AACACGTTTG CCGCTACTAG AGCCAAAGTA
GTTGTACTAG ATATTGTCGT ACCAACTGAC GAAGAACAGC CTGAAAATGT TTACTACTAC
AAGTGTGATG TTAGCGACAG AAAGCAAGTT CTCCAAGTCC ACAAAACCAT CAAAAAGGAG
ATCGGCAATA TTACAGTGCT CATTAACAAC GCTGGAATCA CTACTGGCAA GCCCCTAGTC
GATCTCAGCT ACCATGAAAT AGAGAAAACT ATACAGATAA ATTTAATGTC CAGTTTCTAC
ACCATCAAGG TATTTCTTCC TTCCATGCTC AGATTGCACA GGGGGTACAT TGTTACTATA
GCCTCTGTGT TAGGATACAT GTCCCCGGCT AGATTGAGTG CATACGGAGC ATCAAAATCG
GGGCTCATTG CTCTTCACGA GTCTCTTACT TACGAATTGG GACCCCCATC GATGAACCCT
ACGGGGGTGA AAACTTTACT AATCTGTCCC GGTCAGTTAA AAACCGCCAT GTTTTCGGGT
GTAAATACTC CGTCCTCGTT ATTGGCCCCT GAGCTAGATC CCAAGTATGT CGCCTCTAGT
GTTCTTTCAG CTCTAGAGCT TGGTCGCAGA GGCGAAATCA AGCTTCCTCT CTACGGAAAC
TTCTTGCCGA TGTTCCGCGC ATTTCCATGG CCCATTGTTG AAGTAGCGAG AGCAATTTCA
GGCATCGACC ACAGCATGAA TTCCTTTAAA AACACACTCA CAAAAGTGGC CAGCACTGTC
TCTACATTAT CACAATCAGG ATCCAAGAAT AGCTCAGAGA AAGCCTCTTT ACTAGGAGAA
GTAGATGTCA GCGAAGGGGT CAGTGAAATA GTAACGGCTT AG
 
Protein sequence
MGPLFVSLLF YHLSKKIRDI SNRLVGNYFE PEKDTVLITG GCSGLGKELV NTFAATRAKV 
VVLDIVVPTD EEQPENVYYY KCDVSDRKQV LQVHKTIKKE IGNITVLINN AGITTGKPLV
DLSYHEIEKT IQINLMSSFY TIKVFLPSML RLHRGYIVTI ASVLGYMSPA RLSAYGASKS
GLIALHESLT YELGPPSMNP TGVKTLLICP GQLKTAMFSG VNTPSSLLAP ELDPKYVASS
VLSALELGRR GEIKLPLYGN FLPMFRAFPW PIVEVARAIS GIDHSMNSFK NTLTKVASTV
STLSQSGSKN SSEKASLLGE VDVSEGVSEI VTA