Gene PICST_31190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31190 
Symbol 
ID4838632 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp162004 
End bp163109 
Gene Length1106 bp 
Protein Length313 aa 
Translation table12 
GC content42% 
IMG OID640389947 
Productpredicted protein 
Protein accessionXP_001383994 
Protein GI150864964 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.14246 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAGTA TTGACGACAC AGTACGATAT GTCGCTACCT ATATAGATGT AATAGTGTTG 
ATCATCGGAA TTTTAATGTA AGTTCGAATC TCTTGGGTCT GGTTTCTTGT ATCAGTGCTC
TATCGGATGT TTCAGTGCTA ACGAATTCTA GCATACCGAT GAATCTTGTG GTGTTGGTGG
TCTTGTTGGT TTCTGGTCGG CTATTCTATT ACTATTGTGT GAGATACAAC CTTCCCAGTA
GCTATAAGGA GTACCAGATC TCGTCGAAAG ATATTGCCTT GATTACTGGC GGATCTGGAG
GTTTGGGCTC AGAGTTGATT AAACAGATGC TCTCCAAAGG TGTACAGAAG ATCTACAACT
TGGATTTGAA GCTGTCTGTG GAGAGAGATT CTCGTATAGA TTACAGAAGA TGCGATGTTG
GTAATGAATC TGAGCTCAAA CAGAGCTTGG ACAACATCTT GTCTGAGTTG AAAGAACAAA
ATAGGAATAT CACTATTCTT ATTAATAATG CTGGCATCCG CCATAGCCAG TCGCTTTTGG
ATCTTCCAGA TAAAGAAATC CACGACATCT TCAATGTCAA CACATTCTCT TTCATCTGGA
CTCTCCGTAA AGTGACCTCG AATCATATAG ATACTGTCTT CAAGCAAGAC ACAAAAATAG
ATAGAAAGCT CAGAATCGTC AATGTGTCTT CGATTCTTGG AGCTTTGGCA CCGCGAAATC
TCTCGTTGTA TTCAGCTACA AAGTCTGCCA TAGTTCTGAT TCATGAATCC TTGACGCAGG
AGCTTCTGGA GTATCCTGAA ATTCGGTTGT TGCTTGTAAC TCCTGGCCAG CTCTCTACGG
GCATGTTTAA GGATGTCGAG CCTTCACGAA CGTTCTTAGC ACCCATCATC AGTGCCGAAT
ATTTGGCAAG AAGGATTGTG GAAAAGATTA ATGTTGGAGA AAGCGGAGTC TACTGTGAAC
CCTTGTACGC TAATTTCTTG CCCGGTATAA GGGTATTCCC AATGGTGCTA CAACACTTCT
GTCGCTGGTT TTCAGAGATG GACACCAAGG TGAATTCAAA TTCAAAAAGA GGAAACGAAA
GAGTCGATAT AGAAATAGAA AGATAG
 
Protein sequence
MISIDDTVRY VATYIDVIVL IIGILIYKEY QISSKDIALI TGGSGGLGSE LIKQMLSKGV 
QKIYNLDLKS SVERDSRIDY RRCDVGNESE LKQSLDNILS ELKEQNRNIT ILINNAGIRH
SQSLLDLPDK EIHDIFNVNT FSFIWTLRKV TSNHIDTVFK QDTKIDRKLR IVNVSSILGA
LAPRNLSLYS ATKSAIVSIH ESLTQELSEY PEIRLLLVTP GQLSTGMFKD VEPSRTFLAP
IISAEYLARR IVEKINVGES GVYCEPLYAN FLPGIRVFPM VLQHFCRWFS EMDTKVNSNS
KRGNERVDIE IER