Gene PICST_38073 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_38073 
SymbolECM31 
ID4850984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp636399 
End bp637337 
Gene Length939 bp 
Protein Length312 aa 
Translation table 
GC content46% 
IMG OID640392692 
Productprotein involved in cell wall biogenesis and architecture 
Protein accessionXP_001387760 
Protein GI126273945 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0413] Ketopantoate hydroxymethyltransferase 
TIGRFAM ID[TIGR00222] 3-methyl-2-oxobutanoate hydroxymethyltransferase 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.137581 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.577745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTTCCA GAGTCCAAAT CTTCAGAAGA ACATTCAGCC TGAGTGTTTC TGCCAAATCC 
TCCTACACTG GCACAGCTAG AAAGACTGTG GCCGATATCA ACAGGTTCTA TGCTAGCTCC
AAGCCAATTA CCGTCGTAAC AGCACACGAC TTCATAACCG CCAAGATGGT TGATCATGCC
GGTATCGACA TTTGTTTGAT AGGAGACTCT CTTGCCAATA CTACGTTGGG GCTTGATGAC
ACTAACGAGT TGGAGTTTCA AGAGATGCTT TATCACGTGA AGTCAGTCCA AAGAGGTAAT
GATTCGTCTT TGATCGTGGC CGATTTACCA TTTGGCTCGT ACGAGAAGTC CTCGGAACAA
GCCTTGGATA CGGCCATGAC TATTATCAAG CACGGTAAGA TTCAAGGTGT CAAGGTCGAA
GGAGGAGATG AATTCATCTT GCCAACGGTA AACCGGTTGA CCACGGTCGG AATTCCTGTA
ATGGGCCATG TTGGACTTAC TCCCCAAAAA CACAACGCTC TTGGAGGCTA CAGGCTTCAG
GGGAACTCTG TTGAAAATGC TGTCAGTATA TACAAGCAGT GTCTTGACCT CCAACGAGCT
GGTGTTTTTT CCATTGTGCT AGAGTGTATT CCCAACAAAT TGGCTCAGTA CATCACTGAA
AACTTGAGCG TTCCGACCAT TGGTATTGGT GCCGGTCCAT TTACTTCCGG GCAAGTTCTC
GTGATCTCTG ACATTCTCGG AATGAAGAGT AATAAAGAGA ACCACAAGCC CAAGTTTGTC
CGTGCCTACG AGGACTTTTA TACCAAAGGA GTTGAGGCGT TGACCAGCTA CGGTGAACAT
GTAGAGAACG CTCAATTTCC TGACGTAGAT GAGCACGGTT ACAAGATCAA GAGAGATGTA
TTTGAAGAGT TCAAGAAACA GGCCCACCAC ATCCATTAG
 
Protein sequence
MSSRVQIFRR TFSLSVSAKS SYTGTARKTV ADINRFYASS KPITVVTAHD FITAKMVDHA 
GIDICLIGDS LANTTLGLDD TNELEFQEML YHVKSVQRGN DSSLIVADLP FGSYEKSSEQ
ALDTAMTIIK HGKIQGVKVE GGDEFILPTV NRLTTVGIPV MGHVGLTPQK HNALGGYRLQ
GNSVENAVSI YKQCLDLQRA GVFSIVLECI PNKLAQYITE NLSVPTIGIG AGPFTSGQVL
VISDILGMKS NKENHKPKFV RAYEDFYTKG VEALTSYGEH VENAQFPDVD EHGYKIKRDV
FEEFKKQAHH IH