Gene PICST_31696 details

Gene Information

Locus tag: PICST_31696
Symbol: HMC3
ID: 4838557
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009044
Strand:
Start bp: 1458079
End bp: 1459323
Gene Length: 1245 bp
Protein Length: 414 aa
Translation table: 12
GC content: 38%
IMG OID: 640389872
Product: hypothetical protein
Protein accession: XP_001384579
Protein GI: 150865386
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.772275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.350827
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCAATT TTTCCACAAA TGCTCTGTCG ACTGGTAGAT CGGAACTGCC AGAAGAACAG 
ACATTCAGTC CCATCACTGC AAGTCTTCTC GCAGAAGCGA TTAGGGAACA AGCAAATCAA
TTGCTTGAGA AACAATCAGA CGAATTTGAG GCTAAATTTC TACTGCTTCA AAACCAAATT
CATGATATCC TAACGCTTAT TTCTGCCGAG AAAAGTGGTA ACAGAAAAGG AGTTTCGAGA
AATACTGACG ATTCTTCGAC CATTACGAGT ACTAATAGTG ATATTAATCT TATTCAATCT
AGTATTGAGT CTAGTTCGGT GCTCGCTGAT GATGACTTAA GTGTAAACAT CAGTGCTCCA
CCTAAAATTG AACACTCTCC ATCGTTTAAA TCAAGTAAAC ATTCTGCTGA AAAAACATTC
AATTTATTTG AAAGAACTAA CGATACGATT CGAAAGTCTC AGAAACGCTT TCATGAATCT
TGGAGAAAAT TACCCAAGCT CAATGATAGC AGCGTTGAAC TCTGGTCCAG GGCTCTTCAG
GAACTAAACA GTGACCCAGA CTATAAAGCT TTGTCTAAGG CAAACTTCAA AGTAGACTGG
AACAGTTTCG AATCCAGAAC TGGACTCCAT GGTAATGAAC TTAAATACTA CTACGATTGC
TGGAAGGATG ATCTCATTGC ACCTTATTGC AATAATACTT TGAGTATTCT TGCTGCTACC
CGAGATCATA CTGTCACCCT TGAAGATCTA ATTGAGTACA CATCGGAGCA TGCAGATGAT
GCTAAAACGA TGTCTATACT TGAGGAAGTG CAACGACGCT ACCGAATCGA TATACTGTGT
AAAGACTATG TCTCAGAATT GAGAGGCAAA AACACGCATG ATTATGACCG TATAATTCAA
TTTATTGACG GTATTCCTTC CGACCTATAT GGCACCATTA GTCACTACTG TAACCAAAGA
CACGATGGCA ATTGTATAAT AGCTGCCGCT ACGGCCAATT TCTATTATAA AGAATTTATG
ACCAAGGAGA ATTTTCATTA CCCTACCCCC AACACTTTCC AAAAGAAAAT GATCAGTACA
CCTGGTTTCT CTGGTAAAGT TTTATCTGAT TCCTCTAAAT CAAGAACGAA TTCCAAACAC
AGAAAAGACA GGAGTAACTA TAATAATTAT TCTCAAAATA AACTGCTGTT GGAGAATAAC
CAGAAATCGC ACAGAAGACA AACGGACAAT GTTAATCACA ACTAA
 
Protein sequence
MSNFSTNASS TGRSESPEEQ TFSPITASLL AEAIREQANQ LLEKQSDEFE AKFLSLQNQI 
HDILTLISAE KSGNRKGVSR NTDDSSTITS TNSDINLIQS SIESSSVLAD DDLSVNISAP
PKIEHSPSFK SSKHSAEKTF NLFERTNDTI RKSQKRFHES WRKLPKLNDS SVELWSRALQ
ELNSDPDYKA LSKANFKVDW NSFESRTGLH GNELKYYYDC WKDDLIAPYC NNTLSILAAT
RDHTVTLEDL IEYTSEHADD AKTMSILEEV QRRYRIDISC KDYVSELRGK NTHDYDRIIQ
FIDGIPSDLY GTISHYCNQR HDGNCIIAAA TANFYYKEFM TKENFHYPTP NTFQKKMIST
PGFSGKVLSD SSKSRTNSKH RKDRSNYNNY SQNKSSLENN QKSHRRQTDN VNHN
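
Consistency check (illustrative)

The coordinates, lengths, and sequences in this record can be cross-checked with a short script. What follows is a minimal sketch and not part of the original record. The function names (build_codon_table, translate, gene_length) are hypothetical helpers introduced here. The sketch assumes that translation table 12 differs from the standard code only in reading CTG as serine rather than leucine, which is enough to reproduce the first 20 residues of the listed protein from the first 60 bases of the listed gene.

```python
# Illustrative sketch only: helper names are hypothetical, not from the record.
BASES = "TCAG"
# Standard-code amino acids for codons enumerated as TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table(ctg_as_serine=True):
    """Return a codon -> amino acid dict; optionally read CTG as Ser (table 12)."""
    codons = [a + b + c for a in BASES for b in BASES for c in BASES]
    table = dict(zip(codons, STANDARD_AAS))
    if ctg_as_serine:
        table["CTG"] = "S"  # assumed sole difference from the standard code
    return table


def translate(dna, table):
    """Translate DNA (whitespace ignored) codon by codon, stopping at a stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gene_length(start_bp, end_bp):
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


# First line (60 bases) of the gene sequence listed in this record.
GENE_PREFIX = "ATGTCCAATT TTTCCACAAA TGCTCTGTCG ACTGGTAGAT CGGAACTGCC AGAAGAACAG"

if __name__ == "__main__":
    length = gene_length(1458079, 1459323)
    print(length)           # gene length in bp
    print(length // 3 - 1)  # codons minus the stop codon
    print(translate(GENE_PREFIX, build_codon_table()))
    print(translate(GENE_PREFIX, build_codon_table(ctg_as_serine=False)))
```

With CTG read as serine, the translated prefix matches the start of the protein sequence above (MSNFSTNASS TGRSESPEEQ). With the standard code, the two CTG codons in this prefix would instead be read as leucine.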