Gene PICST_33431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_33431 
SymbolHMC4 
ID4840448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009047 
Strand
Start bp635382 
End bp636530 
Gene Length1149 bp 
Protein Length382 aa 
Translation table12 
GC content38% 
IMG OID640391763 
Producthypothetical multicopy protein 
Protein accessionXP_001386131 
Protein GI150866503 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.613875 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAACT TATCTACAAA TGTTTTGTCG GCTGGTAGAT CGGCACCGCC AGAAAGTCAC 
GGCTCCAAAA CTCTTACTGT AGATAATATC GCAGAATTGA TTAAGATACA ATTAGAGGAT
TATGAAGCTA AGTTTCTAAA GCTTTATGCT CAACAGCAAT CTCAAGTTAA TGAACTTATA
CTGATTATAT TAACCAAGAA GAGCGATTCT GAGAAAGATA CCGGTGATTC CGCCATTACG
AGTACTGATA GTGATATTAA TCTTATTACG AGTATTGAGT CCAGTTCTGT GCTTGCTGGT
AAAGACATTA CTGAAAACAC CAGTGTTCCA CCTAAAATTG AAACCTTTCC ATCGCTTAGA
ACAAGTCAAA CCACTGCTCT AGATGGATTC GATTTTTTTG AGCGAACTCA AAGGGTGATT
AGAAAATCTC AGAAACATAT TCATGAATAT TGGAAAAAAT TACCCCCGCT CAATGAGACC
AGCGCTGAAC TCTGGTCCAG GGCTATTCAG GATTTGAACA ACGAGATGGA TTATAGAGCC
TTATCTAAAG CCAATTTCAA AGTCGACTGG AACACTTTCC AATCCAAAAC CGGACTCCGT
GGTGATAAAT TAGAATATTT TTACGAGTGC TGGAAGGATG CTCTTATTGG ACGTTATCGC
AATAACACTT TGCGTATCCT TGCTGTCAAT CGAGACCATA TTATTACTCT TGAAGATCTA
CTTGAGTACA CATCGCAAAA TGCAGACTAC GACAAAACCA ATTCTATACT TGAGGAAGTG
CAAAGACGCC GCCGAATCAA TCCAATGTGT CAAGACTATA TCTCAGAATT TAGAGGTACA
AACATTCATG ATTATGACCG TATAATTCAA TTTCTTAACG GCCATCCCGC CGATCTCTAT
TGTGCCATTA GTCACTTCTG TAACCAAAAA CATGAAGGCA ATCGTACTAT TGCTGCCGCT
ACGGTCAACT TCTATTATCA GGATTTTATG ACCAAGGACA ATTTTCAATA CCCTTCAGTC
AACGCTTTTG AAAAGAAAAT GAAAAGTACA CTTGGTTACT CTTGTAAATT TTATTCTGAT
TCATCTAAAT TAAGAACGAA TTCAAAACAC AGAAGGGGTA AGAGTAATAG TAATCATTAT
TCACAATAA
 
Protein sequence
MSNLSTNVLS AGRSAPPESH GSKTLTVDNI AELIKIQLED YEAKFLKLYA QQQSQVNELI 
SIILTKKSDS EKDTGDSAIT STDSDINLIT SIESSSVLAG KDITENTSVP PKIETFPSLR
TSQTTALDGF DFFERTQRVI RKSQKHIHEY WKKLPPLNET SAELWSRAIQ DLNNEMDYRA
LSKANFKVDW NTFQSKTGLR GDKLEYFYEC WKDALIGRYR NNTLRILAVN RDHIITLEDL
LEYTSQNADY DKTNSILEEV QRRRRINPMC QDYISEFRGT NIHDYDRIIQ FLNGHPADLY
CAISHFCNQK HEGNRTIAAA TVNFYYQDFM TKDNFQYPSV NAFEKKMKST LGYSCKFYSD
SSKLRTNSKH RRGKSNSNHY SQ