Gene PICST_37167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_37167 
SymbolYMC3 
ID4840865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009048 
Strand
Start bp356579 
End bp357739 
Gene Length1161 bp 
Protein Length386 aa 
Translation table12 
GC content42% 
IMG OID640392180 
Productmitochondrial carrier protein 
Protein accessionXP_001386462 
Protein GI150866760 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.787045 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.905393 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCA ACCAAACTGT TTCACTTTCA GCCGCTGGTA TGAGAGCTCT TATGTACCAG 
CTTCAGTCTC TCTACCTAAG AACACCAGTG AAGTTATTTC GTCCTCTGCG ATTTGACTAT
CTAGCCTATG TACGTGAGTT GGCTAACAAA CACGATAACA TCCACGAGAA ACCGTATAAA
TTCAGAACGC ATTCCTCGAT AGGAATGCTT GTCAATGTAG TAAAGAAAGA GGGATGGAGA
TTTATCCCTG ATCAGGTGTT GCCTCCTCTT GTAGCCAATT CGGCCACAGG GTTGATATTG
TACGGAACTT ACTTGACGGC TTTGGATCGA TTCAACAGTC TGCATACCCC AAAAGCAGAT
AAACCCCGAG AATTGTTCTA TTACTCCCCA TTCGATACAT GGAGAGCTGG TTTTATTGCT
GGTGCCGTTC AGTCACTCGC AGCAGCACCT GTAGACGCCA TATATACTAG ACTGACAGCA
GCAGAAATGT TGAGTGGTTC ACACCAAAAC CTCTGGATGT ACGGTTTGAA CAAACTTAAA
GAAGTCGGAT TGGTTGGAGT TTTTGCTGGG TATAGTTTTT CGCTTGTAAA AGAATCGCTT
GGATTTGCCT TCTACTTCTC TACCTTTGAG TTCGTCAAGA CTCAAGGCTA TACAGCAACT
TTCAAGGTTG TCAATGTTTA TAGACGAAGC AAAGAATCCA TCAAAAGCAA GTTACGACAA
TACTCAAATA TGAACGAAGA ACAGATAGAC GAACGACTAT TGAGCTTGGA GCGTACGAGA
ACCAAAAAGA TATTGAGGTC AACCTTTATT CTCGTAGCTG GTGCATCTGC CGCTTTCTCG
TTGTTGGCAA TACAATACCC AATCACCAAG ATTCAGAAGA TTCATCTTTC TAGACTTGAA
GCTCTTGATT TCTACAATGC ATCAGCTACG CGTTCATACA AACCCTCCAT AACCTTGTAC
TACAACTCAT ACATTGATAC ATATAACCAA ATCCTTAGGA TGAAGACAAA ATCAAAGTTG
ACTTGGTATC AAATGGCATA CAAAGGTTTT GTCCGCAATG CATTGACAAC AATACCGGCT
ACATCCGTGG CCCTCTTGGT TTTTGAAATA ATGAGAACCA GATTGACTGA CGACTTACTG
GAATTCGAAA TTTTGGAATA G
 
Protein sequence
MSSNQTVSLS AAGMRALMYQ LQSLYLRTPV KLFRPSRFDY LAYVRELANK HDNIHEKPYK 
FRTHSSIGML VNVVKKEGWR FIPDQVLPPL VANSATGLIL YGTYLTALDR FNSSHTPKAD
KPRELFYYSP FDTWRAGFIA GAVQSLAAAP VDAIYTRSTA AEMLSGSHQN LWMYGLNKLK
EVGLVGVFAG YSFSLVKESL GFAFYFSTFE FVKTQGYTAT FKVVNVYRRS KESIKSKLRQ
YSNMNEEQID ERLLSLERTR TKKILRSTFI LVAGASAAFS LLAIQYPITK IQKIHLSRLE
ALDFYNASAT RSYKPSITLY YNSYIDTYNQ ILRMKTKSKL TWYQMAYKGF VRNALTTIPA
TSVALLVFEI MRTRLTDDLS EFEILE