Gene PICST_31820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31820 
SymbolMET1 
ID4838944 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp1759339 
End bp1761063 
Gene Length1725 bp 
Protein Length574 aa 
Translation table12 
GC content43% 
IMG OID640390259 
Productmethionine metabolism 
Protein accessionXP_001384647 
Protein GI126136246 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.164559 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCAACA TTCTAGCCTC CTTAAACTGT AGAGATGAAC ACCATCTCAT CATAGGAGTT 
TCCAACGTTG CTACACTCCG GATAAATTCC ATCTTGGATG CAGGTGCCAT TCCTATACTT
TTCACTGAAT CTGGGGAAGT GTCTAAGGTA GAAGAGAGTG AAAAATTGAA GACAATAACC
GGAAAGTTCG AGTTTGACAC TATAAGGCAA TTACTTATAT CTGAGGGGAG AGCCGAAGTT
GACTTTATTG TCGATAGGGT ATTTGTGGCG TTACCGATGT CTCAAGCTAG ATTGAAGAAG
AGCATATATG ACTACTGTAA GAAGAACAGA ATACCGATCA ACACTTCTGA TTCGCCTGAT
CTCTCTACCT TCACGTTGTT GTCAACATTT ACTTCGGGAG ACTTCCAAAT GGGTGTTACC
ACTTCTGGCA AGGGGTGCAA ATTAGCATCG AGAATCAAGA GAGAGTTGAC CAATGCCCTT
CCAGCAAATA TAGGTGATAT TTGCGACAAG ATCGGCGACT TGAGAAAAAA GATACAAGAA
GAAGATAATC TCGAGCTTGA AGACTTGAAA AACAGCGATC ATTTCCATTC CATTGGAGAG
CACGACGAGG ATGCCATCAA TACAAGCAAG TTGAATGCTT TAGTGGAGGA ATTCAATATG
ACACAAGAAC AAAAAAAATT GCAAAGAACG AGATGGTTGA GCCAAGTCGT CGAATACTTC
CCGTTGAACA CGTTGGGGGA ACTATCGTTG GATGACTTGA CCTCAGCCTT CCATGAATAC
AAAGCTGGCG CTGCCGAAAC AGATGAGCCA GAAAAGAAGA AACAAAAAGT CGTAGATTCC
AAGAAGGGAA GCATATCCTT AGTGGGTTCT GGACCTGGGT CAGTTTCTCT TTTAACTATT
GGAGCACTTC AAGCTATTCA TAATGCCGAT CTCATCCTTG CAGACAAATT AGTTCCACAA
CAAGTGCTTG ACATCATTCC AAAGAAGAGA ACCAAATTGT TCATAGCCAG AAAGTTCCCT
GGGAATGCTG AAAGGGCTCA ACAAGAATTG TTATCTATGG GTTTGGAAGC GTTATTGCGT
GGACAAAAGG TGGTCAGATT GAAACAGGGT GATCCATATA TCTTTGGAAG AGGGGGAGAA
GAGTACAATT ACTTCTCTGA AAGAGGATTT ACTCCAGTAG TGTTACCGGG AATCACTTCT
GCCTTAGCAG CTCCTGTATT GACTAATATT CCAGCTACCC ACAGAGACGT AGCCGATCAG
GTTTTGATCT GTACAGGCAC CGGTCGTCGT GGAGCTCTTC CAAACTTGCC TGAATTTGTC
AAGTCCAGGA CAACAGTCTT TTTAATGGCA TTACATAGAG TGGTTGATTT GATTCCCAAA
TTGATAGAAA GAGACTGGGA CCCAAAATTG CCAGCAGCAA TTATAGAAAG AGCTTCGTGT
CCGGACCAGC GAATAGTAAG AACGACTATT GAGAATGTAG CCAAAGCTGT TGAAGCCTGT
GGATCCAGGC CCCCAGGATT GTTGGTTACC GGATATGCCT GTGATGTTAT CTTCAAGCAC
AATAGTGAAA CATCCGAACC ATGGGTGATA GAAGAAGGAT GTGAAACTGC CAACAGCACT
CATTTAGAGC CATTCTTGAA ACTTGTTTCC TCCTATAATC CAGAAGACAT CTCCAAACCC
AGCATTCATC AGACTCCCCC ACCTGAGCCA TTAGCTACTA GTTAA
 
Protein sequence
MTNILASLNC RDEHHLIIGV SNVATLRINS ILDAGAIPIL FTESGEVSKV EESEKLKTIT 
GKFEFDTIRQ LLISEGRAEV DFIVDRVFVA LPMSQARLKK SIYDYCKKNR IPINTSDSPD
LSTFTLLSTF TSGDFQMGVT TSGKGCKLAS RIKRELTNAL PANIGDICDK IGDLRKKIQE
EDNLELEDLK NSDHFHSIGE HDEDAINTSK LNALVEEFNM TQEQKKLQRT RWLSQVVEYF
PLNTLGELSL DDLTSAFHEY KAGAAETDEP EKKKQKVVDS KKGSISLVGS GPGSVSLLTI
GALQAIHNAD LILADKLVPQ QVLDIIPKKR TKLFIARKFP GNAERAQQEL LSMGLEALLR
GQKVVRLKQG DPYIFGRGGE EYNYFSERGF TPVVLPGITS ALAAPVLTNI PATHRDVADQ
VLICTGTGRR GALPNLPEFV KSRTTVFLMA LHRVVDLIPK LIERDWDPKL PAAIIERASC
PDQRIVRTTI ENVAKAVEAC GSRPPGLLVT GYACDVIFKH NSETSEPWVI EEGCETANST
HLEPFLKLVS SYNPEDISKP SIHQTPPPEP LATS