Gene Msil_3024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMsil_3024 
Symbol 
ID7093519 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylocella silvestris BL2 
KingdomBacteria 
Replicon accessionNC_011666 
Strand
Start bp3336280 
End bp3337797 
Gene Length1518 bp 
Protein Length505 aa 
Translation table11 
GC content68% 
IMG OID643466334 
Producturoporphyrin-III C-methyltransferase 
Protein accessionYP_002363296 
Protein GI217979149 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGAAACC GGCGCTCAAA AGATGTTTCG AGCGAAGACT ATGACGAAAG CGTCGCCATG 
AACAGGACGT CGGCCGCTCC CGAGCCGGGC CGCCCGATCA TGACCGGGCT GGCGTCGCTG
CCGGTGTTCT TCAACGTGAG GGGCAGGCGC GTCGTTATCG CTGGCGGGTC CGAGGCCGCC
TTGTGGAAAG CCGAGCTTGT TCAGGCGGCG GGAGCGTCTG TGGTGGTTTT TGCCGCCGCC
CCTTGCGACG GATTGGCGGA GCTTGAGCGC CGAGGCGCTT CGGTCGCGCT AGTGCGCCGC
CGCTTCGAGC CTGGCGATTT GGACGGCGCG GTTCTCGCAC TCGCCGACTG CGATATCGCC
GAAGAGGCGG AGCAGTTTCA CGCCGCCGCG CGGGCGCGCG GCGTTCCGGC GAATGTGATC
GACAAGCCTG CCGCCTCCGA TTTTCAGTTC GGCGCAATCG TCGACCGCTC GCCTTTGGTC
ATCGCCATCT CGACCGACGG CGCCTCGCCC ATTCTCGGAC AGGCTTTGCG CGGCCGCATC
GAGGCCATGC TGCCAGCGGC GATCCGGCTT TGGGCCGGCG CCGCGAAATC CTGGCGCGCG
CCGCTAAAGG CATTGGAGCT CGCTCCAAAA TTGCGCCGGC GATTTTGGGA GCTCTTCAAC
GAGCGGGCTT TGACGGCGAG CGCCGTGCCG CCAGGCCCCG ATGAATTTAA ATCCTTGCTG
GCGGAGGCGA CGGCTGAGGG TCCGCGCGCG GCGAAGGGCT CCATCGCCCT CGTCGGGGCA
GGCCCCGGCG ATCCGGAACT GCTGACGCTG AAAGCCTTGC GCCTGCTGCA AGCGGCGGAT
GTCGTGCTCT ATGACGATCT TGTCGCGCCC GAGATTCTCG ATATGGGCCG CCGCGAGGCG
ACAAAAATCC CGGTCGGCAA GCGCGGCTAT CGGCCGTCCT GCAAGCAGGA CGACATCATC
GATCTGATGA TCAAGCTCGC GGCGGAAGGC AAGCGGGTGG TGCGGCTCAA AGGCGGAGAT
CCGATGATTT TCGGGCGCGC CAGCGAGGAG CTTGCCGCCC TTCACGCCGC AGGAATCGCG
ACCAGCGTTA CGCCCGGCGT TACGGCGGCC CTTGGCGCCG CCGCCTCGCT GCAACTCTCG
CTGACCGAAC GCGTGCGGGC GCGGCGGTTG CAATTCATCA CCGCTCACGC CCATGACGGA
AGGCTGCCGG AAGACATCGA CTGGCGCGCG CTGGCCGATC CCTGCGCCTC CAGCGTCATT
TACATGGGCG CGCGAACGCT CAACTCCCTC GTCGAGCGTC TGGCGGCGCA TGGGGCGGAC
CCTTCGACGC CCGCGCTTCT CGTCGAGCGC GCGACCTGCC CGGACGAGCG CGTGATCAGG
GGAACGCTGG CGAGCCTGCC CGCGAAAGCC GCCGCGCTGT CGCCGTCCGG GCCTTGCCTG
ATCCTGATCG GCGCCGTCTT CGCCGGCGGC GTCGAGGCAG AGCAGATTCG GGAGGCGGCC
GATATTGCGA TCGCCTGA
 
Protein sequence
MRNRRSKDVS SEDYDESVAM NRTSAAPEPG RPIMTGLASL PVFFNVRGRR VVIAGGSEAA 
LWKAELVQAA GASVVVFAAA PCDGLAELER RGASVALVRR RFEPGDLDGA VLALADCDIA
EEAEQFHAAA RARGVPANVI DKPAASDFQF GAIVDRSPLV IAISTDGASP ILGQALRGRI
EAMLPAAIRL WAGAAKSWRA PLKALELAPK LRRRFWELFN ERALTASAVP PGPDEFKSLL
AEATAEGPRA AKGSIALVGA GPGDPELLTL KALRLLQAAD VVLYDDLVAP EILDMGRREA
TKIPVGKRGY RPSCKQDDII DLMIKLAAEG KRVVRLKGGD PMIFGRASEE LAALHAAGIA
TSVTPGVTAA LGAAASLQLS LTERVRARRL QFITAHAHDG RLPEDIDWRA LADPCASSVI
YMGARTLNSL VERLAAHGAD PSTPALLVER ATCPDERVIR GTLASLPAKA AALSPSGPCL
ILIGAVFAGG VEAEQIREAA DIAIA