Gene Mlg_1679 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1679 
Symbol 
ID4268911 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1920918 
End bp1922363 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content71% 
IMG OID638126437 
Productprecorrin-2 dehydrogenase / uroporphyrinogen-III C-methyltransferase 
Protein accessionYP_742515 
Protein GI114320832 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0007] Uroporphyrinogen-III methylase
[COG1648] Siroheme synthase (precorrin-2 oxidase/ferrochelatase domain) 
TIGRFAM ID[TIGR01469] uroporphyrin-III C-methyltransferase
[TIGR01470] siroheme synthase, N-terminal domain 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACGC TCCCCGTGTT CATGAAACTG CGCGACCGAC GCTGCCTGGT TGTCGGTGGC 
GGCCGCCGGG CCGAACGCAA GGCCCGCCTC CTGCTGGCGG CAGGGGCGGA ACTGACGGTC
CTGGCCGAGG CCCCCTTGCC GACCCTGGCG GCCCTGACGG AAGCACACGG CTGCCGCATG
GTGCGACGCC CCCTCACGGC CCGCGATCTG GATGGCGTTT CCCTGGTGAT CTCCGCCGCG
GATGAGGCCA CCGACCGCCG CGCGCACATG CTGGCGCGGG CCCGAAACAT CCCCATCAAC
GTGGTGGACC GGCCCGATCT GTGCTCCTTC ACCCTGCCTG CCACCGTCGA CCGGGGCCCG
GTGCAGATCG CCGTCTCCAC CGGCGGCACG TCCCCGGTGC TGGCCCGGAT GCTGCGCAAC
CGGCTGGAGG CCGACATACC CTCCGCCTAC GGGCGGCTGG CCCGTCTGGC CGAGCGGTAC
CGGCGGCCGG TCCGCGAGGT CCTGCCGGAG GCGTGGCAGC GCCAGCGGTT CTGGGAAGAG
GTCCTCAGTG GTGAGGTGGC AGAGCGGGTC TTCGCCGGCC AGGACGACGC CGCGCGTGAG
GGCCTGGAAG AGGCCATCGG CCGCGCCACC CGAGAGCTCC AGACCCGGCG GGGGGAGGTC
TACCTGGTGG GCGCCGGGCC GGGTGACCCG GATCTGCTCA CCCTGCGCGC CCTGCGCCTG
ATGCAACAGG CCGATGCCGT GGTCTACGAC CGGCTGGTCA ATCCGGCGAT CATGGCCAAG
GTCAACCAGG ATGCCGAGCG GATCGACGTG GGCAAACGCT GTGGCCACCA CCCGGTACCA
CAACACGCCA TCAACGACAA GCTGGTCACC CTGGCCCGCC AGGGCTACCG GGTCCTCCGG
CTGAAGGGGG GTGACCCCTT CGTCTTCGGC CGCGGCGGCG AGGAGCTGCA GACCCTGGTC
GATGCCGGCG TGCCCTTTCA GGTGGTCCCG GGCATCACCG CGGCCACCGG CTGTGCGGCT
TACAGCGGCA TCCCCCTGAC CCACCGGGAT TACGCCCATA GCTGCGCCTT CTACACCGGC
CATCTCAAGA ACGACCGCCT GGACCTGGAC TGGTCACGGA TGGTGCAACC CGGGCAGACC
CTGGTGTTTT ACATGGGCGT GGCCAGCCTG CCCGAGCTCA GTCGCCAGCT CTGCTGGCAC
GGCCTGCCGG CCGAGACCCC CGCCGCGCTG GTGGAAAAGG GCACCACCCC GGAACAGCGC
ACCCTCACCG CCACGCTGGA GACCCTGCCG GGGCTGGCGC GCCAGCAGGG GTTTCAGTCG
CCCGCGTTGG TCATCATCGG CGAGGTGGTC CGGCTGTACG AGCAGATCAA CTGGTACCGG
CCACCGGGGG ACGAGACCCC CAAGGCCGCG GAGCCGGGCG AACTGAACCG CCGCCGGCCT
GCCTAG
 
Protein sequence
METLPVFMKL RDRRCLVVGG GRRAERKARL LLAAGAELTV LAEAPLPTLA ALTEAHGCRM 
VRRPLTARDL DGVSLVISAA DEATDRRAHM LARARNIPIN VVDRPDLCSF TLPATVDRGP
VQIAVSTGGT SPVLARMLRN RLEADIPSAY GRLARLAERY RRPVREVLPE AWQRQRFWEE
VLSGEVAERV FAGQDDAARE GLEEAIGRAT RELQTRRGEV YLVGAGPGDP DLLTLRALRL
MQQADAVVYD RLVNPAIMAK VNQDAERIDV GKRCGHHPVP QHAINDKLVT LARQGYRVLR
LKGGDPFVFG RGGEELQTLV DAGVPFQVVP GITAATGCAA YSGIPLTHRD YAHSCAFYTG
HLKNDRLDLD WSRMVQPGQT LVFYMGVASL PELSRQLCWH GLPAETPAAL VEKGTTPEQR
TLTATLETLP GLARQQGFQS PALVIIGEVV RLYEQINWYR PPGDETPKAA EPGELNRRRP
A