Gene Mkms_2503 details


Gene Information

Locus tag: Mkms_2503
Symbol: hemH
ID: 4616063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Mycobacterium sp. KMS
Kingdom: Bacteria
Replicon accession: NC_008705
Strand:
Start bp: 2625069
End bp: 2626109
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 70%
IMG OID: 639792171
Product: ferrochelatase
Protein accession: YP_938490
Protein GI: 119868538
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
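
The length fields above follow from the coordinates. As a rough consistency check, the sketch below recomputes them, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the gene sequence in this record ends in TGA). The variable names are illustrative only.

```python
# Coordinates as listed in this record (assumed 1-based, inclusive).
start_bp = 2625069
end_bp = 2626109

# Inclusive interval length in base pairs.
gene_length = end_bp - start_bp + 1

# A complete coding sequence should be a whole number of codons.
assert gene_length % 3 == 0

# The last codon is assumed to be the stop codon, so it adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1041 346
```

Both values agree with the Gene Length (1041 bp) and Protein Length (346 aa) listed above.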


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.146154
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTTTTG ACGCGCTGCT GGTGCTGTCC TTCGGTGGAC CCGAGGGCCC CGACCAGGTG 
ATGCCTTTTC TGGAGAACGT CACCCGGGGC CGCGGGATCC CGCGGGAACG CCTGGCCAGC
GTGGCCGAAC ACTACCTGCA CTTCGGCGGC GTCTCACCGA TCAACGGCAT CAACCGGGCG
TTGATCGCCG AGATCGAGGC CGAACTGGGC GACCGCGGCG AGACGCTGCC GGTGTACTTC
GGCAACCGCA ACTGGGATCC CTACGTCGAG GACGCGGTGA CCGCGATGCG CGACGACGGG
GTGCGGCGCG CGGCGGTGTT CGCCACATCG GCGTGGGGCG GATACTCCAG TTGCACGCAG
TACAACGAGG ACATCGCCAG GGGTCGCGCG GCCGCCGGTG ACGGTGCGCC CCAGCTGGTG
AAGTTGCGGC ACTACTTCGA CCACCCGCTG CTCGTGGAGA TGTTCGCCGA GTCGATCAGC
GTTGCGGCGC AATCACTTCC GGCCGATGTG CGCGATGAGG CCCGGCTGGT GTTCACCGCC
CACTCCATCC CGGTCGCCGC CGACGACCGA CACGGCCCCA ACCTCTACAG CCGCCAGGTG
GCCTACGCGA CCCGGCTGGT GGCCGCCGCG GCCGGATACT CCGAATTCGA CCAGGTGTGG
CAGTCGCGGT CGGGTCCGCC GCGTATCCCG TGGCTCGAAC CCGACATCGG CGACCACGTG
ACCGCCCTCG CCGAGCGCGG TACGAAAGCC GTCATCATCT GTCCGATCGG ATTCGTCGCC
GACCACATCG AGGTGGTCTG GGATCTCGAC AGCGAGGTGC GCGAACAGGC CGCCGACCTG
GGCATCGCGA TGGCCCGGGC CAGGACGCCC AACGCCGACC GGCGTTACGC CCGCCTCGCC
CTCGACCTCG TCGACGAACT CCGTGGCGAC CGCGACCCCC TGCGCGTCGC GGGTGTCGAC
CCCGCACCGG GGTGCGGATA CAGCGTCGAC GGCACCACCT GCGCGGATTC ACCGCGCTGC
GTCGCCCGCA TCACCGGCTG A
 
Protein sequence
MLFDALLVLS FGGPEGPDQV MPFLENVTRG RGIPRERLAS VAEHYLHFGG VSPINGINRA 
LIAEIEAELG DRGETLPVYF GNRNWDPYVE DAVTAMRDDG VRRAAVFATS AWGGYSSCTQ
YNEDIARGRA AAGDGAPQLV KLRHYFDHPL LVEMFAESIS VAAQSLPADV RDEARLVFTA
HSIPVAADDR HGPNLYSRQV AYATRLVAAA AGYSEFDQVW QSRSGPPRIP WLEPDIGDHV
TALAERGTKA VIICPIGFVA DHIEVVWDLD SEVREQAADL GIAMARARTP NADRRYARLA
LDLVDELRGD RDPLRVAGVD PAPGCGYSVD GTTCADSPRC VARITG
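
The gene and protein sequences can be cross-checked against each other and against the listed GC content. The following is a minimal sketch, not the method used to build this record. It assumes translation table 11 uses the standard amino acid assignments and treats an alternative start codon such as GTG as methionine when it is the first codon. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the sequence is expected to contain only A, C, G and T.

```python
# Amino acid assignments in NCBI order (bases T, C, A, G). Table 11 is assumed
# to share these with the standard code and to differ only in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons listed for table 11; any of them in first position is read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group bases into blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "GTGCTTTTTG ACGCGCTGCT GGTGCTGTCC"
print(translate(first_blocks))  # MLFDALLVLS
```

The translated prefix matches the first ten residues of the protein sequence above. Passing the complete gene sequence to `gc_fraction` and `translate` would allow the same comparison against the full protein and the listed GC content.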