Gene TM1040_2900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2900 
SymbolhemH 
ID4078578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3069070 
End bp3070134 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content57% 
IMG OID638008229 
Productferrochelatase 
Protein accessionYP_614894 
Protein GI99082740 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.724237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGATG CGACCTCAAC TGCGACCCGC CCTGACAATG CCCCTGCGGA TCATCCGCCG 
GTCAAGGCGG AAAAGGTCGG GATCCTGCTC GCAAACCTCG GCACCCCGGA TCACTACAGC
TATTGGCCGA TGCGGCGCTA TCTGAACGAG TTTCTCTCGG ACAAACGTGT GATCGACTAC
CCGTCCTGGA AATGGCAGCC GCTCCTGCAG CTGATCATCC TGACCAAACG CCCCTTTGCC
TCTGGCGAAG CGTACAAGTC GATCTGGAAC CATGAGCGGG GCGAAAGCCC ATTGATGACG
ATCACCAAGG ATCAGACCAA CGCCATGGCC AAGGCGATGG AAGAGCTCTA TGGCGATCAG
GTCATGGTCG ATTTCTGCAT GCGCTACGGC AATCCCTCCA CCAAATCCAA GGTAGAGAAG
ATGATTGCCG CTGGCTGCCG CAAGATCCTC TTTGTTCCGC TTTATCCGCA CTATGCGGGG
GCGACCTCTG CAACTGCAAA TGATCAGTTC TTCCGTGTGC TGATGGAGCA GCCCTGGCAA
CCCGCCGTAC GTACGATCGA GCCCTACTTC GACCAACCCG AATACATTGA TGCGCTCGCC
AGATCCGTGG AAGACGCCTA TGCCAAACTG GACAAGACCC CGGATATCCT GGTCTGTTCC
TATCATGGCA TGCCAAAGCG CTACCTGATG CAGGGCGATC CCTATCACTG CCAGTGCCAA
AAGACGACGC GCCTGCTGCG CGAGCGCCTG GGTTGGGACG AATCGAAGAT CATGACCACG
TTCCAGTCTG TCTTTGGTCC AGAGGAATGG CTGCGCCCCT ACACGGTTGA GCATGTCGCC
GAACTGGCGA AACAGGGCAA GAAGAACATC GCCGTGATCG CTCCGGCCTT CTCGGCGGAT
TGCATCGAGA CTCTGGAGGA GATCAATGAG GAGATTTTCG AGAGTTTTGA ACACGCGGGC
GGCGAAGAAT TCACCTACAT TCCTTGCCTG AACGACAGCG AAGCGCATAT TGCCGCGCTT
TCAAGCGTGA TCCGCAACAA CCTCAAAGGA TGGCTTGAGG CGTAA
 
Protein sequence
MLDATSTATR PDNAPADHPP VKAEKVGILL ANLGTPDHYS YWPMRRYLNE FLSDKRVIDY 
PSWKWQPLLQ LIILTKRPFA SGEAYKSIWN HERGESPLMT ITKDQTNAMA KAMEELYGDQ
VMVDFCMRYG NPSTKSKVEK MIAAGCRKIL FVPLYPHYAG ATSATANDQF FRVLMEQPWQ
PAVRTIEPYF DQPEYIDALA RSVEDAYAKL DKTPDILVCS YHGMPKRYLM QGDPYHCQCQ
KTTRLLRERL GWDESKIMTT FQSVFGPEEW LRPYTVEHVA ELAKQGKKNI AVIAPAFSAD
CIETLEEINE EIFESFEHAG GEEFTYIPCL NDSEAHIAAL SSVIRNNLKG WLEA