Gene Tbd_2033 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTbd_2033 
SymbolhemH 
ID3672204 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThiobacillus denitrificans ATCC 25259 
KingdomBacteria 
Replicon accessionNC_007404 
Strand
Start bp2124162 
End bp2125265 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content68% 
IMG OID637710735 
Productferrochelatase 
Protein accessionYP_315791 
Protein GI74318051 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.814285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATACG CCGAAGAGCC CGAATACGCG CACGGAACGC CGCTGTGCGC AGGCGTCCTG 
CTGGTCAATC TGGGCACCCC CGAGGCGCCG ACGGCCGCTG CCCTGCGGCC TTATCTCAAG
GAATTCCTGT CCGACCCGCG CGTGGTCGAA ATCCCGCGCG CGCTCTGGTG GCTCGTGCTG
AACGGCATCA TTCTCAACAC CCGGCCGCGC AAGTCGGCCG AAAAATACGC CGCGATCTGG
ACCAGCGAGG GGTCACCGCT GAAGGTCCAC ACCGAAAAGC AGGCGAAACT CCTCAAGGGC
TGGCTCGGTG AGCACGCCGC CGCGCCGGTC GTCGTCGACT ACGCGATGCG CTACGGCCGG
CCGGGGATCC CGGAGGTTCT CGCACGAATG AAGGCCGCCG GCTGCGACCG TATCCTCGTA
CTGCCGGCCT ATCCGCAATA CGCCGCGTCG AGCACGGCGA CCGCGTTCGA CGCGGCGTTC
GACTGGCTGC GCAGAACGCG CAATCAGCCG GCACTGCGCA CGCTCAAGCA CTACCACGAC
CACCCCGAGT ACATCCGCGC ACTCGCCGCC AACCTGCGCG ACTACTGGCA GATGCACGGC
CGCCCCGACG TCCTCGTCAT GAGCTTCCAC GGCGTGCCGC GCTACACGCT CGACAAGGGC
GACCCCTATC ACTGCGAATG CCAGAAGACG GCGCGCCTGC TCGCCGCCGC ACTCGGCCTC
GAGCCGGGTC AGTTCCGCGT GACCTTCCAG TCGCGCTTCG GCCGGGCCGA ATGGCTCAAA
CCCTATACCG ACAAGACGCT CGAAGCGCTC GGCCGCGAGG GCGTCGGACG GGTCGACGTC
GTTGCGCCGG GTTTCACGGC CGATTGCCTG GAGACGCTCG AGGAACTCGC GATGGAGGGA
CGCGCGAGCT TTCTCGCCGC CGGCGGCAAG GAATTCCACT ACGTCCCCGC GCTCAACGAG
CACCCGCAAT GGATCGCCGC ACTCGGCAGG ATCGCGCTCG CCAACCTCGC GGGCTGGCTC
GACGAGGGCT GGACACCCGA CGCCGACGAG GCGTCGCGTC AGCTCAGCAG AAGCCGCGCG
CTCGCGCTGG GCGCGCAACG CTGA
 
Protein sequence
MKYAEEPEYA HGTPLCAGVL LVNLGTPEAP TAAALRPYLK EFLSDPRVVE IPRALWWLVL 
NGIILNTRPR KSAEKYAAIW TSEGSPLKVH TEKQAKLLKG WLGEHAAAPV VVDYAMRYGR
PGIPEVLARM KAAGCDRILV LPAYPQYAAS STATAFDAAF DWLRRTRNQP ALRTLKHYHD
HPEYIRALAA NLRDYWQMHG RPDVLVMSFH GVPRYTLDKG DPYHCECQKT ARLLAAALGL
EPGQFRVTFQ SRFGRAEWLK PYTDKTLEAL GREGVGRVDV VAPGFTADCL ETLEELAMEG
RASFLAAGGK EFHYVPALNE HPQWIAALGR IALANLAGWL DEGWTPDADE ASRQLSRSRA
LALGAQR