Gene Gdia_2030 details

Gene Information

Locus tag: Gdia_2030
Symbol: hemH
ID: 6975457
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 2252266
End bp: 2253294
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 69%
IMG OID: 643391560
Product: ferrochelatase
Protein accession: YP_002276405
Protein GI: 209544176
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0276] Protoheme ferro-lyase (ferrochelatase)
TIGRFAM ID: [TIGR00109] ferrochelatase
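
The coordinates and lengths listed above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2252266, 2253294)
print(length)                  # 1029
print(protein_length(length))  # 342
```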


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.273433
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTCC TGACCATTCG CCCGGCTAAG CCGATCGCCC CGTCCCGTAT CGGCGTCCTG 
CTGACCAATC TGGGCACGCC GGAAGGAACC GGCTATGGCG CCATCCGGCG CTATCTTTCC
GAATTCCTGT CCGACCGCCG CATCATCGAG GTCAGTCCCG CCCTGTGGCA GCCGATCCTG
CAGGGACCGC TGCTGGCCCT GCGCCCCAGG CGGACCGGGG CGGCCTATCG GCGCATCTGG
CATACCGAGC GGGACGAGAG CCCGCTGCGC ACCCATACAA GGGCCCAGGC CGAGGCCCTG
GCCGCGCGCA TGGAACCGGA CGGCGTGGCG GTGGAATGGG CCATGCGCTA CGGCACCCCG
TCGATCGCAT CGGGCATCGA ACGGCTGCTG GCCCGGGGCT GCGCGCGGGT GCTGCTGCTG
CCGCTTTATC CGCAATACAG CGCCACGACG ACGGCCACGG CCAACGACCA TGCCTTTCGC
GCGCTGATGC GGCTGCGCAA CCAGCCGGCG GTCCGCACCG CGCCGTCCTT CCCCGACCAT
CCGCTCTATA TCGAAGCCCT GGCCCGGTCG GTGCGCGAGA CGCTGGCCGG CCTGCCCTTC
GTGCCGCAAC GGATCGTGGC GTCGTTCCAT GGCCTGCCGC GCGATTATGT CACGCGCGGC
GACCCGTATC CCGAGGAATG CGAGCGCACG CTGGCGGCGC TGCGCCGGGC GCTGGACATG
GACGAGGAGA CGATGACGCT GACCTATCAG TCGCGCTTCG GCCCCGCCCG ATGGCTGGAA
CCCTATACCG CGCCGCTGGT CGCCGGATTG CCCGCCCGGG GCGTCACGCG TGTCGCCGTC
ATCATGCCGG GCTTCATGGC CGACTGCATC GAGACGCTGG ACGAGATCGG CAACGAGGTC
CGGAAGGACT TCATTGCCGC CGGCGGAACC GATTTCGCGC TGGTTCCCTG CCTGAACGCG
GCGCCGGCCG CCATTGACCT GCTGGAAGGC CTGACGCGCC GGGAACTGGC GGGATGGTTG
AAGGATTGA
 
Protein sequence
MTFLTIRPAK PIAPSRIGVL LTNLGTPEGT GYGAIRRYLS EFLSDRRIIE VSPALWQPIL 
QGPLLALRPR RTGAAYRRIW HTERDESPLR THTRAQAEAL AARMEPDGVA VEWAMRYGTP
SIASGIERLL ARGCARVLLL PLYPQYSATT TATANDHAFR ALMRLRNQPA VRTAPSFPDH
PLYIEALARS VRETLAGLPF VPQRIVASFH GLPRDYVTRG DPYPEECERT LAALRRALDM
DEETMTLTYQ SRFGPARWLE PYTAPLVAGL PARGVTRVAV IMPGFMADCI ETLDEIGNEV
RKDFIAAGGT DFALVPCLNA APAAIDLLEG LTRRELAGWL KD
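
The correspondence between the gene sequence and the protein sequence above can be spot-checked with a minimal sketch. It builds the codon table from the standard compact amino-acid string, which gives the same codon-to-residue assignments as translation table 11 (that table differs only in which codons may act as start codons). The helper names `clean`, `translate`, and `gc_fraction` are illustrative. Only the first and last lines of the gene sequence are used here; pasting the full sequence into `gc_fraction` would allow the listed GC content to be checked as well.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


first_line = "ATGACCTTCC TGACCATTCG CCCGGCTAAG CCGATCGCCC CGTCCCGTAT CGGCGTCCTG"
last_line = "AAGGATTGA"

print(translate(first_line))  # MTFLTIRPAKPIAPSRIGVL
print(translate(last_line))   # KD*
```

The first 60 bases translate to the first 20 residues of the protein sequence, and the final nine bases give the last two residues followed by the TGA stop codon.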