Gene Caul_5017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5017 
SymbolhemH 
ID5902479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp5420115 
End bp5421161 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content66% 
IMG OID641565538 
Productferrochelatase 
Protein accessionYP_001686635 
Protein GI167648972 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0276] Protoheme ferro-lyase (ferrochelatase) 
TIGRFAM ID[TIGR00109] ferrochelatase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.400702 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTAAGC GCAAGATCGC CGTCGTCCTG TTCAATCTGG GGGGACCTGA TGGGCCGGAC 
GCGGTGCGGC CGTTCCTGTT CAACCTGTTC CGCGACCCGG CGATCATCGG GGTTCCGGCT
CTGTTGCGCT ATCCGCTGGC TGCCCTGATC GCGGGCACGC GCGCCAAGCT GGCCAAGGAG
AACTACGCCC TGATGGGCGG GGGTTCGCCC CTGTTGCCGG AGACGCGGGA GCAGGCGAAG
GCGCTCGAAG CTGATCTCGC CGCCCGTTTT CCAGACGCCG AGACCCGCTG CTTCATCGCC
ATGCGCTACT GGAAGCCGCT GACGAACGAG ACCGCCAAAG CCGTGAAAGC TTTTGCTCCG
GATGAAGTCG TGCTGCTGCC GCTCTATCCG CAGTTCTCGA CCACCACGAC GGGGTCGTCG
CTGAAGGCCT GGAGCCGCGC CTATCGCAAG GGGTCCGGCC GGATCTCGAC GGTCTGCTGC
TATCCGGTGG ACGAGGATCT GGTTCAGGCT CATGCCGACC TGATCAAGGC GGCCTACGAC
AAGGCCGGTC GTCCCGGCCC GGCGCGTCTG CTGTTCTCGG CCCATGGCCT GCCTGAAAAG
ATCATCGAGG CCGGCGATCC CTACCAGCAA CAGATTGAGG CGACGGCCGC GGCGGTCGCC
GCCAGGCTGG GCGGGGGCTG GGACTGGCGG GTGACCTACC AGAGCCGGGT CGGACCCATG
AAGTGGATCG GACCCTCGAC CGAGGAAGAG ATCAAGTCGG CCAGCGAGCA GGGTCTGGCC
CTGGTTGTCA CGCCGATCGC CTTCGTCTCC GAGCACATCG AGACCTTGGT CGAGCTGGAT
CATGAATATC GCGAAGTGGC GCTGAAGGCG GGCTGCCCGG CCTATGTTCG CGTCCAAGCC
TTGGGCGTCG CGCCCGGATT CATTCGTGGC CTCGGCCGCG CCATCCAGGG CGTGCTCGCT
GCGCCTGATC GTGTGGTCAC AGCCTGCGCC TGGCGCTGCG GCGGCGATCG CACCCAATGT
CCGAACAAGC AGGGAGCACG CGCGTGA
 
Protein sequence
MTKRKIAVVL FNLGGPDGPD AVRPFLFNLF RDPAIIGVPA LLRYPLAALI AGTRAKLAKE 
NYALMGGGSP LLPETREQAK ALEADLAARF PDAETRCFIA MRYWKPLTNE TAKAVKAFAP
DEVVLLPLYP QFSTTTTGSS LKAWSRAYRK GSGRISTVCC YPVDEDLVQA HADLIKAAYD
KAGRPGPARL LFSAHGLPEK IIEAGDPYQQ QIEATAAAVA ARLGGGWDWR VTYQSRVGPM
KWIGPSTEEE IKSASEQGLA LVVTPIAFVS EHIETLVELD HEYREVALKA GCPAYVRVQA
LGVAPGFIRG LGRAIQGVLA APDRVVTACA WRCGGDRTQC PNKQGARA