Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_5017 |
Symbol | hemH |
ID | 5902479 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 5420115 |
End bp | 5421161 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641565538 |
Product | ferrochelatase |
Protein accession | YP_001686635 |
Protein GI | 167648972 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0276] Protoheme ferro-lyase (ferrochelatase) |
TIGRFAM ID | [TIGR00109] ferrochelatase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.400702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACTAAGC GCAAGATCGC CGTCGTCCTG TTCAATCTGG GGGGACCTGA TGGGCCGGAC GCGGTGCGGC CGTTCCTGTT CAACCTGTTC CGCGACCCGG CGATCATCGG GGTTCCGGCT CTGTTGCGCT ATCCGCTGGC TGCCCTGATC GCGGGCACGC GCGCCAAGCT GGCCAAGGAG AACTACGCCC TGATGGGCGG GGGTTCGCCC CTGTTGCCGG AGACGCGGGA GCAGGCGAAG GCGCTCGAAG CTGATCTCGC CGCCCGTTTT CCAGACGCCG AGACCCGCTG CTTCATCGCC ATGCGCTACT GGAAGCCGCT GACGAACGAG ACCGCCAAAG CCGTGAAAGC TTTTGCTCCG GATGAAGTCG TGCTGCTGCC GCTCTATCCG CAGTTCTCGA CCACCACGAC GGGGTCGTCG CTGAAGGCCT GGAGCCGCGC CTATCGCAAG GGGTCCGGCC GGATCTCGAC GGTCTGCTGC TATCCGGTGG ACGAGGATCT GGTTCAGGCT CATGCCGACC TGATCAAGGC GGCCTACGAC AAGGCCGGTC GTCCCGGCCC GGCGCGTCTG CTGTTCTCGG CCCATGGCCT GCCTGAAAAG ATCATCGAGG CCGGCGATCC CTACCAGCAA CAGATTGAGG CGACGGCCGC GGCGGTCGCC GCCAGGCTGG GCGGGGGCTG GGACTGGCGG GTGACCTACC AGAGCCGGGT CGGACCCATG AAGTGGATCG GACCCTCGAC CGAGGAAGAG ATCAAGTCGG CCAGCGAGCA GGGTCTGGCC CTGGTTGTCA CGCCGATCGC CTTCGTCTCC GAGCACATCG AGACCTTGGT CGAGCTGGAT CATGAATATC GCGAAGTGGC GCTGAAGGCG GGCTGCCCGG CCTATGTTCG CGTCCAAGCC TTGGGCGTCG CGCCCGGATT CATTCGTGGC CTCGGCCGCG CCATCCAGGG CGTGCTCGCT GCGCCTGATC GTGTGGTCAC AGCCTGCGCC TGGCGCTGCG GCGGCGATCG CACCCAATGT CCGAACAAGC AGGGAGCACG CGCGTGA
|
Protein sequence | MTKRKIAVVL FNLGGPDGPD AVRPFLFNLF RDPAIIGVPA LLRYPLAALI AGTRAKLAKE NYALMGGGSP LLPETREQAK ALEADLAARF PDAETRCFIA MRYWKPLTNE TAKAVKAFAP DEVVLLPLYP QFSTTTTGSS LKAWSRAYRK GSGRISTVCC YPVDEDLVQA HADLIKAAYD KAGRPGPARL LFSAHGLPEK IIEAGDPYQQ QIEATAAAVA ARLGGGWDWR VTYQSRVGPM KWIGPSTEEE IKSASEQGLA LVVTPIAFVS EHIETLVELD HEYREVALKA GCPAYVRVQA LGVAPGFIRG LGRAIQGVLA APDRVVTACA WRCGGDRTQC PNKQGARA
|
| |