Gene Information |
Locus tag | ECH74115_5243 |
Symbol | hemC |
ID | 6971305 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4886598 |
End bp | 4887554 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388907 |
Product | porphobilinogen deaminase |
Protein accession | YP_002273321 |
Protein GI | 209396873 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
|
|
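The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon; the variable names are illustrative and are not fields of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 4886598
end_bp = 4887554

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in one stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # listed as 957 bp
print(protein_length, "aa")  # listed as 318 aa
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.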
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 65 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTAA CAAGCATGTT AGACAATGTT TTAAGAATTG CCACACGCCA AAGCCCACTT GCACTCTGGC AGGCACACTA TGTCAAAGAC AAGTTGATGG CGAGCCATCC GGGCCTGGTC GTTGAACTGG TACCGATGGT GACGCGCGGC GATGTGATTC TTGATACGCC GCTGGCGAAA GTCGGCGGAA AAGGCTTATT TGTAAAAGAG CTGGAAGTCG CGCTCCTCGA AAATCGCGCC GATATCGCCG TACATTCAAT GAAAGATGTG CCGGTTGAAT TCCCGCAAGG TCTGGGCCTG GTCACTATTT GTGAGCGTGA AGATCCTCGC GATGCCTTTG TGTCCAATAA CTATGACAGT CTGGATGCGT TACCGGCAGG CAGTATCGTC GGGACGTCCA GTTTACGTCG CCAGTGCCAA CTGGCTGAAC GCCGCCCGGA TCTGATTATC CGCTCCCTGC GCGGCAACGT CGGCACTCGC CTGAGCAAAC TGGATAACGG CGAATACGAT GCCATCATTC TTGCCGTAGC CGGACTAAAA CGTTTAGGTC TGGAGTCCCG CATTCGCGCC GCGTTGCCAC CCGAGATTTC TCTTCCGGCG GTAGGACAAG GTGCAGTCGG TATTGAATGC CGCCTTGATG ATACACGCAC TCGCGAGCTG CTTGCCGCGC TGAATCACCA CGAAACTGCA CTGCGCGTTA CCGCAGAACG CGCCATGAAT ACCCGTCTCG AAGGTGGATG TCAGGTGCCA ATTGGTAGCT ACGCCGAGCT TATTGATGGC GAAATCTGGC TGCGTGCGCT GGTCGGCGCG CCGGACGGTT CGCAGATTAT TCGCGGTGAA CGCCGCGGTG CGCCGCAAGA TGCCGAACAA ATGGGGATTT CGCTGGCAGA AGAGCTACTG AATAACGGCG CGCGCGAGAT CCTCGCTGAA GTCTATAACG GAGACGCCCC GGCATGA
|
Protein sequence | MTVTSMLDNV LRIATRQSPL ALWQAHYVKD KLMASHPGLV VELVPMVTRG DVILDTPLAK VGGKGLFVKE LEVALLENRA DIAVHSMKDV PVEFPQGLGL VTICEREDPR DAFVSNNYDS LDALPAGSIV GTSSLRRQCQ LAERRPDLII RSLRGNVGTR LSKLDNGEYD AIILAVAGLK RLGLESRIRA ALPPEISLPA VGQGAVGIEC RLDDTRTREL LAALNHHETA LRVTAERAMN TRLEGGCQVP IGSYAELIDG EIWLRALVGA PDGSQIIRGE RRGAPQDAEQ MGISLAEELL NNGAREILAE VYNGDAPA
|
| |
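As a rough way to check the sequences above against each other, the following minimal sketch removes the spacing from a sequence string, computes GC content, and translates DNA to protein. The helper names (clean, gc_percent, translate) are hypothetical. The codon table holds the amino-acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits). Only the first 20 bases of the gene sequence are used here for brevity.

```python
# Build the codon table in TCAG order (standard amino-acid assignments,
# which are the same in translation table 11).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def clean(seq: str) -> str:
    """Drop the whitespace used to group bases in tens and uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    protein = []
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    for i in range(0, usable, 3):
        amino = CODON_TABLE[seq[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First 20 bases of the gene sequence listed above.
prefix = clean("ATGACGGTAA CAAGCATGTT")
print(translate(prefix))  # MTVTSM, the start of the listed protein sequence
```

Passing the full gene sequence through clean and then gc_percent or translate gives values that can be compared with the listed GC content and protein sequence. The sequence is given as the coding strand (it begins with ATG), so no reverse complement is needed even though the gene lies on the minus strand.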