Gene Information |
Locus tag | EcSMS35_4170 |
Symbol | hemC |
ID | 6142678 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4269468 |
End bp | 4270424 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641618993 |
Product | porphobilinogen deaminase |
Protein accession | YP_001746121 |
Protein GI | 170680193 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
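The coordinate and length fields above agree with each other, and a short sketch can check this. It assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon. Neither assumption is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids, assuming the CDS ends in one stop codon that is not counted."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4269468, 4270424)
print(length)                  # 957
print(protein_length(length))  # 318
```

Both values match the Gene Length (957 bp) and Protein Length (318 aa) fields.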
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.473943 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGTAA CAAGCATGTT AGACAATGTT TTAAGAATTG CCACACGCCA AAGCCCACTT GCACTCTGGC AGGCACACTA TGTCAAAGAC AAGTTGATGG CGAGCCATCC GGGCCTGGTC GTTGAACTGG TACCGATGGT GACGCGCGGC GATGTGATTC TTGATACGCC GCTGGCGAAA GTAGGCGGAA AAGGCTTATT TGTTAAAGAG CTGGAAGTCG CGCTCCTGGA GAATCGCGCC GATATCGCCG TACATTCAAT GAAAGATGTG CCGGTTGAAT TCCCGCAAGG TCTGGGGCTG GTCACCATTT GCGAGCGTGA AGATCCTCGC GATGCCTTTG TGTCCAATAA CTATGAAAGT CTGGATGCGT TACCGGCAGG CAGTATCGTC GGGACGTCCA GTTTACGTCG CCAGTGCCAA CTGGCTGAAC GCCGCCCGGA TCTGATTATC CGCTCCCTGC GCGGCAACGT CGGCACTCGC CTGAGTAAAC TGGATAACGG CGAATACGAT GCCATCATTC TTGCCGTAGC CGGACTAAAA CGTTTAGGTC TGGAGTCCCG CATTCGCGCC GCATTGCCAC CCGAGATTTC TCTTCCGGCG GTAGGACAAG GTGCGGTGGG TATTGAATGC CGCCTTGATG ATTCTCGCAC TCGCGAGCTG CTTGCCGCGC TGAATCACCA CGAAACTGCA CTGCGCGTTA CCGCAGAACG CGCCATGAAT ACCCGTCTCG AAGGCGGATG TCAGGTGCCA ATTGGTAGCT ACGCCGAGCT TATTGATGGC GAAATCTGGC TGCGTGCGCT GGTCGGCGCG CCGGACGGTT CGCAGATTAT TCGCGGTGAA CGCCGCGGTG CGCCGCAAGA TGCCGAACAA ATGGGGATTT CGCTGGCAGA AGAGCTACTG AATAACGGCG CGCGCGAGAT CCTCGCTGAA GTCTATAACG GAGACGCCCC GGCATGA
|
Protein sequence | MTVTSMLDNV LRIATRQSPL ALWQAHYVKD KLMASHPGLV VELVPMVTRG DVILDTPLAK VGGKGLFVKE LEVALLENRA DIAVHSMKDV PVEFPQGLGL VTICEREDPR DAFVSNNYES LDALPAGSIV GTSSLRRQCQ LAERRPDLII RSLRGNVGTR LSKLDNGEYD AIILAVAGLK RLGLESRIRA ALPPEISLPA VGQGAVGIEC RLDDSRTREL LAALNHHETA LRVTAERAMN TRLEGGCQVP IGSYAELIDG EIWLRALVGA PDGSQIIRGE RRGAPQDAEQ MGISLAEELL NNGAREILAE VYNGDAPA
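The sketch below shows one way to relate the two sequence fields: it strips the spacing between 10-letter blocks, translates the gene sequence codon by codon, and computes a GC percentage. The helper names are hypothetical. It assumes that the gene sequence is listed in the coding orientation (it begins with ATG even though the gene lies on the minus strand). The amino-acid assignments of translation table 11 are the same as in the standard code; the two differ in which codons may act as start codons, which this sketch does not model.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# codon-to-amino-acid mapping (alternative start codons are not handled here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three blocks of the gene sequence listed above.
print(translate("ATGACGGTAA CAAGCATGTT AGACAATGTT"))  # MTVTSMLDNV
```

Passing the full Gene sequence field to `translate` and `gc_percent` would allow a comparison against the Protein sequence and GC content fields.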