Gene Information |
Locus tag | EcHS_A4025 |
Symbol | hemC |
ID | 5592075 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4016663 |
End bp | 4017625 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640923130 |
Product | porphobilinogen deaminase |
Protein accession | YP_001460596 |
Protein GI | 157163278 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
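
The length fields in this record follow from the coordinates. A minimal sketch of that arithmetic is given below, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon (the gene sequence in this record ends in TGA). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above (Start bp, End bp).
length = gene_length(4016663, 4017625)
print(length, protein_length(length))  # 963 320
```

The strand field does not affect this calculation; it only determines which strand of the replicon the coding sequence is read from.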
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 0.105851 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGC CCACTTGCAC TCTGGCAGGC ACACTATGTC AAAGACAAGT TGATGGCGAG CCATCCGGGC CTGGTCGTTG AACTGGTACC GATGGTGACG CGCGGCGATG TGATTCTTGA TACGCCGCTG GCGAAAGTAG GCGGAAAAGG CTTATTTGTA AAAGAGCTGG AAGTCGCGCT CCTCGAAAAT CGCGCCGATA TCGCCGTACA TTCAATGAAA GATGTGCCGG TTGAATTCCC GCAAGGTCTG GGACTGGTCA CTATTTGTGA GCGTGAAGAT CCTCGCGATG CCTTTGTGTC CAATAACTAT GACAGTCTGG ATGCGTTACC GGCAGGCAGT ATCGTCGGGA CGTCCAGTTT ACGTCGCCAG TGCCAACTGG CTGAACGCCG TCCGGATCTG ATTATCCGCT CCCTGCGCGG CAACGTCGGC ACTCGCCTGA GCAAACTGGA TAACGGCGAA TACGATGCCA TCATTCTTGC CGTAGCCGGA CTAAAACGTT TAGGTCTGGA GTCACGTATT CGCGCCGCGT TGCCACCCGA GATTTCTCTT CCGGCGGTAG GACAAGGTGC GGTGGGTATT GAATGCCGCC TTGATGATTC ACGCACTCGC GAGCTGCTTG CCGCGCTGAA TCACCACGAA ACTGCACTGC GCGTTACCGC AGAACGCGCC ATGAATACCC GTCTCGAAGG CGGATGTCAG GTGCCAATTG GTAGCTACGC CGAGCTTATT GATGGCGAAA TCTGGCTGCG TGCGCTGGTC GGCGCGCCGG ACGGTTCGCA GATTATTCGC GGTGAACGCC GCGGTGCGCC GCAAGATGCC GAACAAATGG GGATTTCGCT GGCAGAAGAG CTACTGAATA ACGGCGCGCG CGAGATCCTC GCTGAAGTCT ATAACGGAGA CGCCCCGGCA TGA
|
Protein sequence | MIMTVTSMLD NVLRIATRQS PLALWQAHYV KDKLMASHPG LVVELVPMVT RGDVILDTPL AKVGGKGLFV KELEVALLEN RADIAVHSMK DVPVEFPQGL GLVTICERED PRDAFVSNNY DSLDALPAGS IVGTSSLRRQ CQLAERRPDL IIRSLRGNVG TRLSKLDNGE YDAIILAVAG LKRLGLESRI RAALPPEISL PAVGQGAVGI ECRLDDSRTR ELLAALNHHE TALRVTAERA MNTRLEGGCQ VPIGSYAELI DGEIWLRALV GAPDGSQIIR GERRGAPQDA EQMGISLAEE LLNNGAREIL AEVYNGDAPA
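
The relationship between the two sequence fields can be checked with a short script. The sketch below strips the 10-character grouping, computes the GC fraction, and translates the coding sequence using the amino acid assignments of NCBI translation table 11. It assumes the gene sequence is already given on the coding strand (it starts with ATG and ends with TGA), and it does not handle alternative start codons. All function names are illustrative.

```python
# Amino acid assignments for NCBI translation table 11, in the standard
# NCBI codon order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group residues and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon.

    Codons containing characters other than T, C, A, G become 'X'.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
first_bases = "ATGATAATGA CGGTAACAAG CATGTTAGAC"
print(translate(first_bases))  # MIMTVTSMLD
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick consistency check for the record; `gc_content` on the same input can be compared against the GC content field.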