Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4320 |
Symbol | hemC |
ID | 5587255 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | - |
Start bp | 4310109 |
End bp | 4311071 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640927937 |
Product | porphobilinogen deaminase |
Protein accession | YP_001465286 |
Protein GI | 157156845 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0181] Porphobilinogen deaminase |
TIGRFAM ID | [TIGR00212] porphobilinogen deaminase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.365174 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGC CCACTTGCAC TCTGGCAGGC ACACTATGTC AAAGACAAGT TGATGGCGAG CCATCCGGGC CTGGTGGTTG AACTGGTACC GATGGTGACG CGCGGCGATG TGATTCTTGA TACGCCGCTG GCGAAAGTAG GCGGAAAAGG CTTATTTGTA AAAGAGCTGG AAGTCGCGCT CCTGGAGAAT CGCGCCGATA TCGCCGTACA TTCAATGAAA GATGTGCCGG TTGAATTCCC GCAAGGTCTG GGGTTGGTCA CTATTTGTGA GCGTGAAGAT CCTCGCGATG CCTTTGTGTC CAATAATTAT GACAGTCTGG ATGCGTTACC GGCAGGCAGT ATCGTCGGGA CGTCCAGTTT ACGTCGCCAG TGCCAACTGG CTGAACGCCG CCCGGATCTG ATTATCCGCT CCCTGCGCGG CAACGTCGGC ACTCGCCTGA GCAAACTAGA TAACGGCGAA TACGATGCCA TCATTCTTGC CGTAGCCGGA CTAAAACGTT TAGGTCTGGA GTCCCGCATT CGCGCCGCGT TACCACCCGA GATTTCTCTT CCGGCGGTAG GACAAGGTGC AGTCGGTATT GAATGCCGCC TTGATGATAC GCGCACTCGC GAGCTGCTTG CCGCGCTGAA TCACCACGAA ACTGCACTGC GCGTTACCGC AGAACGCGCC ATGAATACCC GTCTCGAAGG TGGATGTCAG GTGCCAATTG GTAGCTACGC CGAGCTTATT GATGGTGAAA TCTGGCTGCG TGCGCTGGTC GGCGCGCCGG ACGGTTCGCA GATTATTCGC GGTGAACGCC GCGGTGCGCC GCAAAATGCC GAACAAATGG GGATTTCGCT GGCAGAAGAG CTACTGAATA ATGGCGCGCG CGAGATCCTC GCTGAAGTCT ATAACGGAGA CGCCCCGGCA TGA
|
Protein sequence | MIMTVTSMLD NVLRIATRQS PLALWQAHYV KDKLMASHPG LVVELVPMVT RGDVILDTPL AKVGGKGLFV KELEVALLEN RADIAVHSMK DVPVEFPQGL GLVTICERED PRDAFVSNNY DSLDALPAGS IVGTSSLRRQ CQLAERRPDL IIRSLRGNVG TRLSKLDNGE YDAIILAVAG LKRLGLESRI RAALPPEISL PAVGQGAVGI ECRLDDTRTR ELLAALNHHE TALRVTAERA MNTRLEGGCQ VPIGSYAELI DGEIWLRALV GAPDGSQIIR GERRGAPQNA EQMGISLAEE LLNNGAREIL AEVYNGDAPA
|
| |