Gene EcolC_4202 details


Gene Information

Locus tag: EcolC_4202
Symbol: hemC
ID: 6067671
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4640754
End bp: 4641716
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 56%
IMG OID: 641603630
Product: porphobilinogen deaminase
Protein accession: YP_001727126
Protein GI: 170022172
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
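
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of the source database.

```python
# Values copied from the gene record above.
START_BP = 4640754
END_BP = 4641716


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 963
print(protein_length(length))  # 320
```

Both results agree with the Gene Length (963 bp) and Protein Length (320 aa) fields of the record.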


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0530592
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGC 
CCACTTGCAC TCTGGCAGGC ACACTATGTC AAAGACAAGT TGATGGCGAG CCATCCGGGC
CTGGTCGTTG AACTGGTACC GATGGTGACG CGCGGCGATG TGATTCTTGA TACGCCGCTG
GCGAAAGTAG GCGGAAAAGG CTTATTTGTA AAAGAGCTGG AAGTCGCGCT CCTCGAAAAT
CGCGCCGATA TCGCCGTACA TTCAATGAAA GATGTGCCGG TTGAATTCCC GCAAGGTCTG
GGACTGGTCA CTATTTGTGA GCGTGAAGAT CCTCGCGATG CCTTTGTGTC CAATAACTAT
GACAGTCTGG ATGCGTTACC GGCAGGCAGT ATCGTCGGGA CGTCCAGTTT ACGTCGCCAG
TGCCAACTGG CTGAACGCCG TCCGGATCTG ATTATCCGCT CCCTGCGCGG CAACGTCGGC
ACTCGCCTGA GCAAACTGGA TAACGGCGAA TACGATGCCA TCATTCTTGC CGTAGCCGGA
CTAAAACGTT TAGGTCTGGA GTCACGTATT CGCGCCGCGT TGCCACCCGA GATTTCTCTT
CCGGCGGTAG GACAAGGTGC GGTGGGTATT GAATGCCGCC TTGATGATTC ACGCACTCGC
GAGCTGCTTG CCGCGCTGAA TCACCACGAA ACTGCACTGC GCGTTACCGC AGAACGCGCC
ATGAATACCC GTCTCGAAGG CGGATGTCAG GTGCCAATTG GTAGCTACGC CGAGCTTATT
GATGGCGAAA TCTGGCTGCG TGCGCTGGTC GGCGCGCCGG ACGGTTCGCA GATTATTCGC
GGTGAACGCC GCGGTGCGCC GCAAGATGCC GAACAAATGG GGATTTCGCT GGCAGAAGAG
CTACTGAATA ACGGCGCGCG CGAGATCCTC GCTGAAGTCT ATAACGGAGA CGCCCCGGCA
TGA
 
Protein sequence
MIMTVTSMLD NVLRIATRQS PLALWQAHYV KDKLMASHPG LVVELVPMVT RGDVILDTPL 
AKVGGKGLFV KELEVALLEN RADIAVHSMK DVPVEFPQGL GLVTICERED PRDAFVSNNY
DSLDALPAGS IVGTSSLRRQ CQLAERRPDL IIRSLRGNVG TRLSKLDNGE YDAIILAVAG
LKRLGLESRI RAALPPEISL PAVGQGAVGI ECRLDDSRTR ELLAALNHHE TALRVTAERA
MNTRLEGGCQ VPIGSYAELI DGEIWLRALV GAPDGSQIIR GERRGAPQDA EQMGISLAEE
LLNNGAREIL AEVYNGDAPA
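
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in plain Python, not the method used by the source database. It builds the codon table from the standard NCBI amino-acid string, whose codon assignments translation table 11 shares; table 11 differs only in the start codons it permits, which this sketch does not model (the gene here starts with ATG, so that does not matter). The `translate` helper and the constant names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G); shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string (whitespace ignored) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence above (60 nt, i.e. 20 codons).
first_line = "ATGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGC"
print(translate(first_line))  # MIMTVTSMLDNVLRIATRQS
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way allows the whole protein to be compared.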