Gene EcHS_A4025 details


Gene Information

Locus tag: EcHS_A4025
Symbol: hemC
ID: 5592075
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 4016663
End bp: 4017625
Gene Length: 963 bp
Protein Length: 320 aa
Translation table: 11
GC content: 56%
IMG OID: 640923130
Product: porphobilinogen deaminase
Protein accession: YP_001460596
Protein GI: 157163278
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0181] Porphobilinogen deaminase
TIGRFAM ID: [TIGR00212] porphobilinogen deaminase
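
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(4016663, 4017625)
print(length, protein_length_aa(length))
```

With the values from this record, the sketch reproduces the listed gene length (963 bp) and protein length (320 aa).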


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value: 0.105851
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAATGA CGGTAACAAG CATGTTAGAC AATGTTTTAA GAATTGCCAC ACGCCAAAGC 
CCACTTGCAC TCTGGCAGGC ACACTATGTC AAAGACAAGT TGATGGCGAG CCATCCGGGC
CTGGTCGTTG AACTGGTACC GATGGTGACG CGCGGCGATG TGATTCTTGA TACGCCGCTG
GCGAAAGTAG GCGGAAAAGG CTTATTTGTA AAAGAGCTGG AAGTCGCGCT CCTCGAAAAT
CGCGCCGATA TCGCCGTACA TTCAATGAAA GATGTGCCGG TTGAATTCCC GCAAGGTCTG
GGACTGGTCA CTATTTGTGA GCGTGAAGAT CCTCGCGATG CCTTTGTGTC CAATAACTAT
GACAGTCTGG ATGCGTTACC GGCAGGCAGT ATCGTCGGGA CGTCCAGTTT ACGTCGCCAG
TGCCAACTGG CTGAACGCCG TCCGGATCTG ATTATCCGCT CCCTGCGCGG CAACGTCGGC
ACTCGCCTGA GCAAACTGGA TAACGGCGAA TACGATGCCA TCATTCTTGC CGTAGCCGGA
CTAAAACGTT TAGGTCTGGA GTCACGTATT CGCGCCGCGT TGCCACCCGA GATTTCTCTT
CCGGCGGTAG GACAAGGTGC GGTGGGTATT GAATGCCGCC TTGATGATTC ACGCACTCGC
GAGCTGCTTG CCGCGCTGAA TCACCACGAA ACTGCACTGC GCGTTACCGC AGAACGCGCC
ATGAATACCC GTCTCGAAGG CGGATGTCAG GTGCCAATTG GTAGCTACGC CGAGCTTATT
GATGGCGAAA TCTGGCTGCG TGCGCTGGTC GGCGCGCCGG ACGGTTCGCA GATTATTCGC
GGTGAACGCC GCGGTGCGCC GCAAGATGCC GAACAAATGG GGATTTCGCT GGCAGAAGAG
CTACTGAATA ACGGCGCGCG CGAGATCCTC GCTGAAGTCT ATAACGGAGA CGCCCCGGCA
TGA
 
Protein sequence
MIMTVTSMLD NVLRIATRQS PLALWQAHYV KDKLMASHPG LVVELVPMVT RGDVILDTPL 
AKVGGKGLFV KELEVALLEN RADIAVHSMK DVPVEFPQGL GLVTICERED PRDAFVSNNY
DSLDALPAGS IVGTSSLRRQ CQLAERRPDL IIRSLRGNVG TRLSKLDNGE YDAIILAVAG
LKRLGLESRI RAALPPEISL PAVGQGAVGI ECRLDDSRTR ELLAALNHHE TALRVTAERA
MNTRLEGGCQ VPIGSYAELI DGEIWLRALV GAPDGSQIIR GERRGAPQDA EQMGISLAEE
LLNNGAREIL AEVYNGDAPA
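
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it ignores the alternative start codons that table 11 allows, since this gene begins with ATG. The `translate` helper is hypothetical and written only for this illustration.

```python
from itertools import product

# Codon order follows the NCBI genetic code listing: bases T, C, A, G,
# with the first codon position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace is removed first, so the 10-base blocks used in the record
    can be pasted in directly.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, copied from the record.
print(translate("ATGATAATGA CGGTAACAAG CATGTTAGAC"))  # MIMTVTSMLD
```

The printed residues match the first block of the protein sequence listed above. Passing the full gene sequence to the same function is a quick way to compare the result against the 320-residue protein.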