Gene CPR_1420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCPR_1420 
SymbolhemB 
ID4205689 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium perfringens SM101 
KingdomBacteria 
Replicon accessionNC_008262 
Strand
Start bp1598983 
End bp1599948 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content31% 
IMG OID642565974 
Productdelta-aminolevulinic acid dehydratase 
Protein accessionYP_698739 
Protein GI110803332 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0113] Delta-aminolevulinic acid dehydratase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAAGA GAGGAAGAAG ACTTAGAATA AATAAGGAAA TGAGAGATCT TGTTAGAGAA 
AATATTTTAA CAGCAAATGA TTTTATATTT CCTATATTTG TAGCTGAAGG AAATAACATA
AAAAAGGAGA TAAGTTCTCT ACCAGGAAAC TATCATGTAT CTTTAGATAG ATTAAATGAA
ATAGTTGATG AAGTGGTAGC TTTAAATATA AAAGGTGTAA TTATATTTGG TCTTCCAGAA
CATAAGGATG CTTGTGGATC AGAGGCTTAT TCTGATAATG GAATAGTTCA AAAGGCTATT
AGAAAATTAA GAGAAGATTA TGAAAACTTA TTAATAATAA CTGATGTTTG TATGTGTGAA
TATACTAGTC ATGGTCATTG TGGAATAATA GAAGGAAAAG ATGTAGATAA TGACAAGACT
TTAAGTTTCT TAGATAAAAT AGCAGTTTCA CATGCTAAGG CTGGTGCTCA TATGGTAGCT
CCTTCAGATA TGATGGATGG AAGAATATTA TCTATGAGAA ATGCTTTAGA TGAAGCTGGA
TTTGTTAATG TTGGAATAAT GAGTTATTCT GCAAAATATT GCTCAGCTTT CTATGGACCT
TTCAGAGAAG CTGCTAATTC AGCACCACAA TTTGGAGATA GAAAAACTTA TCAAATGGAT
CCTGCAAACT CAAGAGAAGC CATAAAAGAA GTAGAGCAAG ACATAGAAGA GGGTGCAGAT
ATAGTTATGG TTAAACCTGC CTTATCATAT TTAGATATTG TTAAGGAAGT AAGAAATAAA
GTAGATGTAC CAGTTTGTGT TTATAATGTA AGTGGTGAAT TTGCTATGGT TAAGGCAGCA
GCTAAACTTG GACTTATAAA TGAAAAACAA GTAGCTTTAG AAATGCTTTT ATCAATGAAA
AGAGCAGGGG CAGATATGAT AATAACTTAT TATGCTATAG AAGCTGCTAA ATGGTTACAA
GAATAG
 
Protein sequence
MIKRGRRLRI NKEMRDLVRE NILTANDFIF PIFVAEGNNI KKEISSLPGN YHVSLDRLNE 
IVDEVVALNI KGVIIFGLPE HKDACGSEAY SDNGIVQKAI RKLREDYENL LIITDVCMCE
YTSHGHCGII EGKDVDNDKT LSFLDKIAVS HAKAGAHMVA PSDMMDGRIL SMRNALDEAG
FVNVGIMSYS AKYCSAFYGP FREAANSAPQ FGDRKTYQMD PANSREAIKE VEQDIEEGAD
IVMVKPALSY LDIVKEVRNK VDVPVCVYNV SGEFAMVKAA AKLGLINEKQ VALEMLLSMK
RAGADMIITY YAIEAAKWLQ E