Gene Information |
Locus tag | Dshi_1563 |
Symbol | hemB1 |
ID | 5712707 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | + |
Start bp | 1626965 |
End bp | 1627966 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641267478 |
Product | delta-aminolevulinic acid dehydratase |
Protein accession | YP_001532906 |
Protein GI | 159044112 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0113] Delta-aminolevulinic acid dehydratase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0018344 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.110847 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGACCGA CCCAAGCCCC CTTCCCCCAT GCCCGCTTCC GCCGTCTGCG CCGCACCCCG GCCCTGCGCA ATCTCACGCG CCAATCGGAA TTGAGCGTCC ATGACCTGAT CTGGCCGATC TTTGTCAGCG AGCCCGAAGG CGCGGTGGAC ATCCCCTCGA TGCCCGGGGT GTCGCGGCTG ACGGTCGAGG GCGCGGTGAG GGCGGCGGAG CGGGCGGCGA GCTTGGGCAT TCCGGCGATT TGCCTGTTTC CCTATACGGA TCCGTCCCTC AAGACCCAGC TGTGCGAAGA GGCCTGGAAC CCCGACAACC TCAGCAACCG GGCGATCCGG GCGATCAAGT CCGAGGTGCC CGAGATCGCG GTGATGACCG ATGTGGCCCT CGACCCCTAC AACATCAACG GCCATGACGG GATCGTCCGC GACGGGGTGA TCGTGAATGA CGAGAGCGTC GCGGCACTGG TCAAGATGGC CGTGGCCCAG GCGGAATCCG GGGCCGATAT TCTCGGGCCT TCGGACATGA TGGACGGGCG GATCGGCGCG ATGCGGGCGG CGCTGGAGGC GGCGGGCCAT TCGGATGTCA CCATCCTGAG CTATGCGGCG AAGTATTCCA GCGGGTTCTA CGGTCCCTTC CGGGACGCGG TCGGCGCGTC CGGCGCGCTG GTGGGCGACA AGAACACCTA CCAGATGGAC CCGGGCAATT CCGACGAGGC GCTGCGCCTG ATCGAGCGCG ACCTGCTGGA GGGTGCGGAT ATGGTGATGG TCAAGCCGGG GATGCCCTAT CTCGACATCT GTCGCCGGGT GAAGGACGCC TTCGGGGTGC CGACCTATGC CTACCAGGTG TCAGGCGAAT ACGCGATGTT GCAGGCGGCG AGCGCCAATG GCTGGCTCGA CCATGACAAG GTGATGTTCG AGTCGCTTCT GGCCTTCAAA CGCGCGGGGT GCGATGGAAT CCTGACCTAT TTCGCCCCTG TGGTGGCCGA ACGGCTCCGC GGAATCGCCT GA
|
Protein sequence | MRPTQAPFPH ARFRRLRRTP ALRNLTRQSE LSVHDLIWPI FVSEPEGAVD IPSMPGVSRL TVEGAVRAAE RAASLGIPAI CLFPYTDPSL KTQLCEEAWN PDNLSNRAIR AIKSEVPEIA VMTDVALDPY NINGHDGIVR DGVIVNDESV AALVKMAVAQ AESGADILGP SDMMDGRIGA MRAALEAAGH SDVTILSYAA KYSSGFYGPF RDAVGASGAL VGDKNTYQMD PGNSDEALRL IERDLLEGAD MVMVKPGMPY LDICRRVKDA FGVPTYAYQV SGEYAMLQAA SANGWLDHDK VMFESLLAFK RAGCDGILTY FAPVVAERLR GIA
|
| |
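The coordinate and length fields in this record are related by simple arithmetic, which the following minimal sketch checks. It assumes the Start bp and End bp values are 1-based and inclusive, and that the reported protein length excludes the terminal stop codon (the gene sequence above ends in TGA). The function names `cds_length`, `protein_length`, and `gc_percent` are hypothetical helpers written for this illustration and are not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces between 10-base blocks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values taken from the record above (Dshi_1563).
gene_len = cds_length(1626965, 1627966)
print(gene_len)                    # 1002
print(protein_length(gene_len))    # 333
# GC content of the first 10-base block only, as a small worked case.
print(gc_percent("ATGCGACCGA"))    # 60.0
```

Passing the full gene sequence string to `gc_percent` gives the whole-gene value to compare with the GC content field in the record.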