Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0092 |
Symbol | hemA |
ID | 3927318 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 83016 |
End bp | 84221 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901216 |
Product | 5-aminolevulinate synthase |
Protein accession | YP_506921 |
Protein GI | 88658613 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0156] 7-keto-8-aminopelargonate synthetase and related enzymes |
TIGRFAM ID | [TIGR01821] 5-aminolevulinic acid synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.3402 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATAGATT ATGAAGAGAT ATTTTGTAGC AAGATTAAGC GTATAAAAGA TGAAGGTCGA TATAGAGAAT TTACAGGATT CTCACGTATT CCTGGTCAGT TTCCTTATGC TATAGAATGT GATGTAAATA ACGTAGTTAC TCTATGGTGT AGCAATGATT ATTTGGGTAT GGGACAAAAT GAACATATGA TTCTTGCTAT CAAAAATTAT AGTAGTAGTG TAGGTGCTGG TGGTACACGG AATATCTCTG GCACTACAAA GGAGATTATT GAGCTTGAAA AATCATTAGC AGATTTACAT AAAAAACCAG CTGCTCTGAC TTTTGTGTGT GGCTATATTG CAAATCAGAC TACTATTAGT ACAGTGCTTT CTGTAATACC AGATATTGTA GTTTTTTCAG ATGAGAAAAA TCATTCTTCT ATGATAGAAG GTATTAGATC TACTAATAGA GCAAAGCATA TATTTAGGCA CAATGATCTT AATCACCTTG AGACTTTGTT AAAATCTGTT GATATATCTG TACCTAAGAT CATAATATTT GAATCCTTGT ATTCTATGGA TGGAGATATA GCTCCAATTG CAAAAATTTG TGATTTAGCT GATAAATATA ATGCAATAAC TTATTTAGAT GAAGTACATG CAGTTGGTAT GTATGGAAGT CGTGGAGGAG GTATATCAGA ACAAGAAAAT ATATCTGATA GAGTGACAAT TATTCAAGGA ACTCTTTCCA AAGCTTTTGG AGTTATGGGT GGATATATTA CTGGATCAAA AAATGTGGTA GATGTTGTCA GAAGCTTTGC TCCTGGTTTT ATTTTTACTA CAGCATTATC ACCTCTTATT GCATCATCTG CTAGAATAAG TGTTGAGCAT TTAAAGAATA GTTCTATTGA GAGAGAAAAG CAACGTGAGG TTGTTAACAA AGTAAAAGAA TCTTTTTCTA AGGCTGGTAT TGACTTTGTC AAAACGGATA CACATATAAT TCCTGTAATT ATAGGTGATT CAGTGGCTTG TACAGAGATT TCACGAGTGT TACTTAAGGA ATATAGGATA TATATACAAT CTATTAATTA TCCTACTGTG CCTGTAGGTA CAGAGAGATT AAGGATTACT CCAACACCTT ATCATACAGA TGAAATGATA GATAAATTAA CCCAAGCGTT GGTTGATGTT TTGTGTAGGT TTAAAATAAT GAATAAGCAA AATTAA
|
Protein sequence | MIDYEEIFCS KIKRIKDEGR YREFTGFSRI PGQFPYAIEC DVNNVVTLWC SNDYLGMGQN EHMILAIKNY SSSVGAGGTR NISGTTKEII ELEKSLADLH KKPAALTFVC GYIANQTTIS TVLSVIPDIV VFSDEKNHSS MIEGIRSTNR AKHIFRHNDL NHLETLLKSV DISVPKIIIF ESLYSMDGDI APIAKICDLA DKYNAITYLD EVHAVGMYGS RGGGISEQEN ISDRVTIIQG TLSKAFGVMG GYITGSKNVV DVVRSFAPGF IFTTALSPLI ASSARISVEH LKNSSIEREK QREVVNKVKE SFSKAGIDFV KTDTHIIPVI IGDSVACTEI SRVLLKEYRI YIQSINYPTV PVGTERLRIT PTPYHTDEMI DKLTQALVDV LCRFKIMNKQ N
|
| |