Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0828 |
Symbol | dapA |
ID | 3927943 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 843296 |
End bp | 844189 |
Gene Length | 894 bp |
Protein Length | 297 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901945 |
Product | dihydrodipicolinate synthase |
Protein accession | YP_507624 |
Protein GI | 88658264 |
COG category | [E] Amino acid transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0329] Dihydrodipicolinate synthase/N-acetylneuraminate lyase |
TIGRFAM ID | [TIGR00674] dihydrodipicolinate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTACAGCC TTATGGGTGT TTTTACGGCA TTAATTACAC CATTTAAGGA TGATTTTTCT ATAGATGAAA ACGCGTTTTG TCATCTAATT GAAGAACAAA TCAGTAATAA TATTCATGGT TTAGTTCCAT GCGGCACTAC TGCAGAATGT CCTACTTTAA GCTTTGAAGA GTACTGCAAA GTAATCGAAT TATGTGTTAA AATCACAAAT AAACGTGTCC CCATAATAGC TGGATCAAGC TCAAATTCTA CACAAGAAGC TATTAAACGC ACGTTATATG TTCAGTCTCT AAATGTAGAT GCAGCTTTAG TAGTTGTACC ATATTACAAT AGACCAAGTG ATGAAGGAAT ATATCAACAT TTTAAAGCAG TACATGATGC AACTAACCTT CCCATTATTG CATATAATAT ACCTAATAGA TCTGCTATTG ATGCAAGCGA TGCATTACTT GCACGAATTT TATCTTTGCC TAGAATAATA GGGCTTAAAG ATTCAACTGG AGATGTAAGT AGGCCTCTTA ATTTAAAATT ATTACTAAAT AAAGAAGTAG TTTTATTCTC AGGTGATGAT TCTACATGCT TAGGGTTCTA TGCTCAAAGT GGTAGTGGTG GTAGGACTGG ATGCATTTCT GTTGTTTCAA ATGTGATACC TAAAATACAT GCTGATATGC ACAATGCATT CCTTGCTAAT AATATGAAAG AAGCTATGAA TGCAAATTTA TCAGTATTTA AATTAGCAAA AGCATTGTTT TGTCAGTCAA GCCCTGCACC AACAAAATAT GCTATGAGCT TAATTAAAAA TATCTCACCA GCAGTAAGGT TGCCATTAGT GGAATTAACT CAAGAAAACA AATTAAAAGT TGAAAAAACG CTAAAAGAAT TAAAGTTAAT TTAA
|
Protein sequence | MYSLMGVFTA LITPFKDDFS IDENAFCHLI EEQISNNIHG LVPCGTTAEC PTLSFEEYCK VIELCVKITN KRVPIIAGSS SNSTQEAIKR TLYVQSLNVD AALVVVPYYN RPSDEGIYQH FKAVHDATNL PIIAYNIPNR SAIDASDALL ARILSLPRII GLKDSTGDVS RPLNLKLLLN KEVVLFSGDD STCLGFYAQS GSGGRTGCIS VVSNVIPKIH ADMHNAFLAN NMKEAMNANL SVFKLAKALF CQSSPAPTKY AMSLIKNISP AVRLPLVELT QENKLKVEKT LKELKLI
|
| |