Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1067 |
Symbol | |
ID | 3927547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1095119 |
End bp | 1096264 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637902181 |
Product | D-alanyl-D-alanine carboxypeptidase family protein |
Protein accession | YP_507852 |
Protein GI | 88657866 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1686] D-alanyl-D-alanine carboxypeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.594952 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACGCC ATATAATCCT TCTACTGTTA TATTGTTTTA TTAGTATTCA GGTACATATA AGTAATGCTG TGGCCTTACC TATACAAACA ACAGCACCTC AAGCAGCAGT ATTCGATTTC TTTTCTAATA CAATGTTATT AGAACACAAC ATTGACGAAC AAATTGCACC CTCGTCTATG ACTAACTTAA TGACCTTATA TGTTACATTT TACTACATAA AAGCTGGATT TGTAAAAATG GAGGATAAAT TTAAGACTAG TAAAGAGGCT TGGCAGAAAG GAGGTAACTC CATTTTCCTA AGGGCGGGAC AGCTAGTAGC AGTAAGAGAT TTAATTAATG GTATTATTAC AACATCTGCA AATGACGCCT GTATTACTCT TGCAGAAGGC GTAGCAGGTT CACAGGAGAA CTTTGTTGAT GAAATGAATC GTATAGCACA AAAACTTAAC CTAACAAAAT CTCATTTCAG CAATGTAATA GGTGCTCAAG ATAAAAATCA ATTCATGTCA ATACGCGACT TAATAACTCT TACAGTAAGA ATTTTCGAAG ATTTTCCTGA ATATTATCAT CTATTCTCCA AAAAAGATTT TAAGTACAAT AATATATACC AAGAAAACAT CAACTTATTG TCACCAGATA ATAGAATAGA TGGGATAATT AGCATATATA CAGATGCAGG AGGATACGGG TCTATAGTAG CTGCTAAACA TGAAGGCAGA CGTATCTTCA TATTAATCAG CGGTCTTAAA ACTGAAAAAG AACGCATATC TGAAATAAAG CAGTTGCTAG ATTATGCTTT TAATGATTTT AGCAGCCAAA CCATCTTTCA CAAAGGTAGT AAAGCCAAGG AAATAATAGT TAAAAACGGT GATGCAAAAT ATGTAGAAGC TGTGTTCAAT AACGATGTGA TTATTCTATA TCCTAAAGGC TCATATGATA CAGTCAAAAC CTTTTTTTCA CACGAAAATG CAATATCTGC ACCAGTAAAA AAAGGACAAG AAGTTGGTCA TCTTCACATA CAGGTACCAG AACTTACAGA ACGCGTTATA CCTATGTACG CAGCAAATGA TATAAACCAT CTCAATTTCT TCCAAAGAAT ATTGTACATG TTTTCCCCTA AAACAGATAA AGTTGCTACT TCATAA
|
Protein sequence | MLRHIILLLL YCFISIQVHI SNAVALPIQT TAPQAAVFDF FSNTMLLEHN IDEQIAPSSM TNLMTLYVTF YYIKAGFVKM EDKFKTSKEA WQKGGNSIFL RAGQLVAVRD LINGIITTSA NDACITLAEG VAGSQENFVD EMNRIAQKLN LTKSHFSNVI GAQDKNQFMS IRDLITLTVR IFEDFPEYYH LFSKKDFKYN NIYQENINLL SPDNRIDGII SIYTDAGGYG SIVAAKHEGR RIFILISGLK TEKERISEIK QLLDYAFNDF SSQTIFHKGS KAKEIIVKNG DAKYVEAVFN NDVIILYPKG SYDTVKTFFS HENAISAPVK KGQEVGHLHI QVPELTERVI PMYAANDINH LNFFQRILYM FSPKTDKVAT S
|
| |