Gene ECH_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1067 
Symbol 
ID3927547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1095119 
End bp1096264 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content34% 
IMG OID637902181 
ProductD-alanyl-D-alanine carboxypeptidase family protein 
Protein accessionYP_507852 
Protein GI88657866 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1686] D-alanyl-D-alanine carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.594952 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACGCC ATATAATCCT TCTACTGTTA TATTGTTTTA TTAGTATTCA GGTACATATA 
AGTAATGCTG TGGCCTTACC TATACAAACA ACAGCACCTC AAGCAGCAGT ATTCGATTTC
TTTTCTAATA CAATGTTATT AGAACACAAC ATTGACGAAC AAATTGCACC CTCGTCTATG
ACTAACTTAA TGACCTTATA TGTTACATTT TACTACATAA AAGCTGGATT TGTAAAAATG
GAGGATAAAT TTAAGACTAG TAAAGAGGCT TGGCAGAAAG GAGGTAACTC CATTTTCCTA
AGGGCGGGAC AGCTAGTAGC AGTAAGAGAT TTAATTAATG GTATTATTAC AACATCTGCA
AATGACGCCT GTATTACTCT TGCAGAAGGC GTAGCAGGTT CACAGGAGAA CTTTGTTGAT
GAAATGAATC GTATAGCACA AAAACTTAAC CTAACAAAAT CTCATTTCAG CAATGTAATA
GGTGCTCAAG ATAAAAATCA ATTCATGTCA ATACGCGACT TAATAACTCT TACAGTAAGA
ATTTTCGAAG ATTTTCCTGA ATATTATCAT CTATTCTCCA AAAAAGATTT TAAGTACAAT
AATATATACC AAGAAAACAT CAACTTATTG TCACCAGATA ATAGAATAGA TGGGATAATT
AGCATATATA CAGATGCAGG AGGATACGGG TCTATAGTAG CTGCTAAACA TGAAGGCAGA
CGTATCTTCA TATTAATCAG CGGTCTTAAA ACTGAAAAAG AACGCATATC TGAAATAAAG
CAGTTGCTAG ATTATGCTTT TAATGATTTT AGCAGCCAAA CCATCTTTCA CAAAGGTAGT
AAAGCCAAGG AAATAATAGT TAAAAACGGT GATGCAAAAT ATGTAGAAGC TGTGTTCAAT
AACGATGTGA TTATTCTATA TCCTAAAGGC TCATATGATA CAGTCAAAAC CTTTTTTTCA
CACGAAAATG CAATATCTGC ACCAGTAAAA AAAGGACAAG AAGTTGGTCA TCTTCACATA
CAGGTACCAG AACTTACAGA ACGCGTTATA CCTATGTACG CAGCAAATGA TATAAACCAT
CTCAATTTCT TCCAAAGAAT ATTGTACATG TTTTCCCCTA AAACAGATAA AGTTGCTACT
TCATAA
 
Protein sequence
MLRHIILLLL YCFISIQVHI SNAVALPIQT TAPQAAVFDF FSNTMLLEHN IDEQIAPSSM 
TNLMTLYVTF YYIKAGFVKM EDKFKTSKEA WQKGGNSIFL RAGQLVAVRD LINGIITTSA
NDACITLAEG VAGSQENFVD EMNRIAQKLN LTKSHFSNVI GAQDKNQFMS IRDLITLTVR
IFEDFPEYYH LFSKKDFKYN NIYQENINLL SPDNRIDGII SIYTDAGGYG SIVAAKHEGR
RIFILISGLK TEKERISEIK QLLDYAFNDF SSQTIFHKGS KAKEIIVKNG DAKYVEAVFN
NDVIILYPKG SYDTVKTFFS HENAISAPVK KGQEVGHLHI QVPELTERVI PMYAANDINH
LNFFQRILYM FSPKTDKVAT S