Gene ECH_0644 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0644 
Symbol 
ID3927736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp648209 
End bp649258 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content33% 
IMG OID637901766 
Productputative DNA-binding/iron metalloprotein/AP endonuclease 
Protein accessionYP_507454 
Protein GI88657692 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.684449 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGATAAAT TAAAAATAGT ATTAGGTATA GAAACAAGTT GTGATGAAAC TGCTGTTGCT 
ATTGTAAATA GTAATAAGGA GGTTTTATCA CATAAAATTC TTTCACAGCA GGAACATGCA
GCATATGGTG GAGTTGTTCC GGAAATTGCT TCTCGTGCTC ATATTAATTA TTTATATGAG
TTAGTTGGTA GTTGTATAGA GGAGTCGCAA CTTTGTTTTA ATGATATTGA TGCTATTGCT
GTTACTGCAG GTCCTGGTCT TATTGGTGGT TTAATAGTTG GCATAATGAT GGCTAAAGCA
ATTTCCAGTG TTACTGGTAA GCCTATTATT GAAATTAATC ATTTAGAGGC TCATGCTTTA
ATTATTCGTA TGTTTTATGA AATAGATTTT CCATTCTTAT TGTTAATAAT GTCTGGAGGG
CATTGTCAGT TCTTGGTAGC TTACGATGTA AGGTGTTATT ATAAGTTAGG TTCTTCTTTG
GATGATTCTT TAGGTGAAGT ATTTGATAAA GTAGCAAGAA TGTTAAATCT TGGGTATCCT
GGGGGGCCAA TTATTGAAGA GAAGTCTTTG TTAGGTGATA GTGGAAGTTT TACTTTACCA
CGAGCATTAA CTAACCGTCC TGGATGTGAT TTTTCGTTTT CTGGACTTAA AACTGCTGTG
AGAAATATTA TTGCAGGTCA GAAGTGTATA AATCATGAGT TGGTATGTAA TATTTCAGCA
TCTTTCCAAG ATTGTGTTGG TGATATATTG GTAAACAGGA TTAACAACGC GATTGTAATG
TCAAAAGATA TAGATCACAG GATTAATAAG TTAGTAGTAA CTGGTGGTGT TGCAGCTAAT
AAATTATTAC GTAATCGTAT GTCAGTATGT GCAAATGATA ATGGTTTTGA AATATTGTAT
CCTCCAAGTA AGTTATGTAC TGATAATGGA GTTATGATAG GATGGGCTGG TATTGAAAAT
TTAGCAAAGG GTTATGTTTC AAATTTAAAT TTTTTTCCAA GAGCAAGGTG GCCTTTAGAA
AATTTGAGGT TTGATATACT AAGAAAGTAG
 
Protein sequence
MDKLKIVLGI ETSCDETAVA IVNSNKEVLS HKILSQQEHA AYGGVVPEIA SRAHINYLYE 
LVGSCIEESQ LCFNDIDAIA VTAGPGLIGG LIVGIMMAKA ISSVTGKPII EINHLEAHAL
IIRMFYEIDF PFLLLIMSGG HCQFLVAYDV RCYYKLGSSL DDSLGEVFDK VARMLNLGYP
GGPIIEEKSL LGDSGSFTLP RALTNRPGCD FSFSGLKTAV RNIIAGQKCI NHELVCNISA
SFQDCVGDIL VNRINNAIVM SKDIDHRINK LVVTGGVAAN KLLRNRMSVC ANDNGFEILY
PPSKLCTDNG VMIGWAGIEN LAKGYVSNLN FFPRARWPLE NLRFDILRK