Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0644 |
Symbol | |
ID | 3927736 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 648209 |
End bp | 649258 |
Gene Length | 1050 bp |
Protein Length | 349 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901766 |
Product | putative DNA-binding/iron metalloprotein/AP endonuclease |
Protein accession | YP_507454 |
Protein GI | 88657692 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.684449 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGATAAAT TAAAAATAGT ATTAGGTATA GAAACAAGTT GTGATGAAAC TGCTGTTGCT ATTGTAAATA GTAATAAGGA GGTTTTATCA CATAAAATTC TTTCACAGCA GGAACATGCA GCATATGGTG GAGTTGTTCC GGAAATTGCT TCTCGTGCTC ATATTAATTA TTTATATGAG TTAGTTGGTA GTTGTATAGA GGAGTCGCAA CTTTGTTTTA ATGATATTGA TGCTATTGCT GTTACTGCAG GTCCTGGTCT TATTGGTGGT TTAATAGTTG GCATAATGAT GGCTAAAGCA ATTTCCAGTG TTACTGGTAA GCCTATTATT GAAATTAATC ATTTAGAGGC TCATGCTTTA ATTATTCGTA TGTTTTATGA AATAGATTTT CCATTCTTAT TGTTAATAAT GTCTGGAGGG CATTGTCAGT TCTTGGTAGC TTACGATGTA AGGTGTTATT ATAAGTTAGG TTCTTCTTTG GATGATTCTT TAGGTGAAGT ATTTGATAAA GTAGCAAGAA TGTTAAATCT TGGGTATCCT GGGGGGCCAA TTATTGAAGA GAAGTCTTTG TTAGGTGATA GTGGAAGTTT TACTTTACCA CGAGCATTAA CTAACCGTCC TGGATGTGAT TTTTCGTTTT CTGGACTTAA AACTGCTGTG AGAAATATTA TTGCAGGTCA GAAGTGTATA AATCATGAGT TGGTATGTAA TATTTCAGCA TCTTTCCAAG ATTGTGTTGG TGATATATTG GTAAACAGGA TTAACAACGC GATTGTAATG TCAAAAGATA TAGATCACAG GATTAATAAG TTAGTAGTAA CTGGTGGTGT TGCAGCTAAT AAATTATTAC GTAATCGTAT GTCAGTATGT GCAAATGATA ATGGTTTTGA AATATTGTAT CCTCCAAGTA AGTTATGTAC TGATAATGGA GTTATGATAG GATGGGCTGG TATTGAAAAT TTAGCAAAGG GTTATGTTTC AAATTTAAAT TTTTTTCCAA GAGCAAGGTG GCCTTTAGAA AATTTGAGGT TTGATATACT AAGAAAGTAG
|
Protein sequence | MDKLKIVLGI ETSCDETAVA IVNSNKEVLS HKILSQQEHA AYGGVVPEIA SRAHINYLYE LVGSCIEESQ LCFNDIDAIA VTAGPGLIGG LIVGIMMAKA ISSVTGKPII EINHLEAHAL IIRMFYEIDF PFLLLIMSGG HCQFLVAYDV RCYYKLGSSL DDSLGEVFDK VARMLNLGYP GGPIIEEKSL LGDSGSFTLP RALTNRPGCD FSFSGLKTAV RNIIAGQKCI NHELVCNISA SFQDCVGDIL VNRINNAIVM SKDIDHRINK LVVTGGVAAN KLLRNRMSVC ANDNGFEILY PPSKLCTDNG VMIGWAGIEN LAKGYVSNLN FFPRARWPLE NLRFDILRK
|
| |