Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0479 |
Symbol | |
ID | 3927126 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 458144 |
End bp | 459868 |
Gene Length | 1725 bp |
Protein Length | 574 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 637901602 |
Product | M24 family metallopeptidase |
Protein accession | YP_507295 |
Protein GI | 88657638 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATA GGTTAAATCA ATTAATTGGT TTAATGGAAG AGTATGAAAT TGATGTATTG TTATTGCAGA ATACGGATGA ATATCAGTGT GAGTATGTGC ATATTAACAA GCAAAGAATA AGATGGTTGT GTGGTTTTTC TGGCTCAAAT GCTACTTTGA TAATATCAAG GGAAGGTAAA CAGAATTTTT TTACTGATGG TAGATATACA TTACAAGCAA CAAGGGAGTT GGACTGTAGT TATTATCAAA TACATAATGT GTGTGAGTTA ACTCCTTGGC AATGGTGTGT AGAAAATTGT CTATCTCATA CTGTTGTAGC TTATGAATCT GCATTGTTTA CTTTGAGTCA AATAAGAAAG TATGAAGATT GTGGTATTTT TTTAAAACCA ATAGATCAAA TTTTGATTGA TAAGTTGTGG ATTCGTGATT TTGCTATAGA ACATGATATA GTACAGCATT CTTTAGAATA TTCTGGTGTT GAGAGTTATA CAAAGTCTTG TGAAGTTGCA AAATATTTGT CTGACAAAGA TGCTGCATTA ATCACAAATA CTGATGTTAT TTCATGGATG TTAAATATAC GTAATAAAAA GTTTCTTTAT AATCCTTCAG TGTTGTCTAG AGCGATTTTA TATAAGGATG GAAGAGTTGA TTTATTTATT GATGATGTTT ACTCAGTAAA TGTTAAATAT GAGCATCTTA ATATATGTTC TTTGAATAAT CTATTTAATG TTTTGAAATC TGTAAAATCA GTAGTTGTAG ATGCATCTAC TATACCAATG AGTATTTTTC TATCTTTACA ACAACAGGAT GTATTAGTTA ATGATGCAGA TTTTTGTCTT TTAATGAAAG CCAGAAAGAA TGATGTTGAA ATACAAGGTG CAATTAATGC TCATGTTAGA GATGGAATTT CTATAGTTAA TTTATTATAC TGGTTAAATA TGCAATTGGA TAATAATCAG AAAATTACAG AGCTGGATGT TGAGTCAAAA TTATTAGATT TCAGAAAACA GCAAAGTTTG TTTCAAGGTG AAAGTTTTTC TACAATTTCT GGTTTTCAAG AAAACGGAGC TGTAATACAT TACAGAGCGA ATAATGATAC AAATAAGTTA ATATGTAAGA ATGGATTATA TTTATTAGAT TCTGGTGGTC AATATCTTGA TGGTACAACA GATGTTACGC GTACTGTTGC TATAGGTGAA CCTACATCTG AGCAAATTAC TAATTTTACG TTAGTTTTAA AAGGTCATAT TGCTTTAGCG ATGGCAGTCT TTCCTTTAGG TACTACTGGT GGAATGTTAG ATATATTAGC TAGACAGTAT TTGTGGAAAT CAGGACTTGA TTATCAACAT GGTACGGGTC ATGGGGTGGG AAGTTTTTTA TCAGTTCATG AAGGTCCTTG TGCTATTTCG TACAAGAATG ATGTTGTATT GCAGCCAAAT ATGGTGCTAT CAAATGAGCC AGGATATTAT AAAAATGGTG AATATGGAAT AAGGATTGAA AATCTGATGT ACGTTGAAGA ATATATGAAT GGCTTTTTAA GATTTAAACA GTTAACATGT GTGCCTATAG ATTTAAGATT AATAGATGTA GATATGTTAA ATCATGAGGA AATCAATTAC ATAGACCAGT ATCATAATTT TGTATATAAC ACTATTGCTC CACATGTAAG TGAAGAAGTA AAACATTGGT TATGTCATGC ATGTCAGAGT TTAAAAGGTA AGTAG
|
Protein sequence | MKNRLNQLIG LMEEYEIDVL LLQNTDEYQC EYVHINKQRI RWLCGFSGSN ATLIISREGK QNFFTDGRYT LQATRELDCS YYQIHNVCEL TPWQWCVENC LSHTVVAYES ALFTLSQIRK YEDCGIFLKP IDQILIDKLW IRDFAIEHDI VQHSLEYSGV ESYTKSCEVA KYLSDKDAAL ITNTDVISWM LNIRNKKFLY NPSVLSRAIL YKDGRVDLFI DDVYSVNVKY EHLNICSLNN LFNVLKSVKS VVVDASTIPM SIFLSLQQQD VLVNDADFCL LMKARKNDVE IQGAINAHVR DGISIVNLLY WLNMQLDNNQ KITELDVESK LLDFRKQQSL FQGESFSTIS GFQENGAVIH YRANNDTNKL ICKNGLYLLD SGGQYLDGTT DVTRTVAIGE PTSEQITNFT LVLKGHIALA MAVFPLGTTG GMLDILARQY LWKSGLDYQH GTGHGVGSFL SVHEGPCAIS YKNDVVLQPN MVLSNEPGYY KNGEYGIRIE NLMYVEEYMN GFLRFKQLTC VPIDLRLIDV DMLNHEEINY IDQYHNFVYN TIAPHVSEEV KHWLCHACQS LKGK
|
| |