Gene Information |
Locus tag | ECH_0464 |
Symbol | |
ID | 3927986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 440717 |
End bp | 442186 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637901588 |
Product | putative carboxypeptidase |
Protein accession | YP_507281 |
Protein GI | 88658190 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2317] Zn-dependent carboxypeptidase |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.157916 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAAACATT ATAAATTTTT GGAGCAAGTT TTTGGTAAAA TTGGAAATAT TAATTCTATT ATTAATGTTT TAGAAACTAG TGAAGGTAAT TTTTGCGATA AAATAGAGCA CATTTGTACC CTTAAAGAAA TTAAGCATGA GATTCTTAAT AGTGAGATGA TCGGCGATAT GATACAACAT TCTGTAGCTA ATAAAGCACA ATTAAATGAC TGGGAAATTG CTAACTTGAA CCATATTGAG CTGATACATA AGAATAGTAG TGCAGTTCCT GTGGAGTTGG TATTAGACCT TTATAGAGCG CAAGTTAAGT GTAAAAATTC TTGGGTATTA TTCCGTAATG GTGATGCTTC TATACAAGAT ATAGTTGCTT TATTGTCTGA TGTTGTAAGA TTGGTAAGTG ATATTGCTTC TATTAAAGCT GAGAGTTTAA AGATTTCTAA ATATGATGTT ATTTTAGGAC TACAAGATAG TAAATTGAAT ACAAGAAAAG TAGATGCTAT ATTTACTGAA ATAGGAGCTT TTTTCCGTCA GTTTATCACT GAGGTGGGTG ATAAGCAGAA ACACAATAAG ATTTGTTATC CAAAAGGTAT TAATGAAGAG AAACAGATAC TTCTAGGTTA TGATGCTTTG TCGAGTTTTG GTATGACAAA CAGTAATATT ATTAATAGTG ATTATGTAAA TAATAGATAT TCATTTGGGA AAGATTTACC TTTTTTAGTT AATTATAGTG AAGATGATTA CAGAATCGGG TTAAAAACTT TATTCAGAAA AATAGGTTAT GCTTTGTATG CTTTGAATTT ACCTGAGAAG TGGCATAAAC AACCTGTAGG GTGGAACTTA AATAACATTC TATCTGAAAT TTTGGGGCTG TTAACATCGA ATCACTTAAT GATGAGTAAG GAGTTTGTAA AGTTTATATC TCCTAATTTA AAGAAGCGAT TTTCTTTTAG AGGTAAGGTT GGGCACTATG AGAATATTCA GTTATATTTT AACGAAGTGC AACCTAATTT GTTGATGCAT AAATCTGATG AGGTGACTCT ACTAGCTCAT ATTATGCTGC GGTATACTTT GGAAAAGGAA ATGATAAGTG ATTCTTTGCA AGTACAAGAC TTGCCAGATG CTTGGATTCA AGGAATGAAA CACTATTTTA ATGTTGCTCC AAAGAATGAT TTGGAAGGGT TTTTACAAGA TGATTATTGG GTGAGTGGTA TTTTTGGATA TTTTCCTTGT TGTATGATTT CTGCTATTAT TGCTTCTCAG ATTTTTTCTA CTATGAAGAA TACTGATGTT CAAGTGTTGT CACAAGTAGA AAAGGGGGAT TTATCATCAT TTATCCTGTG GATAAATAAG AATGTATGTG ATTACAGTAC GAAGTACAGT AGCATGGATT TGTTAAAAAA AGTTACAGGT CAGAAATTGA ATGTTAATTT CTATAAAAAT TATCTTACAA ATAAGTATCT CAACATGTAG
|
Protein sequence | MKHYKFLEQV FGKIGNINSI INVLETSEGN FCDKIEHICT LKEIKHEILN SEMIGDMIQH SVANKAQLND WEIANLNHIE LIHKNSSAVP VELVLDLYRA QVKCKNSWVL FRNGDASIQD IVALLSDVVR LVSDIASIKA ESLKISKYDV ILGLQDSKLN TRKVDAIFTE IGAFFRQFIT EVGDKQKHNK ICYPKGINEE KQILLGYDAL SSFGMTNSNI INSDYVNNRY SFGKDLPFLV NYSEDDYRIG LKTLFRKIGY ALYALNLPEK WHKQPVGWNL NNILSEILGL LTSNHLMMSK EFVKFISPNL KKRFSFRGKV GHYENIQLYF NEVQPNLLMH KSDEVTLLAH IMLRYTLEKE MISDSLQVQD LPDAWIQGMK HYFNVAPKND LEGFLQDDYW VSGIFGYFPC CMISAIIASQ IFSTMKNTDV QVLSQVEKGD LSSFILWINK NVCDYSTKYS SMDLLKKVTG QKLNVNFYKN YLTNKYLNM
|
| |
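The length fields in this record can be cross-checked against the coordinates with simple arithmetic. The sketch below is illustrative only and is not part of the database: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `expected_protein_length`, `normalize_sequence`, `gc_percent`) are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Gene length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Protein length in aa for an unspliced CDS: codon count minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def normalize_sequence(seq: str) -> str:
    """Remove the spaces that split the sequence into 10-character blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    clean = normalize_sequence(seq)
    if not clean:
        raise ValueError("empty sequence")
    gc = sum(1 for base in clean if base in "GC")
    return 100.0 * gc / len(clean)


# Values copied from the record above.
START_BP = 440717
END_BP = 442186

length_bp = gene_length(START_BP, END_BP)
protein_aa = expected_protein_length(length_bp)
print(length_bp, protein_aa)
```

Under these assumptions the computed values agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields. Passing the full Gene sequence field to `gc_percent` would give a figure to compare with the reported GC content.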