Gene ECH_0464 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0464 
Symbol 
ID3927986 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp440717 
End bp442186 
Gene Length1470 bp 
Protein Length489 aa 
Translation table11 
GC content31% 
IMG OID637901588 
Productputative carboxypeptidase 
Protein accessionYP_507281 
Protein GI88658190 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2317] Zn-dependent carboxypeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.157916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACATT ATAAATTTTT GGAGCAAGTT TTTGGTAAAA TTGGAAATAT TAATTCTATT 
ATTAATGTTT TAGAAACTAG TGAAGGTAAT TTTTGCGATA AAATAGAGCA CATTTGTACC
CTTAAAGAAA TTAAGCATGA GATTCTTAAT AGTGAGATGA TCGGCGATAT GATACAACAT
TCTGTAGCTA ATAAAGCACA ATTAAATGAC TGGGAAATTG CTAACTTGAA CCATATTGAG
CTGATACATA AGAATAGTAG TGCAGTTCCT GTGGAGTTGG TATTAGACCT TTATAGAGCG
CAAGTTAAGT GTAAAAATTC TTGGGTATTA TTCCGTAATG GTGATGCTTC TATACAAGAT
ATAGTTGCTT TATTGTCTGA TGTTGTAAGA TTGGTAAGTG ATATTGCTTC TATTAAAGCT
GAGAGTTTAA AGATTTCTAA ATATGATGTT ATTTTAGGAC TACAAGATAG TAAATTGAAT
ACAAGAAAAG TAGATGCTAT ATTTACTGAA ATAGGAGCTT TTTTCCGTCA GTTTATCACT
GAGGTGGGTG ATAAGCAGAA ACACAATAAG ATTTGTTATC CAAAAGGTAT TAATGAAGAG
AAACAGATAC TTCTAGGTTA TGATGCTTTG TCGAGTTTTG GTATGACAAA CAGTAATATT
ATTAATAGTG ATTATGTAAA TAATAGATAT TCATTTGGGA AAGATTTACC TTTTTTAGTT
AATTATAGTG AAGATGATTA CAGAATCGGG TTAAAAACTT TATTCAGAAA AATAGGTTAT
GCTTTGTATG CTTTGAATTT ACCTGAGAAG TGGCATAAAC AACCTGTAGG GTGGAACTTA
AATAACATTC TATCTGAAAT TTTGGGGCTG TTAACATCGA ATCACTTAAT GATGAGTAAG
GAGTTTGTAA AGTTTATATC TCCTAATTTA AAGAAGCGAT TTTCTTTTAG AGGTAAGGTT
GGGCACTATG AGAATATTCA GTTATATTTT AACGAAGTGC AACCTAATTT GTTGATGCAT
AAATCTGATG AGGTGACTCT ACTAGCTCAT ATTATGCTGC GGTATACTTT GGAAAAGGAA
ATGATAAGTG ATTCTTTGCA AGTACAAGAC TTGCCAGATG CTTGGATTCA AGGAATGAAA
CACTATTTTA ATGTTGCTCC AAAGAATGAT TTGGAAGGGT TTTTACAAGA TGATTATTGG
GTGAGTGGTA TTTTTGGATA TTTTCCTTGT TGTATGATTT CTGCTATTAT TGCTTCTCAG
ATTTTTTCTA CTATGAAGAA TACTGATGTT CAAGTGTTGT CACAAGTAGA AAAGGGGGAT
TTATCATCAT TTATCCTGTG GATAAATAAG AATGTATGTG ATTACAGTAC GAAGTACAGT
AGCATGGATT TGTTAAAAAA AGTTACAGGT CAGAAATTGA ATGTTAATTT CTATAAAAAT
TATCTTACAA ATAAGTATCT CAACATGTAG
 
Protein sequence
MKHYKFLEQV FGKIGNINSI INVLETSEGN FCDKIEHICT LKEIKHEILN SEMIGDMIQH 
SVANKAQLND WEIANLNHIE LIHKNSSAVP VELVLDLYRA QVKCKNSWVL FRNGDASIQD
IVALLSDVVR LVSDIASIKA ESLKISKYDV ILGLQDSKLN TRKVDAIFTE IGAFFRQFIT
EVGDKQKHNK ICYPKGINEE KQILLGYDAL SSFGMTNSNI INSDYVNNRY SFGKDLPFLV
NYSEDDYRIG LKTLFRKIGY ALYALNLPEK WHKQPVGWNL NNILSEILGL LTSNHLMMSK
EFVKFISPNL KKRFSFRGKV GHYENIQLYF NEVQPNLLMH KSDEVTLLAH IMLRYTLEKE
MISDSLQVQD LPDAWIQGMK HYFNVAPKND LEGFLQDDYW VSGIFGYFPC CMISAIIASQ
IFSTMKNTDV QVLSQVEKGD LSSFILWINK NVCDYSTKYS SMDLLKKVTG QKLNVNFYKN
YLTNKYLNM