Gene ECH_0900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0900 
SymbolclpX 
ID3927295 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp927610 
End bp928830 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content33% 
IMG OID637902017 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_507694 
Protein GI88658550 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAGATA ATGAAAAAAA TTCTTGTAGC TGTTCTTTCT GCGGAAAGAT TCATAGTGAA 
GTACGTAAGT TAATTGCTGG GCCATCAGTT TTTATTTGTA ATGAGTGTAT TGATTTATGT
AGTGGTATAT TACAAGAAGA AAGTAGATCT TATAAAAAGA CGGATACTCT TAAGTTAAAA
CCAAAGGAAA TAAAGAAAGT TCTTGATGAG TATGTTATAG GGCAAGAGCA CTCAAAAAAA
GTTTTATCAG TTGCTGTGTA TAATCATTAT AAACGTTTAT CGAATTTAAG TGTTATTAGT
GAAGTTGAGA TTTCTAAGTC AAATGTTTTG TTGATTGGAC CTACTGGTTC TGGAAAAACA
TTATTAGCTC GTACTTTAGC TAGAGTTTTA CAAGTTCCTT TTGCGATGGC TGATGCTACT
ACTTTAACGG AAGCAGGGTA TGTTGGAGAG GATGTAGAGA ATATATTGTT AAAATTATTG
CAGGCAGCTA ATTTTAATGT TGATGCAGCA CAACGTGGCA TAATTTATAT TGATGAAGTA
GATAAAATTT CTAGAAAGTC TGAAAATACT TCTATTACTC GTGATGTATC TGGCGAAGGT
GTTCAACAAG CTTTATTGAA AGTTATTGAA GGCACAGTTT CTTCTGTCCC ACCTCAAGGT
GGTAGGAAGC ATCCACATCA AGAGTTTATA CAAATAAATA CTGATAATAT TTTATTTATA
TTTGGTGGTG CGTTTGATGG TTTAGATAAA ATTATAGAAT CTCGTCATAG AGGTAGTAGT
ATGGGGTTTG AAGCGAATGT ACAAAAAGTA TCAAAGAATA AAGATATTTT TTGTTACACT
GAGCCAGAAG ATTTAGTGAA GTTTGGTTTA ATTCCGGAGT TTGTTGGTAG AATTCCTGTT
ATTACATCTT TAGGTGAGCT TGATGAGAGT ACTTTATGTC GTATTTTAGT TGAACCAAAA
AATTCTTTGG TTAAACAATA CAAGAAACTT TTTGAAATGG ATAATATTAA TCTTCAGTTT
GATGATAGTG CATTATCAGT AATTGCAAAA AAGGCTGCTG TCAGGAAAAC TGGTGCTAGA
GGTTTAAGGG CTATTTTAGA AGCATTATTA CTTGATTTAA TGTTTGAAAG CCCTGGAAGT
TCTGATGTGA ATCAAGTAGT AATTAGTAAG GAGATGGTTG AAGAGTTGAT GGTAAGCTCG
CACTTATTTT TAAAACATTA A
 
Protein sequence
MADNEKNSCS CSFCGKIHSE VRKLIAGPSV FICNECIDLC SGILQEESRS YKKTDTLKLK 
PKEIKKVLDE YVIGQEHSKK VLSVAVYNHY KRLSNLSVIS EVEISKSNVL LIGPTGSGKT
LLARTLARVL QVPFAMADAT TLTEAGYVGE DVENILLKLL QAANFNVDAA QRGIIYIDEV
DKISRKSENT SITRDVSGEG VQQALLKVIE GTVSSVPPQG GRKHPHQEFI QINTDNILFI
FGGAFDGLDK IIESRHRGSS MGFEANVQKV SKNKDIFCYT EPEDLVKFGL IPEFVGRIPV
ITSLGELDES TLCRILVEPK NSLVKQYKKL FEMDNINLQF DDSALSVIAK KAAVRKTGAR
GLRAILEALL LDLMFESPGS SDVNQVVISK EMVEELMVSS HLFLKH