Gene Daud_1478 details

Gene Information

Locus tag: Daud_1478
Symbol: clpX
ID: 6026715
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Desulforudis audaxviator MP104C
Kingdom: Bacteria
Replicon accession: NC_010424
Strand:
Start bp: 1563033
End bp: 1564286
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 58%
IMG OID: 641594296
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_001717616
Protein GI: 169831634
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
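
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1563033
END_BP = 1564286
GENE_LENGTH_BP = 1254
PROTEIN_LENGTH_AA = 417


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both comparisons hold for this record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```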


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0037217
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTTAACG AGAAGGGCCA GTTGAAATGT TCGTTTTGCG GTAAGCTGCA GGATCAGGTC 
AAGAAGCTGG TGGCTGGTCC CGGGGTATAC ATTTGTGATG AATGCATAGA GCTTTGCAAC
GAGATCATCG AGGAGGAACT CAGCGAGGAT CTGGGGCTGG AGTTGCGGGA TATTCCGAAG
CCGCGGGAAA TCAAGGATTA CCTGGATCAG TACGTGATCG GCCAGGAGTA TGCAAAGAAA
ATCCTCGCCG TGGCCGTTTA CAACCATTAC AAGCGGATAA ACCTCGGCGG CAAGCTTGAG
GACGTGGAGC TGCAAAAGAG CAACATTGTC ATGCTCGGGC CGACCGGTTC CGGGAAGACC
TTGCTGGCGC AGACGCTGGC GCGGTTTCTA AACGTGCCCT TTGCAATCGC CGACGCCACT
TCGCTGACCG AGGCCGGGTA TGTGGGAGAG GATGTGGAGA ACATCCTCCT GAAGCTCATC
CAGGCCGCCG ATTACGACGT GGAGAAGGCG GAGAAGGGCA TTGTGTACAT CGACGAGGTC
GACAAGATCG CGCGTAAGTC GGAAAACCCT TCCATCACCA GGGACGTTTC CGGCGAGGGC
GTGCAGCAGG CCCTCCTGAA GATTCTGGAG GGTACGGTGG CCAGCGTGCC GCCGCAGGGC
GGCCGCAAGC ACCCGCACCA GGAGTTCATC CAGCTGGATA CCACAAACAT TCTGTTCATC
TGCGGCGGGG CCTTTGAGGG GATCGACAAG ATCATCCAGA GCCGGGTGGC TAAGAAGACT
ATGGGGTTCG GGGCCGAACT GACGCTGAAG CGGGACCGCA AGCTGGGTGA CATCCTGCGG
AACATCCTGC CCCAGGATCT CTTGAAGTAC GGCCTGATCC CCGAGTTTGT CGGGCGCCTG
CCGGTCATCG TGACGCTGGA CCCGCTCAAC CAAGAGGACC TGGTCAGGAT CCTGGTTGAG
CCGCGCAACG CCCTGGTGAA GCAGTATGAG AAGCTCTTCG AAATCGACGG GGTCGCCCTG
GAGTTTCAGG AAGAAGCGCT CCAGGCCATC GCTGAGGAGG CTATCCGGCG CAACACCGGC
GCCCGGGGCC TGCGGGCGAT TCTGGAGGAG ATCATGCTGA ACGTCATGTA TGATATTCCG
TCCCGGGGCG ACGTCGCCAA GTGCACCATT TCGAGGGAAA CGGTCGTGAA CCGGGAGAAC
CCGCTGATCA TTACCGTGGA GCGCAGCAAG AAGAAAAAGG AAAGTGCCTT GTAA
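
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. As a rough sketch of how a GC content figure like the one in the record could be derived, the hypothetical helpers below strip whitespace and count G and C bases; how the database itself rounds the value is not stated here.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage: paste the full gene sequence into a string and pass it in.
first_line = "ATGTTTAACG AGAAGGGCCA GTTGAAATGT TCGTTTTGCG GTAAGCTGCA GGATCAGGTC"
print(len(clean_sequence(first_line)))  # 60 bases on the first line
print(round(gc_percent(first_line), 1))
```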
 
Protein sequence
MFNEKGQLKC SFCGKLQDQV KKLVAGPGVY ICDECIELCN EIIEEELSED LGLELRDIPK 
PREIKDYLDQ YVIGQEYAKK ILAVAVYNHY KRINLGGKLE DVELQKSNIV MLGPTGSGKT
LLAQTLARFL NVPFAIADAT SLTEAGYVGE DVENILLKLI QAADYDVEKA EKGIVYIDEV
DKIARKSENP SITRDVSGEG VQQALLKILE GTVASVPPQG GRKHPHQEFI QLDTTNILFI
CGGAFEGIDK IIQSRVAKKT MGFGAELTLK RDRKLGDILR NILPQDLLKY GLIPEFVGRL
PVIVTLDPLN QEDLVRILVE PRNALVKQYE KLFEIDGVAL EFQEEALQAI AEEAIRRNTG
ARGLRAILEE IMLNVMYDIP SRGDVAKCTI SRETVVNREN PLIITVERSK KKKESAL
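
The protein sequence can be reproduced from the gene sequence by codon-wise translation. This is a minimal sketch, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (it differs in which codons may act as start codons); the `translate` helper is illustrative, not a library function.

```python
from itertools import product

# Codon table built in TCAG order; assumed valid for translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence; the result matches the start of the protein.
first_line = "ATGTTTAACG AGAAGGGCCA GTTGAAATGT TCGTTTTGCG GTAAGCTGCA GGATCAGGTC"
print(translate(first_line))  # MFNEKGQLKCSFCGKLQDQV
```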