Gene GM21_3010 details


Gene Information

Locus tag: GM21_3010
Symbol: clpX
ID: 8138356
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 3497592
End bp: 3498845
Gene Length: 1254 bp
Protein Length: 417 aa
Translation table: 11
GC content: 60%
IMG OID: 644870611
Product: ATP-dependent protease ATP-binding subunit ClpX
Protein accession: YP_003022797
Protein GI: 253701608
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1219] ATP-dependent protease Clp, ATPase subunit
TIGRFAM ID: [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX)
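
The coordinate and length fields in this record are mutually consistent, which a little arithmetic can confirm. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene length includes the stop codon; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3497592
end_bp = 3498845

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1254 0 417
```

Under these assumptions the result agrees with the listed gene length of 1254 bp and protein length of 417 aa.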


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 84
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGAGCAGAA GAGATGACCG TTCCGACACG CTGATCTGTT CCTTTTGCGG GAAGAGCCAG 
GAAGAGGTGA AGAAGCTGAT TGCGGGGCCT ACGGTCTACA TCTGCGACGA GTGCATCGAG
CTTTGCAACG ACATCATCGC GGAGGAGTCC AAACTGGAGG ATGCCACCGC AACCGATGTG
AGGAAACTTC CCAAGCCGCA GGAAATCAAG GAAGTCCTCG ATGAATACGT GATCGGCCAG
TCCAGGGCGA AAAAGGTCCT GGCCGTAGCC GTGTACAACC ATTACAAGAG GGTCGAGGCC
GCGGTGAAGC CGGGCGACGT CGAGATGCAG AAGAGCAACA TCCTGCTTCT GGGCCCAACA
GGCAGCGGCA AAACGCTCCT GGCGCAGACC CTGGCCCGCA TCCTCAAGGT GCCTTTCGCC
ATGGCGGACG CCACCAACTT GACCGAGGCG GGTTACGTCG GCGAGGACGT GGAGAACATC
ATCCTGACCC TCTTGCAGGC GTCCGATTAC GACGTGGAGA AGGCGCAGAA GGGGATCATC
TACATCGACG AGATCGACAA GATCGCCAGG AAATCCGACT CGCCCTCCAT CACCCGCGAC
GTTTCGGGCG AGGGGGTGCA GCAGGCCCTT TTGAAGATCA TCGAAGGGAC CGTGGCGAGC
GTCCCCCCCA AGGGTGGGCG CAAGCACCCG CAGCAGGAGT TCCTTAAGGT GGACACCACC
AACATCCTGT TCATCTGCGG CGGGGCCTTC CCCGGGTTGG ACAGCATCAT CCAGCAGAGG
ATCGGGGTCA AGACGCTCGG CTTCGGCGCG GACGTCAAGA AGAAGGTGGA GAAGAAGGCG
GGCGAACTGC TGGCCGGGGT GACCCCTGAG GATCTCTTGA AGTTCGGTTT CATCCCCGAG
TTCGTGGGGC GTCTTCCCAT GCTCGCCTCG CTCTCCGAGC TCGACGAGGA GGCGATGGTC
CAGATCCTCA AGGAGCCGAA GAACGCGCTG ATCAAGCAGT ACCAGAAGCT GTTCGATATG
GAGCACGTGA AGCTGAAGTT CACCGACGGC TCCCTGGTCG CCATAGCACG CGAGGCCCTG
AAGCGAAAGA CCGGCGCCCG CGGCCTGCGC TCCATCCTGG AAAACGCGAT GCTGGACATC
ATGTACGAGA TCCCCTCCCA GAGCATGGTG AAGGAAGTGG TCATCAACGA AGAGGTGATC
TACAGCAAGG AAAAGCCGAT CATCGTCTAC GAGAACGTGG CGGAAAGCGC CTGA
 
Protein sequence
MSRRDDRSDT LICSFCGKSQ EEVKKLIAGP TVYICDECIE LCNDIIAEES KLEDATATDV 
RKLPKPQEIK EVLDEYVIGQ SRAKKVLAVA VYNHYKRVEA AVKPGDVEMQ KSNILLLGPT
GSGKTLLAQT LARILKVPFA MADATNLTEA GYVGEDVENI ILTLLQASDY DVEKAQKGII
YIDEIDKIAR KSDSPSITRD VSGEGVQQAL LKIIEGTVAS VPPKGGRKHP QQEFLKVDTT
NILFICGGAF PGLDSIIQQR IGVKTLGFGA DVKKKVEKKA GELLAGVTPE DLLKFGFIPE
FVGRLPMLAS LSELDEEAMV QILKEPKNAL IKQYQKLFDM EHVKLKFTDG SLVAIAREAL
KRKTGARGLR SILENAMLDI MYEIPSQSMV KEVVINEEVI YSKEKPIIVY ENVAESA
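
The gene sequence begins with TTG while the protein begins with M. This is expected under translation table 11, where TTG can act as an initiator codon and is then read as methionine. The following is a minimal sketch of that relationship, not the database's own translation pipeline. The function name `translate_cds` and the `START_CODONS` set are hypothetical, and the set lists only the three most common bacterial initiators rather than every start codon permitted by table 11.

```python
# Standard genetic code, codons ordered by base T, C, A, G at each position.
# Translation table 11 uses the same amino acid assignments for elongation.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Assumption: only the most common bacterial initiator codons are listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a leading initiator codon as M."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends the protein
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_block = "TTGAGCAGAA GAGATGACCG TTCCGACACG"
print(translate_cds(first_block))  # MSRRDDRSDT
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function is a quick way to compare the translation against the record.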