Gene Franean1_5272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5272 
SymbolclpX 
ID5673606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6341448 
End bp6342740 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content65% 
IMG OID641244127 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_001509536 
Protein GI158317028 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.308962 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.162196 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGCA TCGGTGATGG CGGTGACCTG CTCAAGTGCT CCTTCTGCGG TAAGTCTCAG 
AAGCAGGTGA AGAAGCTCAT CGCCGGCCCC GGCGTCTACA TCTGCGATGA GTGCATCGAT
CTCTGCAACG AGATCATCGA GGAGGAGCTC TCCGAGTCCT CGGAGCTCAA GTGGGAGGAG
CTCCCGAAGC CCCGGGAGAT CTACGAGTTC CTCGACGGGT ACGTGGTCGG CCAGGAGGCG
GCGAAGAAGA CGCTGTCGGT GGCCGTCTAC AACCATTACA AGCGGGTGCA GGCGGGCGGT
GCCTCCGGCG GTGACGCCGG CAAGGGCGAG GTGGAGCTCG CGAAGAGCAA CATCCTGCTG
CTGGGCCCCA CGGGGTGCGG CAAGACCCTG CTGGCGCAGA CGCTGGCCCG GATGCTGAAC
GTCCCGTTCG CCATCGCCGA CGCGACCGCG CTCACCGAGG CCGGATATGT CGGCGAGGAT
GTCGAGAACA TTCTTCTCAA ACTCATCCAG GCCGCCGACT ACGACGTCAA GAAGGCCGAA
ACCGGCATCA TCTACATCGA TGAGGTCGAC AAGATCGCCC GGAAGTCGGA GAACCCCAGC
ATCACCCGGG ACGTCTCCGG CGAGGGCGTG CAGCAGGCGC TGCTGAAGAT TCTCGAGGGA
ACGACGGCGA GTGTCCCGCC GCAGGGCGGC CGCAAGCACC CGCACCAGGA GTTCATTCAG
ATCGACACGA CGAACGTCCT GTTCATCGTC GGTGGGGCTT TCGCCGGTCT GGACCGCATC
ATCGAGTCGC GCATCGGCAA GAAGTCGCTG GGGTTCCGCG CGGTGCTGCA CGGCAAGGAC
GACCCGGACG CCTCGAACGT CTTCGGTGAC ATCATGCCGG AGGACCTCCT CAAGTACGGA
ATGATCCCGG AGTTCATCGG CCGGCTGCCG ATCATCACCA GCGTCTCCAA CCTCGACCGC
GAGGCGCTAA TCCGGATCCT CACCGAGCCG AAGAACGCGC TCGTCCGCCA GTACAAGCGG
CTGTTCGAGC TGGACGGCGT CGACCTCGAC TTCACCACCG ACGCACTCGA GGCCATCGCG
GACCAGGCCA TCCTGCGCGG GACGGGCGCC CGCGGCCTGC GCGCGATCAT GGAAGAGGTC
CTGCTCTCGG TGATGTACGA CATCCCGAGC CGTAAGGACG TCGCCCGCGC GGTGATCACC
CGGGAGGTCG TGCTCGAGCA CGTCAACCCG ACCCTGGTGC CACGCGACGT CGCCGCGTCG
AAGCGCGGCC CGCGCCAGGA GAAGTCCGCC TGA
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL SESSELKWEE 
LPKPREIYEF LDGYVVGQEA AKKTLSVAVY NHYKRVQAGG ASGGDAGKGE VELAKSNILL
LGPTGCGKTL LAQTLARMLN VPFAIADATA LTEAGYVGED VENILLKLIQ AADYDVKKAE
TGIIYIDEVD KIARKSENPS ITRDVSGEGV QQALLKILEG TTASVPPQGG RKHPHQEFIQ
IDTTNVLFIV GGAFAGLDRI IESRIGKKSL GFRAVLHGKD DPDASNVFGD IMPEDLLKYG
MIPEFIGRLP IITSVSNLDR EALIRILTEP KNALVRQYKR LFELDGVDLD FTTDALEAIA
DQAILRGTGA RGLRAIMEEV LLSVMYDIPS RKDVARAVIT REVVLEHVNP TLVPRDVAAS
KRGPRQEKSA