Gene Francci3_1207 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1207 
SymbolclpX 
ID3903561 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1440399 
End bp1441685 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content64% 
IMG OID637878540 
ProductATP-dependent protease ATP-binding subunit ClpX 
Protein accessionYP_480314 
Protein GI86739914 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0448569 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.478447 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCACGCA TCGGTGATGG CGGTGACCTG CTCAAGTGCT CGTTCTGCGG TAAGTCGCAG 
AAGCAGGTGA AGAAGCTCAT CGCCGGCCCT GGCGTCTATA TCTGCGACGA GTGCATCGAT
CTCTGCAACG AGATCATCGA GGAGGAGCTG TCCGAATCCT CCGAGCTCAA ATGGGACGAA
CTGCCCAAGC CGCGCGAGAT CTACGAGTTC CTCGACAGTT ACGTGGTGGG GCAGGAGACG
GCGAAGAAGA CCCTCTCCGT CGCGGTCTAC AACCACTACA AGCGGGTCCA GGCGGGCGGA
TCCAGCGGTG ACGGGAGCAA GGGCGAGGTC GAGCTCGCGA AGAGCAACAT CCTGCTGCTC
GGGCCGACGG GCTGCGGCAA GACGCTGCTC GCCCAGACTC TCGCGCGAAT GCTCAACGTC
CCGTTTGCCA TCGCCGACGC CACCGCGCTG ACCGAGGCGG GCTACGTCGG GGAAGACGTC
GAGAACATTC TGCTCAAACT CATTCAGGCC GCCGACTATG ACGTCAAAAA GGCCGAGACC
GGGATTATCT ACATCGACGA GGTCGACAAG ATCGCCCGGA AGTCGGAGAA CCCGAGCATC
ACGCGGGACG TGTCCGGTGA GGGCGTGCAA CAGGCCCTGT TGAAGATCCT CGAAGGCACC
ACGGCCAGCG TGCCGCCACA GGGCGGGCGC AAGCACCCGC ATCAGGAGTT CATCCAGATC
GACACGACGA ACGTGCTGTT CATCGTGGGC GGGGCGTTCG CCGGGCTCGA CCGGATCATC
GAGTCGAGGA TCGGCAAGAA GTCGCTGGGG TTCCGCGCCG TACTGCACGG CAAGGACGAC
CCGGACGGAT CCGATGTCTT CGGCGACATC ATGCCCGAGG ACCTGCTGAA GTACGGCATG
ATTCCGGAGT TCATCGGCCG GCTCCCGGTG ATCACCAGCG TGTCCAACCT GGATCGTGAG
GCGCTGATCC GCATCCTCAC CGAGCCGAAG AACGCCCTCG TCCGCCAGTA CAAGCGGCTG
TTCGAGCTGG ACAGCGTCGA CCTCGACTTC ACCTCGGACG CGCTCGAGGC CATCGCGGAC
CAGGCGATCC TGCGTGGGAC CGGTGCTCGT GGGCTGCGGG CAATCATGGA AGAGGTCCTG
CTCTCGGTGA TGTACGACAT CCCGAGCCGC AAGGATGTGG CCCGGGTCGT CGTCACCCGC
GAGGTCGTGC TGGAGCACGT CAATCCGACG CTGGTCCCGC GCGACGTCGT GTCCAAGCGG
GCTCCCCGCC AGGAGAAGTC GGCCTGA
 
Protein sequence
MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL SESSELKWDE 
LPKPREIYEF LDSYVVGQET AKKTLSVAVY NHYKRVQAGG SSGDGSKGEV ELAKSNILLL
GPTGCGKTLL AQTLARMLNV PFAIADATAL TEAGYVGEDV ENILLKLIQA ADYDVKKAET
GIIYIDEVDK IARKSENPSI TRDVSGEGVQ QALLKILEGT TASVPPQGGR KHPHQEFIQI
DTTNVLFIVG GAFAGLDRII ESRIGKKSLG FRAVLHGKDD PDGSDVFGDI MPEDLLKYGM
IPEFIGRLPV ITSVSNLDRE ALIRILTEPK NALVRQYKRL FELDSVDLDF TSDALEAIAD
QAILRGTGAR GLRAIMEEVL LSVMYDIPSR KDVARVVVTR EVVLEHVNPT LVPRDVVSKR
APRQEKSA