Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_1207 |
Symbol | clpX |
ID | 3903561 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 1440399 |
End bp | 1441685 |
Gene Length | 1287 bp |
Protein Length | 428 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 637878540 |
Product | ATP-dependent protease ATP-binding subunit ClpX |
Protein accession | YP_480314 |
Protein GI | 86739914 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1219] ATP-dependent protease Clp, ATPase subunit |
TIGRFAM ID | [TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0448569 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.478447 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACGCA TCGGTGATGG CGGTGACCTG CTCAAGTGCT CGTTCTGCGG TAAGTCGCAG AAGCAGGTGA AGAAGCTCAT CGCCGGCCCT GGCGTCTATA TCTGCGACGA GTGCATCGAT CTCTGCAACG AGATCATCGA GGAGGAGCTG TCCGAATCCT CCGAGCTCAA ATGGGACGAA CTGCCCAAGC CGCGCGAGAT CTACGAGTTC CTCGACAGTT ACGTGGTGGG GCAGGAGACG GCGAAGAAGA CCCTCTCCGT CGCGGTCTAC AACCACTACA AGCGGGTCCA GGCGGGCGGA TCCAGCGGTG ACGGGAGCAA GGGCGAGGTC GAGCTCGCGA AGAGCAACAT CCTGCTGCTC GGGCCGACGG GCTGCGGCAA GACGCTGCTC GCCCAGACTC TCGCGCGAAT GCTCAACGTC CCGTTTGCCA TCGCCGACGC CACCGCGCTG ACCGAGGCGG GCTACGTCGG GGAAGACGTC GAGAACATTC TGCTCAAACT CATTCAGGCC GCCGACTATG ACGTCAAAAA GGCCGAGACC GGGATTATCT ACATCGACGA GGTCGACAAG ATCGCCCGGA AGTCGGAGAA CCCGAGCATC ACGCGGGACG TGTCCGGTGA GGGCGTGCAA CAGGCCCTGT TGAAGATCCT CGAAGGCACC ACGGCCAGCG TGCCGCCACA GGGCGGGCGC AAGCACCCGC ATCAGGAGTT CATCCAGATC GACACGACGA ACGTGCTGTT CATCGTGGGC GGGGCGTTCG CCGGGCTCGA CCGGATCATC GAGTCGAGGA TCGGCAAGAA GTCGCTGGGG TTCCGCGCCG TACTGCACGG CAAGGACGAC CCGGACGGAT CCGATGTCTT CGGCGACATC ATGCCCGAGG ACCTGCTGAA GTACGGCATG ATTCCGGAGT TCATCGGCCG GCTCCCGGTG ATCACCAGCG TGTCCAACCT GGATCGTGAG GCGCTGATCC GCATCCTCAC CGAGCCGAAG AACGCCCTCG TCCGCCAGTA CAAGCGGCTG TTCGAGCTGG ACAGCGTCGA CCTCGACTTC ACCTCGGACG CGCTCGAGGC CATCGCGGAC CAGGCGATCC TGCGTGGGAC CGGTGCTCGT GGGCTGCGGG CAATCATGGA AGAGGTCCTG CTCTCGGTGA TGTACGACAT CCCGAGCCGC AAGGATGTGG CCCGGGTCGT CGTCACCCGC GAGGTCGTGC TGGAGCACGT CAATCCGACG CTGGTCCCGC GCGACGTCGT GTCCAAGCGG GCTCCCCGCC AGGAGAAGTC GGCCTGA
|
Protein sequence | MARIGDGGDL LKCSFCGKSQ KQVKKLIAGP GVYICDECID LCNEIIEEEL SESSELKWDE LPKPREIYEF LDSYVVGQET AKKTLSVAVY NHYKRVQAGG SSGDGSKGEV ELAKSNILLL GPTGCGKTLL AQTLARMLNV PFAIADATAL TEAGYVGEDV ENILLKLIQA ADYDVKKAET GIIYIDEVDK IARKSENPSI TRDVSGEGVQ QALLKILEGT TASVPPQGGR KHPHQEFIQI DTTNVLFIVG GAFAGLDRII ESRIGKKSLG FRAVLHGKDD PDGSDVFGDI MPEDLLKYGM IPEFIGRLPV ITSVSNLDRE ALIRILTEPK NALVRQYKRL FELDSVDLDF TSDALEAIAD QAILRGTGAR GLRAIMEEVL LSVMYDIPSR KDVARVVVTR EVVLEHVNPT LVPRDVVSKR APRQEKSA
|
| |