Gene Information |
Locus tag | Franean1_0229 |
Symbol | |
ID | 5668654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 279463 |
End bp | 282075 |
Gene Length | 2613 bp |
Protein Length | 870 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641239158 |
Product | ATPase |
Protein accession | YP_001504602 |
Protein GI | 158312094 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0542] ATPases with chaperone activity, ATP-binding subunit |
TIGRFAM ID | [TIGR03346] ATP-dependent chaperone ClpB |
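
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length (neither assumption is stated in the record). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length(279463, 282075)
print(length, protein_length(length))  # prints "2613 870"
```

Under these assumptions the result agrees with the Gene Length (2613 bp) and Protein Length (870 aa) fields.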
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.388652 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCTG ACCGTCTCAC CGCCCGCTCG CAGGAGGCTC TGTCCTCCGC GATCAGCCGT GCCACCGGCG ACGGATCGCC GCTCGTCGAC CCGCTGCACC TGCTGACCGC CCTGCTCGAG GCGCCCGACG GTGTCGGTGC CGCCCTGCTG GAGGCCGTCG GCACCCCGGC GGCGGACATC CGCTCCCGGG CGGAGGCCGC GGTGGGCCGG CTCCCCCGCG CCGCCGGGGC CAACGTGGCG CCGCCGCAAC TGTCCCGGCA GCTCGTCGCC GTCCTGAACA ACGCCGAGCG CCAGGCCGCC CGGCTCGGCG ATGAGTACAC CTCGGTCGAG CACCTGGTCG TGGCGCTCGC GGAGGAGGGC GGGGAGGCGT CCCGCATCCT CGCCGAGGCG GGCGCGACCC CGGACGCCCT GCGCGGCGCG TTCGACCGCG TCCGCGGCGG CGCCCGCCGC GTCACCAGCC GGGATCCGGA GGGGGCCTAC CGGGCGCTCG AGAAGTACTC CATCGACCTC ACCGCGCGGG CCCGCGACGG CAAGCTCGAC CCGGTGATCG GCCGCGACAC CGAGATCCGC CGGGTCGTGC AGGTTCTCTC CCGGCGCACG AAGAACAACC CGGTCCTGAT CGGCGAGCCC GGCGTCGGCA AGACGGCGAT CGTCGAGGGG CTCGCGCTGC GGGTGGCCGC GGGTGACGTC CCGGAGTCGC TGCGCGGGCG GCGCATCGTC TCGCTCGACC TCGGCTCGAT GGTCGCCGGC TCCAAGCTGC GCGGCGAGTT CGAGGAACGG CTGACCTCGG TGCTCACCGA GATCCGCGAG GCCGAGGGCC AGATCATCAC CTTCATCGAC GAGCTGCACA CCGTCGTCGG CGCCGGCGCG GCCGAGGGCG CGATGGACGC CGGCAACATG CTCAAGCCGA TGCTCGCCCG CGGTGAGCTG CGCATGATCG GCGCGACGAC GCTGGACGAG TACCGCACCC GCATCGAGAA GGACCCGGCG CTGGAGCGCC GCTTCCAGCC CGTGATGGTC GGGGAGCCGT CCGTGGAGGA CACGATCGGC ATCCTGCGCG GGCTCAAGGA GCGTTACGAG GTCCACCACG GGGTGCGGAT CACCGACTCG GCGCTGGTGG CCGCGGCCAC CCTGTCCGAC CGGTACGTCA CCGCCCGGTT CCTCCCCGAC AAGGCGATCG ACCTGATGGA TGAGGCGGCG TCCCGGCTAC GGATGGAGAT CGACAGCCGG CCGGTCGCCG TCGACGAGCT CGAGCGGGCC GTGCGCCGTC TCGAGATCGA GGACATGGCG CTGTCGAAGG AGAACGACGA CGCGTCCCGG GAACGGCGCG ACCGGCTGCA GCGCGAGCTG GCGGAGAAGC GCGAGGAGCT CTCCGCGCTG ACCGCGCGGT GGCAGCGGGA GAAGAACTCC ATCTCCGAGG TCCAGAAGAT CAAGGAGGAG CTGGAGAACG CCCGCCGCGC CGCCGAGATG GCCGAGCGCG ACCTCGACCT CGCCAAGGCC GGTGAGCTGC GGTACGGCAC GATCCCGACG CTGGAGAAGC GGCTCGCCGA GGCGACCGGC GCGCTCGCCG GATCGGACTC GCCCGGCGGG GCGATGCTCA GCGAGGAGGT CGGTCCCGAC GACGTCGCCG AGGTCGTCGC CTCGTGGACG GGCATCCCCG CCGGCCGCAT GCTCGAGGGC GAGACGAGCA AGCTCCTGCG CATGGAGACG GAGCTGCACC GTCGCGTGAT CGGGCAGGAC GAGGCCGTGC GCACCGTGGC GGACGCCGTC CGCCGCGCGC GGGCCGGCAT CGCCGACCCG 
GACCGGCCGA CCGGGTCGTT CCTCTTCCTC GGGCCGACGG GTGTGGGCAA GACGGAGCTG GCCAAGGCGC TCGCCGACTT CCTGTTCGAC GACGAGCGGG CGGTCGTGCG CATCGACATG AGCGAGTACG CCGAGAAGCA CTCGGTGGCG CGGTTGATCG GCGCGCCTCC CGGCTACGTC GGCTTCGAGT CCGGCGGCCA GCTCACCGAG GCGATCCGGC GCCGCCCGTA CAGCGTGATC CTGCTCGACG AGGTCGAGAA GGCGCACCCG GACGTCTTCG ACGTGCTGCT CGCCGTACTC GACGACGGCC GGCTGACCGA CGGCCAGGGC CGCACGGTCG ACTTCCGGAA CACCATCCTG ATCCTGACCT CGAACCTGGG GTCGGTCTAC ATCGCCGACC CGACCCTGCC CCCGCAGGTC CGCCACGATT CGGTGATGGT CGCCGTGCGC GACGCCTTCA AGCCGGAGTT CCTGAACCGG CTCGACGACG TGCTGGTCTT CGAGCAGCTC GGCCGGGACG ATCTGACGAA GATCGTCGAC ATCCAGATCG ACCGGCTGCG CAGGCGGCTG GCCGACCGCC GGATCTCCCT CGAGGTGACC GACGCCGCCA AGGTCTGGCT CGCGGACGCC GGCTACGACC CGGTGTACGG GGCGCGGCCG CTGCGCCGCC TGGTGCAGAC CTCGATCGGC GACCAGCTCG CCCGCGAGCT GCTGGCCGGC CAGATCAGGG ACGGCGACGG GGTCGTGGTC GACGTGGACG GGCAGCGCTC GGCGCTGAGC GTCCACTCCG CGGCCCGCGC GCAGGCCATC TGA
|
Protein sequence | MNADRLTARS QEALSSAISR ATGDGSPLVD PLHLLTALLE APDGVGAALL EAVGTPAADI RSRAEAAVGR LPRAAGANVA PPQLSRQLVA VLNNAERQAA RLGDEYTSVE HLVVALAEEG GEASRILAEA GATPDALRGA FDRVRGGARR VTSRDPEGAY RALEKYSIDL TARARDGKLD PVIGRDTEIR RVVQVLSRRT KNNPVLIGEP GVGKTAIVEG LALRVAAGDV PESLRGRRIV SLDLGSMVAG SKLRGEFEER LTSVLTEIRE AEGQIITFID ELHTVVGAGA AEGAMDAGNM LKPMLARGEL RMIGATTLDE YRTRIEKDPA LERRFQPVMV GEPSVEDTIG ILRGLKERYE VHHGVRITDS ALVAAATLSD RYVTARFLPD KAIDLMDEAA SRLRMEIDSR PVAVDELERA VRRLEIEDMA LSKENDDASR ERRDRLQREL AEKREELSAL TARWQREKNS ISEVQKIKEE LENARRAAEM AERDLDLAKA GELRYGTIPT LEKRLAEATG ALAGSDSPGG AMLSEEVGPD DVAEVVASWT GIPAGRMLEG ETSKLLRMET ELHRRVIGQD EAVRTVADAV RRARAGIADP DRPTGSFLFL GPTGVGKTEL AKALADFLFD DERAVVRIDM SEYAEKHSVA RLIGAPPGYV GFESGGQLTE AIRRRPYSVI LLDEVEKAHP DVFDVLLAVL DDGRLTDGQG RTVDFRNTIL ILTSNLGSVY IADPTLPPQV RHDSVMVAVR DAFKPEFLNR LDDVLVFEQL GRDDLTKIVD IQIDRLRRRL ADRRISLEVT DAAKVWLADA GYDPVYGARP LRRLVQTSIG DQLARELLAG QIRDGDGVVV DVDGQRSALS VHSAARAQAI
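
The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to strip that layout and compute a GC fraction, which could then be compared with the GC content field (73%). It assumes the sequence has been copied into a Python string; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the short example uses only the first two blocks of the gene sequence, so its value is not the whole-gene figure.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two 10-base blocks of the gene sequence, for illustration only.
fragment = clean_sequence("ATGAACGCTG ACCGTCTCAC")
print(len(fragment), gc_fraction(fragment))
```

Applying `len(clean_sequence(...))` to the full gene and protein strings gives a quick check against the Gene Length and Protein Length fields.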