Gene Information |
Locus tag | Franean1_6001 |
Symbol | groEL |
ID | 5674322 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 7319731 |
End bp | 7321380 |
Gene Length | 1650 bp |
Protein Length | 549 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641244849 |
Product | chaperonin GroEL |
Protein accession | YP_001510251 |
Protein GI | 158317743 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02348] chaperonin GroL |
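The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the stop codon; neither convention is stated on this page, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(7319731, 7321380)
print(length)                  # 1650
print(protein_length(length))  # 549
```

Under these assumptions the result agrees with the listed Gene Length (1650 bp) and Protein Length (549 aa).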
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0956308 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAGA TTCTGACGTT CAACGAGGAC GCCCGCCGGT CGCTCGAGCG GGGCGTGAAC GCCCTCGCGG ACGCGGTCAA GGTGACGATC GGTCCGCGCG GCCGCAACGT CGTCATCAAC AAGTCGTACG GCGCGCCGAC GATCACCAAC GACGGTGTGA CGATCGCGCG TGAGGTCGAG CTGGACGACC CTTACCAGAA CCTGGGGGCG CAGCTCGCCA AGGAGGTCGC CACCAAGACC AACGACGTCG CCGGCGACGG GACGACGACG GCCACGGTGC TGGCCCAGGA GATGGTCCGC CAGGGCCTGC GGCAGGTGAC GGCCGGGGCC GCCCCGCTCT CGCTCAAGAT CGGCATCGAG GCGGCGGTCG CCGCTGTCTC GTCGGCGCTG CTGGAAGCCG CGATCGAGAT CGGCTCGAAG GAGACCATCG CGCAGGTCGC TGCGATCTCG GCGCAGGACG CCCAGGTCGG CGAACTCATC GCCGAGGCGA TGGACAAGAT CGGCAAGGAC GGTGTGATCA CCATCGAGGA GAGCCAGACC ATGGGTCTGG ACCTCGAGCT CACCGAGGGG ATGCAGTTCG ACAAGGGCTA CATCTCGCCG TACTTCGTGA CCGACCAGGA GAGCATGGAG GCCGTCCTCG AGGACGCCTA CGTGCTGCTG CACCCTGGCA AGATCAGCGC GCTGAACGAC ATCCTGCCGA TCCTCGAGCA GGTGGTGCAG GAGCGTAAGC CGCTGCTGAT CATCGCTGAG GACGTCGAGG GCGAGGCGCT GTCGACCCTG GTCGTCAACG CGGTGCGCAA GACCTTCCAG GTCGTCGCGG TGAAGGCCCC CGGCTTCGGT GACCGCCGCA AGGCGATGCT GCAGGACCTC GCGGTCCTGA CCGGTGGCCA GGTCGTCGCG ACCGAGGTGG GTCTCAAGCT CGACTCGATC ACGCTCGCCG AGCTCGGCCG GGCGCGGCGG ATCGTCGTCG ACAAGGACCT CACCACGATC GTCGACGGTG CCGGCGACGC CGACGCGGTC GCCGAGCGCG TCCGGCAGAT GAAGGCGGAG ATCGAACTCT CCGACTCGGA CTGGGACCGC GAGAAGCTCC AGGAGCGCCT GGCCAAGCTG GCCGGCGGGG TCGCCGTCAT CCGCGTCGGC GCCGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGTC TCGAGGACGC CGTGTCGGCG ACCCGGGCCG CGGTCGAGGA GGGCATCGTG GCGGGCGGCG GCAGCGCGCT CGTCCACGCG GCCAAGGTCC TCGACGGTGA CCTCGGCCTG ACCGGTGACG AGCGTTCCGG GGTGCGGGTC GTGCGCGCGG CGCTCGACGC GCCGCTCACC TGGATCGCGC GCAACGCCGG CCTCGAGGGC GCGGTGATCG TGTCCAAGGT CCGCGAGGGC GACGTCGGCC GGGGCTTCAA CGCGGCCACC GGCGAGTACG TCGACCTGGT CGCGGCCGGC GTGGTCGACC CCGTGAAGGT CACCCGCTCC GCGGTGGCGA ACGCGGCCTC CATCGCCGCG CTGCTCATCA CCACCGAGAG CCTGGTGGCG GAGAAGCCCG AGGAGGCCGA GGCCGGCCAC GGCCACGACC ACTCGCACGG CGGCCATGGC CACGGCCACT CGCACGGGCC GGGTTTCTGA
|
Protein sequence | MPKILTFNED ARRSLERGVN ALADAVKVTI GPRGRNVVIN KSYGAPTITN DGVTIAREVE LDDPYQNLGA QLAKEVATKT NDVAGDGTTT ATVLAQEMVR QGLRQVTAGA APLSLKIGIE AAVAAVSSAL LEAAIEIGSK ETIAQVAAIS AQDAQVGELI AEAMDKIGKD GVITIEESQT MGLDLELTEG MQFDKGYISP YFVTDQESME AVLEDAYVLL HPGKISALND ILPILEQVVQ ERKPLLIIAE DVEGEALSTL VVNAVRKTFQ VVAVKAPGFG DRRKAMLQDL AVLTGGQVVA TEVGLKLDSI TLAELGRARR IVVDKDLTTI VDGAGDADAV AERVRQMKAE IELSDSDWDR EKLQERLAKL AGGVAVIRVG AATEVELKEK KHRLEDAVSA TRAAVEEGIV AGGGSALVHA AKVLDGDLGL TGDERSGVRV VRAALDAPLT WIARNAGLEG AVIVSKVREG DVGRGFNAAT GEYVDLVAAG VVDPVKVTRS AVANAASIAA LLITTESLVA EKPEEAEAGH GHDHSHGGHG HGHSHGPGF
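The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand) and uses the standard amino acid assignments, which translation table 11 shares with table 1. Alternative start codons, which table 11 reads as M at the initiation position, are not handled here. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Codon table in the conventional T, C, A, G ordering of the three bases.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # The page prints the sequence in 10-base blocks, so drop whitespace first.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGCCGAAGA TTCTGACGTT CAACGAGGAC"))  # MPKILTFNED
# Last 15 bases of the gene sequence, ending in the TGA stop codon.
print(translate("GGGCCGGGTTTCTGA"))  # GPGF
```

Both fragments reproduce the corresponding ends of the listed protein sequence (MPKILTFNED at the start, GPGF at the end).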