Gene Franean1_0175 details

Gene Information

Locus tag: Franean1_0175
Symbol: groEL
ID: 5668600
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 211645
End bp: 213267
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 68%
IMG OID: 641239104
Product: chaperonin GroEL
Protein accession: YP_001504548
Protein GI: 158312040
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
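
The start, end, and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 211645
end_bp = 213267

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1623 bp) and Protein Length (540 aa).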


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0553371
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGA TCATTGCCTT CGACGAGGAG GCACGGCGCG GCCTGGAGCG CGGCATGAAC 
CAGCTGGCCG ACGCGGTCAA GGTCACGCTC GGCCCCAAGG GTCGCAACGT CGTGCTGGAG
AAGAAGTGGG GCGTCCCCAC GATCACCAAC GACGGCGTCA GCATCGCCAA GGAGATCGAG
CTCGAGGACC CGTACGAGAA GATCGGCGCG GAGCTCGTCA AGGAAGTCGC GAAGAAGACC
AACGACGTCG CGGGTGACGG CACCACCACC GCGACCATTC TCGCCCAGGC TCTGGTGCGC
GAGGGCCTGC GCAACGTCGC CGCCGGCGCG AACCCGATGG GCCTGAAGAA GGGCATCGAG
GCCGCCGTCG AGCGTGTCTC CGAGGAGCTC TCCAGCGTCG CCAAGGACGT GGAGACCAAG
GAGCAGATCG CCTCCACCGC CTCCATCTCC GCCGGTGACC CGGCCATCGG CAGCATGATC
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC
TTCGGGCTCG AGCTCGAGCT CACCGAGGGC ATGCGCTTCG ACAAGGGCTA CATCTCGCCC
TACTTCGTCA CCGACACCGA CCGCATGGAA GCTGTCCTCG ACGACCCGTA CATCCTGATC
GCGAACAGCA AGATCTCCGC GGTCAAGGAC CTCCTCCCGA TCCTGGAGAA GGTCATGCAG
GCCGGCAAGC CGCTGGCCAT CATCTCCGAG GACGTCGAGG GCGAGGCCCT GGCCACCCTG
GTCGTCAACA AGATCCGCGG CACGTTCAAG AGCACCGCGG TCAAGGCGCC GGGCTTCGGT
GACCGCCGCA AGGCCATGCT GACCGACATC GCCGTCCTCA CCGGCGGCCA GGTCATCTCC
GAGGACGTCG GCCTCAAGCT CGAGGGCACC ACCGTCGACC TGCTCGGCCG GGCCCGCAAG
GTCGTCATCA CCAAGGACGA GACCACCATC GTCGAGGGTG CCGGCGACGC GGACCAGATC
GCGGGGCGGG TCAACCAGAT CCGCAACGAG ATCGAGAAGT CCGACTCCGA CTACGACCGC
GAGAAGCTCC AGGAGCGGCT GGCCAAGCTC GCCGGCGGCG TCGCGGTCAT CAAGGTCGGC
GCGGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGACGC CGTCTCGAAC
GCGAAGGCCG CGGTCGAGGA GGGCATCGTC GCCGGCGGTG GCGTCGCGCT CCTGCAGGCC
GCGACCAGCG CCTTCGAGAA GCTCGACCTC TCCGGCGACG AGGCCACCGG TGCGAACATC
GTCCGTCTCG CCCTGGAGGC GCCGATCAAG CAGATCGCGT TCAACAGCGG TCTCGAGGGC
GGCGTCGTGG TCGAGAAGGT GCGCAACCTC CCGACCGGGC ACGGCCTGAA CGCGGCGACC
GGCGAGTACG TCGACATGGT CGCCACCGGG ATCATCGACC CGGCGAAGGT CACCCGCTCG
GCGCTGCAGA ACGCCGCGTC GATCGCCGGC CTCTTCCTCA CCACCGAGGC CGTCATCGCG
GACAAGCCGG AGAAGGACAA GGCCCCGGCC ATGCCGGGTG GCGGCGGCGA GATGGACTTC
TGA
 
Protein sequence
MPKIIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGVPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT NDVAGDGTTT ATILAQALVR EGLRNVAAGA NPMGLKKGIE
AAVERVSEEL SSVAKDVETK EQIASTASIS AGDPAIGSMI AEAMDKVGKE GVITVEESNT
FGLELELTEG MRFDKGYISP YFVTDTDRME AVLDDPYILI ANSKISAVKD LLPILEKVMQ
AGKPLAIISE DVEGEALATL VVNKIRGTFK STAVKAPGFG DRRKAMLTDI AVLTGGQVIS
EDVGLKLEGT TVDLLGRARK VVITKDETTI VEGAGDADQI AGRVNQIRNE IEKSDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKEK KHRIEDAVSN AKAAVEEGIV AGGGVALLQA
ATSAFEKLDL SGDEATGANI VRLALEAPIK QIAFNSGLEG GVVVEKVRNL PTGHGLNAAT
GEYVDMVATG IIDPAKVTRS ALQNAASIAG LFLTTEAVIA DKPEKDKAPA MPGGGGEMDF
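
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes translation table 11, whose codon-to-amino-acid assignments match the standard code (the table differs in which start codons are permitted). The helper `translate` is a hypothetical name written for this example, and it does not handle alternative start codons, which this gene does not need because it begins with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons enumerated in TCAG order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGCCGAAGA TCATTGCCTT CGACGAGGAG"
print(translate(first_block))  # MPKIIAFDEE
```

The result matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to the same function should yield the full protein, with translation ending at the terminal TGA.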