Gene Franean1_3854 details


Gene Information

Locus tag: Franean1_3854
Symbol: groEL
ID: 5672217
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4576310
End bp: 4577947
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 68%
IMG OID: 641242732
Product: chaperonin GroEL
Protein accession: YP_001508152
Protein GI: 158315644
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
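
The length fields can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single stop codon. The record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
# Coordinates copied from the record above.
START_BP = 4576310
END_BP = 4577947


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a complete CDS: codon count minus one stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints: 1638 545
```

Both results match the Gene Length and Protein Length fields.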


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGAAGA TCATTGCCTT CGATGAAGAG GCACGGCGCG GCCTGGAGCG CGGCATGAAC 
CAGCTGGCCG ACGCGGTCAA GGTCACGCTC GGCCCCAAGG GTCGCAACGT CGTGCTGGAG
AAGAAGTGGG GCGTCCCCAC GATCACCAAC GACGGCGTCA GCATCGCCAA GGAGATCGAG
CTCGAGGACC CGTACGAGAA GATCGGCGCG GAGCTCGTCA AGGAAGTCGC GAAGAAGACC
AACGACGTCG CGGGTGACGG CACCACCACC GCCACGATCC TCGCCCAGGC GCTCGTCCGC
GAGGGCCTGC GCAACGTCGC CGCCGGTGCG AACCCGATCG GCCTGAAGAA GGGCATCGAG
GCCGCCGTCG AGCGTGTCTC CGAGGAGCTC TCCAGCGTCG CCAAGGACGT GGAGACCAAG
GAGCAGATCG CCTCGGCCGC CTCCATCTCC GCCGGTGACC CGGCCATCGG CGCCATGATC
GCCGAGGCGA TGGACAAGGT CGGCAAGGAA GGCGTCATCA CCGTCGAGGA GAGCAACACC
TTCGGGCTCG AGCTCGAGCT CACCGAGGGC ATGCGCTTCG ACAAGGGCTA CATCTCGCCC
TACTTCGTCA CCGACACCGA CCGCATGGAA GCCGTCCTCG ACGACCCGTA CATCCTGATC
ACCAACAGCA AGATCTCCGC GGTCAAGGAC CTCCTCCCGA TCCTGGAGAA GGTCATGCAG
GCCGGCAAGC AACTCGTGAT CCTCTCCGAG GACGTCGAGG GCGAGGCCCT GGCCACCCTG
GTCGTGAACA AGATCCGCGG CACGTTCAAG AGCGCCGCGG TCAAGGCGCC CGGCTTCGGT
GACCGCCGCA AGGCCATGCT GACCGACATC GCCGTCCTCA CCGGCGGCCA GGTCATCTCC
GAGGACATCG GCCTCAAGCT CGAGGGCACC ACCGTCGACC TGCTCGGCCG GGCCCGCAAG
GTCGTCATCA CCAAGGACGA GACCACCATC GTCGAGGGTG CCGGCGACGC GGACCAGATC
GCGGGGCGGG TCAACCAGAT CCGCAACGAG ATCGAGAAGT CCGACTCCGA CTACGACCGC
GAGAAGCTCC AGGAGCGGCT GGCCAAGCTC GCCGGCGGCG TCGCGGTCAT CAAGGTCGGC
GCGGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGACGC CGTCTCGAAC
GCGAAGGCCG CGGTCGAGGA GGGCATCGTC GCCGGCGGTG GCGTCGCGCT CCTGCAGGCC
TCCACCAGCG CCTTCGAGAA GCTCGACCTC TCCGGCGACG AGGCCACCGG CGCCCTGATC
GTCGAGCGCG CGCTCGCCGC ACCGCTACGC CAGATCGCCA CCAACGCCGG GCTCGAGGGC
GGCGTCGTGG TCGAGAAGGT CCGCGGTCTC CCGACCGGGC ACGGCCTGAA CGCCGCCACC
GGCGAGTACG TCGACATGAT CGCCGCCGGG ATCATCGACC CGGTGAAGGT CACCCGCTCG
GCGCTACAGA ACGCCGCGTC CATCACCGGC CTCTTCCTCA CCATCGAGGT CGTCGTCGCG
GACAAGCCGG AGCCCGCGGG CGCCGGCGGT GGTGCTGACG CGGCCGCCAT GGGCGGCATG
GGCGGCATGG GCTTCTGA
 
Protein sequence
MPKIIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGVPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT NDVAGDGTTT ATILAQALVR EGLRNVAAGA NPIGLKKGIE
AAVERVSEEL SSVAKDVETK EQIASAASIS AGDPAIGAMI AEAMDKVGKE GVITVEESNT
FGLELELTEG MRFDKGYISP YFVTDTDRME AVLDDPYILI TNSKISAVKD LLPILEKVMQ
AGKQLVILSE DVEGEALATL VVNKIRGTFK SAAVKAPGFG DRRKAMLTDI AVLTGGQVIS
EDIGLKLEGT TVDLLGRARK VVITKDETTI VEGAGDADQI AGRVNQIRNE IEKSDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKEK KHRIEDAVSN AKAAVEEGIV AGGGVALLQA
STSAFEKLDL SGDEATGALI VERALAAPLR QIATNAGLEG GVVVEKVRGL PTGHGLNAAT
GEYVDMIAAG IIDPVKVTRS ALQNAASITG LFLTIEVVVA DKPEPAGAGG GADAAAMGGM
GGMGF
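
The gene and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, not the database's own tooling. It builds the codon table from the compact NCBI layout (bases ordered T, C, A, G). Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted, so the standard amino-acid string is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table in NCBI layout: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocks-of-ten layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing letters other than A, C, G, T raise KeyError.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCGAAGA TCATTGCCTT CGATGAAGAG GCACGGCGCG GCCTGGAGCG CGGCATGAAC"
last_line = "GGCGGCATGG GCTTCTGA"
print(translate(first_line))  # prints: MPKIIAFDEEARRGLERGMN
print(translate(last_line))   # prints: GGMGF
```

The two outputs match the first 20 and the last 5 residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein and the listed GC content.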