Gene Francci3_4398 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4398 
SymbolgroEL 
ID3907373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5256902 
End bp5258524 
Gene Length1623 bp 
Protein Length540 aa 
Translation table11 
GC content67% 
IMG OID637881729 
Productchaperonin GroEL 
Protein accessionYP_483473 
Protein GI86743073 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.600466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA TCATTGCCTT CGACGAGGAG GCTCGGCGCG GCCTGGAGCG CGGCATGAAC 
CAGCTGGCCG ATGCGGTCAA GGTCACGCTC GGGCCGAAGG GCCGCAACGT CGTGCTGGAG
AAGAAGTGGG GCGTCCCCAC GATCACCAAC GATGGTGTGA GCATCGCCAA GGAGATCGAG
CTCGAGGACC CGTACGAGAA GATCGGCGCG GAACTCGTCA AGGAAGTCGC GAAGAAGACC
AACGACGTCG CGGGTGACGG CACCACCACC GCCACCATCC TGGCCCAGGC CCTGGTGCGC
GAGGGTCTGC GCAACGTGGC CGCCGGCGCC AACCCCCTCG GGCTGAAGAA GGGCATCGAG
GTCGCGGTCG AGCGCGTCTC CGAGGAGCTG TCCAAGCAGG CCAAGGAGGT CGAGACCAAG
GAGCAGATCG CCTCCACGGC GTCCATCTCC GCGGGTGACT CGGCCATCGG CGGCCTCATC
GCCGAGGCGC TCGACAAGGT CGGCAAGGAA GGCGTCGTCA CCGTCGAGGA GAGCAACACC
TTCGGCCTCG AGCTCGAGCT CACCGAGGGT ATGCGCTTCG ACAAGGGCTA CATCTCGCCG
TACTTCGTCA CGGACGCGGA CCGTCAGGAA GCCGTCCTGG ACGACCCGTA CATCCTGATC
GTCAACAGCA AGATCTCCGC GGTCAAGGAC CTGCTCCCGC TGCTGGAGAA GGTCATGCAG
ACCGGTAAGC CGCTGGCGAT CATCGCCGAA GATGTCGAGG GCGAGGCGCT GGCCACCCTG
GTCGTCAACA AGATCCGCGG CACCTTCAAG AGCGCCGCGG TGAAGGCCCC CGGCTTCGGT
GACCGCCGCA AGGCGATCCT GGGCGACATC GCCATCCTGA CCGGTGGTCA GGTCATCTCC
GAGGACGTCG GCCTCAAGCT CGAGAGCACC TCGCTCGACC TGCTCGGCCG TGCCCGGAAG
ATTGTCGTTA CCAAGGACGA GACGACCGTC GTCGAGGGTG CCGGCGACCC CGACCAGATC
GCCGGTCGGG TCAGCCAGAT CCGCAACGAG ATCGAGAAGT CGGACTCGGA CTACGACCGC
GAGAAGCTCC AGGAGCGGCT CGCGAAGCTG GCTGGCGGCG TCGCCGTCAT CAAGGTCGGC
GCGGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGCA TCGAGGACGC GGTGTCCAAC
GCGAAGGCCG CTGTCGAGGA GGGCATCGTC GCCGGTGGCG GCGTCGCGCT GCTCCAGGCG
TCGATCACTG CCTTCGAGAA GTTGGACCTG TCCGGCGACG AGGCGACCGG TGCCAACATC
GTCCGGCTCG CGCTCGAGGC GCCCATCAAG CAGATCGCCT TCAACAGCGG TCTCGAGGGC
GGCGTCGTGG TCGAGAAGGT CCGCAACCTG CCGACCGGCC ACGGCCTGAA CGCGGCCACC
GGCGAGTACG TCGACCTGAT CGGCACCGGC ATCATCGACC CGGCCAAGGT CACCCGCTCC
GCCCTGCAGA ACGCCGCGTC GATCGCCGGC CTGTTCCTCA CCACCGAGGC CGTCATCGCC
GACAAGCCGG AGAAGAACCC GGCCCCGGCA GTCCCGGGCG GCGGCGGCGA CATGGACTTC
TAG
 
Protein sequence
MPKIIAFDEE ARRGLERGMN QLADAVKVTL GPKGRNVVLE KKWGVPTITN DGVSIAKEIE 
LEDPYEKIGA ELVKEVAKKT NDVAGDGTTT ATILAQALVR EGLRNVAAGA NPLGLKKGIE
VAVERVSEEL SKQAKEVETK EQIASTASIS AGDSAIGGLI AEALDKVGKE GVVTVEESNT
FGLELELTEG MRFDKGYISP YFVTDADRQE AVLDDPYILI VNSKISAVKD LLPLLEKVMQ
TGKPLAIIAE DVEGEALATL VVNKIRGTFK SAAVKAPGFG DRRKAILGDI AILTGGQVIS
EDVGLKLEST SLDLLGRARK IVVTKDETTV VEGAGDPDQI AGRVSQIRNE IEKSDSDYDR
EKLQERLAKL AGGVAVIKVG AATEVELKEK KHRIEDAVSN AKAAVEEGIV AGGGVALLQA
SITAFEKLDL SGDEATGANI VRLALEAPIK QIAFNSGLEG GVVVEKVRNL PTGHGLNAAT
GEYVDLIGTG IIDPAKVTRS ALQNAASIAG LFLTTEAVIA DKPEKNPAPA VPGGGGDMDF