Gene Francci3_2513 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2513 
SymbolgroEL 
ID3904657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2969278 
End bp2970927 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content67% 
IMG OID637879843 
Productchaperonin GroEL 
Protein accessionYP_481609 
Protein GI86741209 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.167914 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCAAGG ATCTGCGGTT CAATGTCGAG GCGCGCCGCC TGCTGGAGGC CGGGGTCAAT 
GCCCTGGCGG ACGCCGTCAA GGTGACTCTG GGTCCAAAGG GCCGCAACGC CGTCATCGAA
AAACTGACCG GGCCGCCCAC CATCACCAAT GACGGTGTGA CCATAGCCAG GGAGATCCAG
CTGCGTAACC CCTTCGCCAA CATGGGGGCG CAACTGGTCA AGGAGGTCGC GACCAAGACC
AACGGCACTG CCGGCGACGG AACCACCACC GCCACCGTGC TCGCGCAGGC CCTCGTCCGG
GAGGGTCTGC ATGCCGTGGA CGGGGGCGCC AACCCGATGT TTCTCAAGAA CGGCATCGAG
GCTGCCGTGG CCGCCCTGCT AGAGGAGTTT GAAAAGTACC GGGGAGAGGT CGAGGGCGAG
GCCGATCTTG CCCGGGTGGC GACCCTCGCC GCCAACAACG ATGCCCGGAT CGGCGACGTC
GTGGCCGCGG CCCTTGGCCG GGTCGGCTGC GACGGGGTGG TCACGGTCGA GGAATCCCCG
ATCTTCGGAC TCGAGGTCAG CTTCGTGGAC GGTATCGAGT TGGACAACGG GTACCTCTCG
CCGTACATGG TCACCGACAC CGAGCGGATG GAGGCCGCCT ACACCGACCC CTACATCCTG
TTGACCAACG AGAAGATCTC TCAGGTTCAG ACCCTGATGC CGGTCCTCGA GCTGGTCACC
CGGGCCGGCG GCCAGTTGAT CGTCTTCGCG GAGAACGTCG AGGGACCGGC ACTGGGCATG
CTGGTCGCCA ACAATGTGCA CGGGACCTTC CGGTCCGCGG TGGTCCGGGC ACCCGGTTTC
GGTCACCGTC GGTTGGCCGA GCTCAACGAT CTCGCGGTTT TTCTGGGCGG TCAGGTGATT
ACCGCGGATG CCGGGCTTTC CCTGGACCGG GTCACCCTCG GCCAGCTCGG GCGTTGCAAG
AAGGCCACCA TTACCGAGCA TGCGACTACG ATCGTCGACG GCGCCGGTTC CGCCACCGAG
ATCCATGCCC GGATCGACCA GCTCAAGCGG GAGCTTGAAC GGGCGGAGAA CCCCCACGAC
CAGGACACGT TGCAGACCCG GATCGCCCGG TTGTCCGGCG GCGTCGCGGT GATCCGGGTC
GGCGCCGTGA CCGGTGTGGA GTTGAAGGAG AAGCTGCACC GGGTCGAGGA CTCCCTCGCG
GCGGCACGGG CTGCTCTCGC CGAAGGTGTC GTGGCGGGCG GTGGTACCGC ACTGCTGCAA
GCGGCCTCGG CCCTTGACAA GCTCACGCTG ACCGGCGACG CCGCCGAAGG CAGGGAGATC
GTTCGGCGGG CCATCGCCGA ACCACTGCGC TGGATCGCGA TCAACGCCGG CTACGACGGC
GACGAAGTGG TCAAGCGGGT CGCCGAGCTG CCGCGCGGTC ACGGTTTCAA CGCCGCGACC
GGAGAATACG GGGAGATGGC CGGCTTCGGT GTCATCGACC CGGTGAAAGT TACCCGTTGC
GCGCTGCAGA GCGCGGCGTC GATCGCCGCG CTGTTGCTGA CAACGGAAAC CCTGGTTGTC
GAGGAGGTCA TCGGCAACCC GGGTGCCGTG ATCGCTCCCG GATTCGGGGA TCTCGCGGAG
GGCCTGGTCC GGCCTTCCAA CATCGCCTGA
 
Protein sequence
MAKDLRFNVE ARRLLEAGVN ALADAVKVTL GPKGRNAVIE KLTGPPTITN DGVTIAREIQ 
LRNPFANMGA QLVKEVATKT NGTAGDGTTT ATVLAQALVR EGLHAVDGGA NPMFLKNGIE
AAVAALLEEF EKYRGEVEGE ADLARVATLA ANNDARIGDV VAAALGRVGC DGVVTVEESP
IFGLEVSFVD GIELDNGYLS PYMVTDTERM EAAYTDPYIL LTNEKISQVQ TLMPVLELVT
RAGGQLIVFA ENVEGPALGM LVANNVHGTF RSAVVRAPGF GHRRLAELND LAVFLGGQVI
TADAGLSLDR VTLGQLGRCK KATITEHATT IVDGAGSATE IHARIDQLKR ELERAENPHD
QDTLQTRIAR LSGGVAVIRV GAVTGVELKE KLHRVEDSLA AARAALAEGV VAGGGTALLQ
AASALDKLTL TGDAAEGREI VRRAIAEPLR WIAINAGYDG DEVVKRVAEL PRGHGFNAAT
GEYGEMAGFG VIDPVKVTRC ALQSAASIAA LLLTTETLVV EEVIGNPGAV IAPGFGDLAE
GLVRPSNIA