Gene Franean1_6001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6001 
SymbolgroEL 
ID5674322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7319731 
End bp7321380 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content70% 
IMG OID641244849 
Productchaperonin GroEL 
Protein accessionYP_001510251 
Protein GI158317743 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0956308 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA TTCTGACGTT CAACGAGGAC GCCCGCCGGT CGCTCGAGCG GGGCGTGAAC 
GCCCTCGCGG ACGCGGTCAA GGTGACGATC GGTCCGCGCG GCCGCAACGT CGTCATCAAC
AAGTCGTACG GCGCGCCGAC GATCACCAAC GACGGTGTGA CGATCGCGCG TGAGGTCGAG
CTGGACGACC CTTACCAGAA CCTGGGGGCG CAGCTCGCCA AGGAGGTCGC CACCAAGACC
AACGACGTCG CCGGCGACGG GACGACGACG GCCACGGTGC TGGCCCAGGA GATGGTCCGC
CAGGGCCTGC GGCAGGTGAC GGCCGGGGCC GCCCCGCTCT CGCTCAAGAT CGGCATCGAG
GCGGCGGTCG CCGCTGTCTC GTCGGCGCTG CTGGAAGCCG CGATCGAGAT CGGCTCGAAG
GAGACCATCG CGCAGGTCGC TGCGATCTCG GCGCAGGACG CCCAGGTCGG CGAACTCATC
GCCGAGGCGA TGGACAAGAT CGGCAAGGAC GGTGTGATCA CCATCGAGGA GAGCCAGACC
ATGGGTCTGG ACCTCGAGCT CACCGAGGGG ATGCAGTTCG ACAAGGGCTA CATCTCGCCG
TACTTCGTGA CCGACCAGGA GAGCATGGAG GCCGTCCTCG AGGACGCCTA CGTGCTGCTG
CACCCTGGCA AGATCAGCGC GCTGAACGAC ATCCTGCCGA TCCTCGAGCA GGTGGTGCAG
GAGCGTAAGC CGCTGCTGAT CATCGCTGAG GACGTCGAGG GCGAGGCGCT GTCGACCCTG
GTCGTCAACG CGGTGCGCAA GACCTTCCAG GTCGTCGCGG TGAAGGCCCC CGGCTTCGGT
GACCGCCGCA AGGCGATGCT GCAGGACCTC GCGGTCCTGA CCGGTGGCCA GGTCGTCGCG
ACCGAGGTGG GTCTCAAGCT CGACTCGATC ACGCTCGCCG AGCTCGGCCG GGCGCGGCGG
ATCGTCGTCG ACAAGGACCT CACCACGATC GTCGACGGTG CCGGCGACGC CGACGCGGTC
GCCGAGCGCG TCCGGCAGAT GAAGGCGGAG ATCGAACTCT CCGACTCGGA CTGGGACCGC
GAGAAGCTCC AGGAGCGCCT GGCCAAGCTG GCCGGCGGGG TCGCCGTCAT CCGCGTCGGC
GCCGCCACCG AGGTCGAGCT CAAGGAGAAG AAGCACCGTC TCGAGGACGC CGTGTCGGCG
ACCCGGGCCG CGGTCGAGGA GGGCATCGTG GCGGGCGGCG GCAGCGCGCT CGTCCACGCG
GCCAAGGTCC TCGACGGTGA CCTCGGCCTG ACCGGTGACG AGCGTTCCGG GGTGCGGGTC
GTGCGCGCGG CGCTCGACGC GCCGCTCACC TGGATCGCGC GCAACGCCGG CCTCGAGGGC
GCGGTGATCG TGTCCAAGGT CCGCGAGGGC GACGTCGGCC GGGGCTTCAA CGCGGCCACC
GGCGAGTACG TCGACCTGGT CGCGGCCGGC GTGGTCGACC CCGTGAAGGT CACCCGCTCC
GCGGTGGCGA ACGCGGCCTC CATCGCCGCG CTGCTCATCA CCACCGAGAG CCTGGTGGCG
GAGAAGCCCG AGGAGGCCGA GGCCGGCCAC GGCCACGACC ACTCGCACGG CGGCCATGGC
CACGGCCACT CGCACGGGCC GGGTTTCTGA
 
Protein sequence
MPKILTFNED ARRSLERGVN ALADAVKVTI GPRGRNVVIN KSYGAPTITN DGVTIAREVE 
LDDPYQNLGA QLAKEVATKT NDVAGDGTTT ATVLAQEMVR QGLRQVTAGA APLSLKIGIE
AAVAAVSSAL LEAAIEIGSK ETIAQVAAIS AQDAQVGELI AEAMDKIGKD GVITIEESQT
MGLDLELTEG MQFDKGYISP YFVTDQESME AVLEDAYVLL HPGKISALND ILPILEQVVQ
ERKPLLIIAE DVEGEALSTL VVNAVRKTFQ VVAVKAPGFG DRRKAMLQDL AVLTGGQVVA
TEVGLKLDSI TLAELGRARR IVVDKDLTTI VDGAGDADAV AERVRQMKAE IELSDSDWDR
EKLQERLAKL AGGVAVIRVG AATEVELKEK KHRLEDAVSA TRAAVEEGIV AGGGSALVHA
AKVLDGDLGL TGDERSGVRV VRAALDAPLT WIARNAGLEG AVIVSKVREG DVGRGFNAAT
GEYVDLVAAG VVDPVKVTRS AVANAASIAA LLITTESLVA EKPEEAEAGH GHDHSHGGHG
HGHSHGPGF