Gene Smed_6084 details

Gene Information

Locus tag: Smed_6084
Symbol: groEL
ID: 5320386
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009621
Strand:
Start bp: 1017063
End bp: 1018700
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 62%
IMG OID: 640777729
Product: chaperonin GroEL
Protein accession: YP_001314661
Protein GI: 150378066
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
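
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive (the record does not state its coordinate convention) and that the CDS ends in a single stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(1017063, 1018700)
print(length, protein_length(length))  # 1638 545
```

Under these assumptions the computed values agree with the Gene Length (1638 bp) and Protein Length (545 aa) fields.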


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.879265
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCTA AAGAAGTCAA GTTCGGCCGT AGCGCGCGCG AAAAGATGCT GCGCGGCGTC 
GACATCCTCG CCGACGCAGT CAAGGTGACG CTCGGCCCGA AGGGCCGCAA CGTCGTCATC
GACAAGTCGT TCGGCGCGCC GCGCATCACC AAGGACGGCG TCACCGTGGC CAAGGAGATC
GAACTCGAAG ACAAGTTCGA GAACATGGGC GCCCAGATGG TCCGCGAAGT CGCTTCGAAG
ACCAACGACA TTGCCGGTGA CGGCACGACG ACTGCAACCG TTCTCGCTCA GGCAATCGTT
CGCGAAGGCG CGAAGGCCGT TGCCGCCGGC ATGAACCCGA TGGACCTTAA GCGCGGTATC
GACCTCGCCG TCGCGGAAGT CGTCAAGGAC CTGCTCGCCA AGGCCAAGAC GATCAACACC
TCGGACGAAG TCGCCCAGGT CGGCACGATC TCGGCAAACG GCGAAAAGCA GATCGGTCTC
GACATTGCGG AAGCGATGCA GAAGGTCGGC AACGAAGGCG TCATCACGGT TGAAGAAGCC
AAGACCGCCG AGACCGAGCT CGAAGTCGTC GACGGCATGC AGTTCGACCG CGGCTACCTG
TCGCCCTACT TCGTCACCAA CCCGGAAAAG ATGGTCGCCG ACCTCGAAGA CGCTTACATT
CTCCTGCACG AGAAGAAGCT GTCGAACCTG CAGGCGATGC TCCCGGTTCT CGAAGCCGTC
GTCCAGACCG GCAAGCCGCT CCTCATCATT GCTGAAGACG TCGAAGGCGA AGCACTCGCA
ACGCTCGTCG TCAACAAGCT GCGTGGCGGC CTGAAGATCG CTGCCGTCAA GGCCCCGGGC
TTCGGCGACC GCCGCAAGGC CATGCTCGAA GACATCGCCA TCCTGACGGG CGGCACGGTG
ATCTCGGAAG ACCTCGGCAT CAAGCTCGAA AGCGTCACGC TCGACATGCT CGGCCGTGCG
AAGAAGGTTT CGATCACCAA GGAAAATACG ACGATCGTCG ACGGTGCCGG CCAGAAGTCC
GACATCGAAG GCCGCGTCGC CCAGATCAAG GCCCAGATCG AAGAAACCAC TTCCGACTAC
GACCGTGAGA AGCTGCAGGA GCGCCTTGCC AAGCTCGCTG GCGGCGTTGC CGTCATCCGC
GTCGGCGGTG CGACGGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCATCGA CGACGCTCTC
AACGCGACGC GCGCTGCAGT TCAGGAAGGC ATCGTACCGG GCGGCGGCGT TGCTCTGCTG
CGTTCCTCCG TCAAGATCAC CGTCAAGGGT GAAAACGACG ACCAGGATGC CGGCGTCAAC
ATCGTTCGCC GCGCTCTGCA GTCTCCGGCC CGCCAGATCG TCGAAAACGC TGGCGACGAA
GCATCCATCG TCGTCGGCAA GATCCTCGAG AAGGACACCG ACGACTTCGG TTACAACGCA
CAGACCGGCG AATATGGCGA CATGATCGCC ATGGGCATCA TCGACCCGGT CAAGGTCGTT
CGCACCGCGC TCCAGGACGC AGCCTCGGTT GCTTCGCTGC TCATCACCAC CGAAGCCATG
ATCGCCGAGC TGCCGAAGAA GGACGCTCCG GCAATGCCTG GCGGCATGGG CGGCATGGGC
GGCATGGACA TGATGTGA
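
The GC content field can be recomputed from the gene sequence. This is a minimal sketch: it assumes the sequence is pasted exactly as displayed here, in space-separated blocks of 10 bases, and the helper names `clean_sequence` and `gc_percent` are hypothetical. How the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Only the first and last displayed blocks are used here, for illustration;
# paste the full gene sequence to compare against the GC content field.
fragment = clean_sequence("ATGGCAGCTA AAGAAGTCAA\nGGCATGGACA TGATGTGA")
print(len(fragment), round(gc_percent(fragment), 1))
```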
 
Protein sequence
MAAKEVKFGR SAREKMLRGV DILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVTVAKEI 
ELEDKFENMG AQMVREVASK TNDIAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI
DLAVAEVVKD LLAKAKTINT SDEVAQVGTI SANGEKQIGL DIAEAMQKVG NEGVITVEEA
KTAETELEVV DGMQFDRGYL SPYFVTNPEK MVADLEDAYI LLHEKKLSNL QAMLPVLEAV
VQTGKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLE DIAILTGGTV
ISEDLGIKLE SVTLDMLGRA KKVSITKENT TIVDGAGQKS DIEGRVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRIDDAL NATRAAVQEG IVPGGGVALL
RSSVKITVKG ENDDQDAGVN IVRRALQSPA RQIVENAGDE ASIVVGKILE KDTDDFGYNA
QTGEYGDMIA MGIIDPVKVV RTALQDAASV ASLLITTEAM IAELPKKDAP AMPGGMGGMG
GMDMM
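
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an assumption-laden illustration, not the database's own method: it builds the codon table from the NCBI base ordering (TCAG) and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, differing only in which codons may act as start codons. Since this gene begins with ATG, no special start-codon handling is included. The name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: codons enumerated TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # remove display spaces and newlines
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two displayed blocks of the gene sequence.
print(translate("ATGGCAGCTA AAGAAGTCAA GTTCGGCCGT"))  # MAAKEVKFGR
print(translate("GGCATGGACA TGATGTGA"))               # GMDMM
```

Both fragments match the corresponding ends of the protein sequence listed here; running the full gene sequence through the same function allows the whole 545-residue product to be compared.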