Gene Smed_4130 details


Gene Information

Locus tag: Smed_4130
Symbol: groEL
ID: 5319272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009620
Strand:
Start bp: 597851
End bp: 599485
Gene Length: 1635 bp
Protein Length: 544 aa
Translation table: 11
GC content: 61%
IMG OID: 640775936
Product: chaperonin GroEL
Protein accession: YP_001312869
Protein GI: 150376273
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
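
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention, and the variable names are illustrative only.

```python
# Values copied from the record.
start_bp = 597851
end_bp = 599485

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bp; the final codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)      # 1635
print(protein_length)   # 544
```

Under these assumptions the results agree with the Gene Length (1635 bp) and Protein Length (544 aa) fields.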


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.800331
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGCCA AAGAAATCAT CTTCTCCACC GAAGTCCGCG ATCGCCTGCT TCGCGGCGTG 
GAATTGCTCA ACAATGCCGT CAAGGTGACG CTCGGCCCGA AGGGCCGCAA CGTCGTCATC
GACAGGTCGT ATGGCGCACC ACGAATCACC AAGGACGGCG TCTCCGTCGC CAAGGAAATC
GAGCTTGAAG ACAAGTTCGA GAACATGGGC GCGCAGATGG TGCGCGAAGT GGCATCAAAG
ACCAATGATC TTGCCGGCGA CGGCACCACG ACGGCGACTG TGCTTGCCGC GTCCATCTTC
CGCGAAGGCG CCAAGCTGGT CGCGGCGGGC ATGAACCCGA TGGATCTTAA GCGCGGTATC
GACCTCGCCG TTACCGCGGT CCTCGCGGAA ATCAAGCTGC GCGCCACGAA GGTCAATTCG
TCGAGCGAGA TAGCACAGGT CGGCACGATC GCCGCCAATG GCGACGCCAG CGTCGGCGAG
ATGATCGCAG GGGCGATGGA GAAGGTCGGC AACGAGGGTG TCATTACCGT CGAAGAAGCC
AGGACCGCCG ATACCGAACT CGACGTCGTC GAAGGCATGC AGTTCGACCG CGGATATCTA
TCGCCGTACT TCGTGACCAA TGCCGAGAAG ATGCGCGTGG AACTGGACGA TCCCTACATC
CTCATTCACG AGAAGAAGCT CGGCAACCTG CAGACCATGC TGCCGATCCT TGAGGCCGTA
GTGCAGAGCG GCAAGCCTTT GCTCATCATC TCCGAAGACG TGGAAGGCGA AGCGTTGACG
ACGCTCGTCG TCAACAAGTT GCGGGGAGGT CTGAAGATCG CCGCCGTGAA GTCGCCGGGC
TTCGGCGACC GCCGCAAGGC CATGCTGCAG GACATTGCCG TTCTGACCGC CGGCCAGATG
ATTTCCGAAG ATATCGGCAT CAAGCTCGAA AACGTTACGC TCGATATGCT TGGCCGCGCC
AGGCGGGTGC TGATCGAGAA AGACACGACC ACGATCATCG ACGGCTCCGG GGATAAGGCC
TCCATCCAGG CATGCATCAG CCAGATCAAG GCGCAGATCG AAGAGACGAC ATCCGACTAC
GACAAGGAGA AGCTGCAGGA GCGGCTGGCG AAGCTCACCG GCGGCGTCGC GGTAATTCGC
GTCGGCGGCG CGACCGAGCT TGAAGTCAAG GAAAAGAAGG ACCGCATCGA CGATGCCCTG
AACGCCACCC GCGCGGCAGT CGAGGAAGGC ATCGTTGCCG GCGGCGGCGT GGCGTTGTTG
CGTGCGAAAT CGGCGCTTGC CAGTCTCACC GGCGAGAACC CCGAGATTAC CGCAGGCATC
GCGATCGTGC GCAAGGCACT GGAGGCACCA ATCCGGCAGA TCGCCGACAA TGCCGGCGTC
GAAGGCTCGA TCGTCATCGG AAAACTCGTC GACAGCAGCG ACCAGAACCA GGGCTTCGAT
GCGCAGACGG AAACCTATGT GGATATGATC AAGGCCGGGA TCGTCGATCC CGCCAAGGTC
GTGCGGACAG CCTTGCGGGA CGCCGGCTCG ATTGCCGCAC TCCTGATCAC CGCCGAAGCC
ATGGTCGCCG ATATCCCAGA GAAAAACGCC GCTCAAAATG CCGGAAACGG CGCGATGGGC
GGGAGAGGAT ACTGA
 
Protein sequence
MSAKEIIFST EVRDRLLRGV ELLNNAVKVT LGPKGRNVVI DRSYGAPRIT KDGVSVAKEI 
ELEDKFENMG AQMVREVASK TNDLAGDGTT TATVLAASIF REGAKLVAAG MNPMDLKRGI
DLAVTAVLAE IKLRATKVNS SSEIAQVGTI AANGDASVGE MIAGAMEKVG NEGVITVEEA
RTADTELDVV EGMQFDRGYL SPYFVTNAEK MRVELDDPYI LIHEKKLGNL QTMLPILEAV
VQSGKPLLII SEDVEGEALT TLVVNKLRGG LKIAAVKSPG FGDRRKAMLQ DIAVLTAGQM
ISEDIGIKLE NVTLDMLGRA RRVLIEKDTT TIIDGSGDKA SIQACISQIK AQIEETTSDY
DKEKLQERLA KLTGGVAVIR VGGATELEVK EKKDRIDDAL NATRAAVEEG IVAGGGVALL
RAKSALASLT GENPEITAGI AIVRKALEAP IRQIADNAGV EGSIVIGKLV DSSDQNQGFD
AQTETYVDMI KAGIVDPAKV VRTALRDAGS IAALLITAEA MVADIPEKNA AQNAGNGAMG
GRGY
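
The protein sequence corresponds to the gene sequence read codon by codon. As a minimal sketch, the code below translates the first and last lines of the gene sequence and computes a GC fraction. It uses the standard codon assignments, which NCBI translation table 11 shares (table 11 differs in its permitted start codons, which are not modelled here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, not part of any database tool.

```python
# Standard codon assignments with bases ordered T, C, A, G.
# Translation table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at a stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCCGCCA AAGAAATCAT CTTCTCCACC GAAGTCCGCG ATCGCCTGCT TCGCGGCGTG"
last_line = "GGGAGAGGAT ACTGA"

print(translate(first_line))  # MSAKEIIFSTEVRDRLLRGV
print(translate(last_line))   # GRGY
```

The two outputs match the start and end of the protein sequence. The last line is in frame only because the preceding 1620 bases are a multiple of three. Passing the complete gene sequence to `gc_fraction` should give a value comparable to the GC content field of the record.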