Gene Smed_0408 details


Gene Information

Locus tag: Smed_0408
Symbol: groEL
ID: 5321242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sinorhizobium medicae WSM419
Kingdom: Bacteria
Replicon accession: NC_009636
Strand:
Start bp: 438560
End bp: 440197
Gene Length: 1638 bp
Protein Length: 545 aa
Translation table: 11
GC content: 61%
IMG OID: 640789343
Product: chaperonin GroEL
Protein accession: YP_001326100
Protein GI: 150395633
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
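
The start position, end position, gene length and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (neither assumption is stated in the record itself). The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 438560
end_bp = 440197

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1638
print(protein_length)  # 545
```

Under these assumptions the computed values agree with the reported gene length of 1638 bp and protein length of 545 aa.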


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.522888
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCTA AAGAAGTCAA GTTCGGCCGT AGCGCGCGCG AAAAGATGCT GCGCGGCGTC 
GACATTCTCG CCGACGCGGT CAAGGTGACG CTCGGCCCGA AGGGCCGTAA CGTCGTCATC
GACAAATCGT TCGGCGCACC GCGCATCACC AAGGACGGCG TTTCCGTCGC CAAGGAGATC
GAACTCGAAG ACAAGTTCGA GAACATGGGC GCCCAGATGG TCCGCGAAGT TGCTTCGAAG
ACCAACGACA TCGCCGGCGA CGGCACGACG ACCGCAACCG TTCTCGCTCA GGCAATCGTT
CGCGAAGGCG CGAAGGCCGT TGCCGCCGGC ATGAACCCGA TGGACCTTAA GCGCGGTATC
GACCTCGCCG TCGCTGAAGT CGTCAAGGAC CTGCTCGCCA AGGCCAAGAA GATCAACACC
TCGGACGAAG TCGCCCAGGT CGGCACGATC TCGGCAAACG GCGAAAAGCA GATCGGTCTC
GACATCGCCG AAGCGATGCA GAAGGTTGGC AACGAAGGCG TCATCACGGT TGAAGAAGCC
AAGACCGCCG AGACCGAACT CGAAGTCGTC GAAGGCATGC AGTTCGACCG CGGTTATCTG
TCGCCCTACT TCGTCACCAA CCCGGAAAAG ATGGTCGCCG ACCTCGAAGA CGCTTACATT
CTCCTGCACG AGAAGAAGCT CTCCAACCTT CAGGCGATGC TCCCGGTTCT CGAAGCCGTC
GTCCAGACCG GCAAGCCGCT CCTCATCATT GCTGAAGACG TCGAAGGCGA AGCTCTTGCA
ACGCTCGTCG TCAACAAGCT GCGTGGCGGC CTGAAGATTG CTGCCGTCAA GGCTCCGGGC
TTCGGCGACC GTCGCAAGGC CATGCTCGAA GACATCGCCA TCCTGACGGG CGGCACCGTG
ATCTCGGAAG ACCTCGGCAT CAAGCTCGAA AGCGTCACGC TCGACATGCT CGGCCGTGCG
AAGAAGGTTT CGATCACCAA GGAAAATACG ACGATCGTCG ACGGTGCCGG CCAGAAGTCC
GACATCGAAG GCCGCGTCGC CCAGATCAAG GCCCAGATCG AAGAAACCAC TTCCGACTAC
GACCGTGAGA AGCTGCAGGA GCGCCTTGCC AAGCTCGCTG GCGGCGTTGC CGTCATCCGC
GTCGGCGGTG CGACGGAAGT CGAAGTGAAG GAAAAGAAGG ACCGCATCGA CGACGCTCTC
AACGCGACGC GCGCTGCAGT TCAGGAAGGC ATCGTACCGG GCGGCGGCGT TGCTCTGCTG
CGTTCCTCCG TCAAGATCAC CGTCAAGGGT GAAAACGACG ACCAGGATGC CGGCGTCAAC
ATCGTTCGCC GCGCTCTGCA GTCTCCGGCC CGCCAGATCG TCGAAAACGC TGGCGACGAA
GCATCCATCG TTGTCGGCAA GATCCTCGAG AAGGACACCG ACGACTTCGG TTACAACGCA
CAGACCGGCG AATATGGTGA CATGATCGCC ATGGGCATCA TCGACCCGGT CAAGGTCGTT
CGCACCGCGC TTCAGGACGC AGCCTCGGTT GCTTCGCTGC TCATCACCAC CGAAGCCATG
ATCGCCGAGC TGCCGAAGAA GGACGCTCCG GCAATGCCTG GCGGCATGGG CGGCATGGGC
GGCATGGACA TGATGTGA
 
Protein sequence
MAAKEVKFGR SAREKMLRGV DILADAVKVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI 
ELEDKFENMG AQMVREVASK TNDIAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI
DLAVAEVVKD LLAKAKKINT SDEVAQVGTI SANGEKQIGL DIAEAMQKVG NEGVITVEEA
KTAETELEVV EGMQFDRGYL SPYFVTNPEK MVADLEDAYI LLHEKKLSNL QAMLPVLEAV
VQTGKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKAMLE DIAILTGGTV
ISEDLGIKLE SVTLDMLGRA KKVSITKENT TIVDGAGQKS DIEGRVAQIK AQIEETTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK EKKDRIDDAL NATRAAVQEG IVPGGGVALL
RSSVKITVKG ENDDQDAGVN IVRRALQSPA RQIVENAGDE ASIVVGKILE KDTDDFGYNA
QTGEYGDMIA MGIIDPVKVV RTALQDAASV ASLLITTEAM IAELPKKDAP AMPGGMGGMG
GMDMM
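
The gene and protein sequences above are printed in blocks of ten residues, so they need to be joined before any computation. The sketch below shows one possible way to remove the block spacing, compute a GC fraction, and translate DNA with the bacterial genetic code (translation table 11, whose amino-acid assignments match the standard code and differ only in the permitted start codons). The function names `clean_sequence`, `gc_fraction` and `translate` are hypothetical helpers written for this illustration, not part of any database tool.

```python
# Compact form of the genetic code: codons ordered with T, C, A, G at each
# position. The amino-acid assignments are those of the standard code, which
# translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGGCAGCTA AAGAAGTCAA GTTCGGCCGT")
print(translate(first_blocks))  # MAAKEVKFGR
```

The translated fragment matches the first ten residues of the protein sequence. Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` would allow the reported GC content and protein sequence to be checked in the same way.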