Gene Smed_0804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0804 
SymbolgroEL 
ID5321641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp866544 
End bp868172 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content63% 
IMG OID640789741 
Productchaperonin GroEL 
Protein accessionYP_001326495 
Protein GI150396028 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.451933 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.636529 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGTCA AGGAAGTCAG ATTCACCAGC GATGCCCGCG ATCGCATGCT GCGCGGCGTG 
GATATCATGG CCAACGCCGT GCGCGTGACT TTGGGACCGA AGGGGCGGAA CGTCGTCATC
GACAAGTCCT TTGGAGCGCC GCGGATAACC AAGGACGGCG TTTCCGTCGC CAAGGAAATC
GAGCTCGAGG ACAAGTTCGA GAACATGGGG GCGCAGATGC TGCGCGAGGT GGCGTCGCGC
ACCAGCGACA TCGCCGGCGA TGGCACCACC ACGGCCACCG TACTCGCGCA AGCAATCGTC
AGGGAAGGCG CGAAGGCCGT GGCGGCAGGC ATGAACCCGA TGGACCTGAA GCGTGGCATC
GATCTGGCCG TCGAGGCGAT CGTACGGGAA CTCAGGACCA ACGCCCGCAA GGTCTCCAAG
AACGCCGAGA TCGCTCAGGT AGCCACGATT TCGGCCAATG GCGACGCAGA AATCGGTCGC
TACCTTGCCG AAGCCATGGA AAAGGTCGGC AACGAGGGCG TAATCACCGT CGAAGAGGCC
AAGACCGCCG AGATCGAGCT TGAAGTCGTG GAGGGAATGC AGTTCGACCG CGGCTATCTC
TCACCCTATT TCATCACCAA CCAGGAGAAG ATGAGGGTGG AACTGGAAGA CGCCTACATA
CTGCTGCACG AGAAGAAGCT CTCCAACCTG CAGGCGATGA TCCCAATTCT CGAATCGGTC
ATCCAGTCCG GAAAGCCCCT GCTGATCATT GCCGAGGACG TCGAGGGCGA GGCACTCGCG
ACGCTGGTGG TCAACAAGCT GCGCGGCGGC CTGAAGATCG CCGCGGTCAA GGCGCCCGGC
TTCGGCGACC GCCGCAAGTC CATGCTCGAG GACATCGCGA TCCTGACCGG CGGAACCGTC
ATCTCCGAGG AACTCGGGAC TAAGCTCGAG AGCGCGACGA TCGACATCCT CGGCCGCGCG
AAACGCGTGA TGGTCGAAAA GGAGACGACG ACGATCGTCG ACGGCGCCGG GTCGAAGGCG
GACATCGGTG GCCGCGTCGC CCAGATCAAG GCGCAGATCG AGGACACCAC TTCCGACTAC
GATCGGGAGA AGCTGCAGGA GCGGCTCGCC AAGCTCGCGG GCGGTGTCGC CGTGATCCGC
GTCGGCGGCT CGACAGAGAT CGAAGTCAAG GAGAAGAAGG ATCGCGTCGA CGATGCCCTT
CATGCGACGC GGGCGGCGGT CGAAGAAGGC ATCCTGCCGG GCGGCGGCGT GGCGCTGCTA
CGGGTCGTCA GCGTGCTCAA CGGTCTTGCG ACGGCCAACG ACGATCAGCG CGTGGGTATC
GAGATCGTCC GCCGCGCTAT CGAGGCACCC GTCCGCCAGA TCGCCGAGAA CGCCGGCGCC
GAGGGATCCA TTATCGTCGG GAAGTTGCGG GAGAAAGAGG ATTTTGCCTT TGGCTGGAAT
GCCCAGACTG GTGAATTCGG CGATCTCTTT CAAATGGGCG TCATCGACCC AGCAAAGGTC
GTGCGCGCCG CCTTGCAGGA CGCCGCCTCC GTCGCCGGTC TTCTTGTGAC GACGGAAGCG
ATGATCGCCG AGAAACCGAA GAAGGACGGG CAGCCGCAGA TGCCGCCCGC GCCGGGCATG
GATTTCTGA
 
Protein sequence
MAVKEVRFTS DARDRMLRGV DIMANAVRVT LGPKGRNVVI DKSFGAPRIT KDGVSVAKEI 
ELEDKFENMG AQMLREVASR TSDIAGDGTT TATVLAQAIV REGAKAVAAG MNPMDLKRGI
DLAVEAIVRE LRTNARKVSK NAEIAQVATI SANGDAEIGR YLAEAMEKVG NEGVITVEEA
KTAEIELEVV EGMQFDRGYL SPYFITNQEK MRVELEDAYI LLHEKKLSNL QAMIPILESV
IQSGKPLLII AEDVEGEALA TLVVNKLRGG LKIAAVKAPG FGDRRKSMLE DIAILTGGTV
ISEELGTKLE SATIDILGRA KRVMVEKETT TIVDGAGSKA DIGGRVAQIK AQIEDTTSDY
DREKLQERLA KLAGGVAVIR VGGSTEIEVK EKKDRVDDAL HATRAAVEEG ILPGGGVALL
RVVSVLNGLA TANDDQRVGI EIVRRAIEAP VRQIAENAGA EGSIIVGKLR EKEDFAFGWN
AQTGEFGDLF QMGVIDPAKV VRAALQDAAS VAGLLVTTEA MIAEKPKKDG QPQMPPAPGM
DF