Gene SeHA_C4748 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeHA_C4748 
SymbolgroEL 
ID6488061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Heidelberg str. SL476 
KingdomBacteria 
Replicon accessionNC_011083 
Strand
Start bp4628968 
End bp4630614 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content55% 
IMG OID642744802 
Productchaperonin GroEL 
Protein accessionYP_002048378 
Protein GI194451344 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.729761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value0.333677 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCTA AAGACGTAAA ATTCGGTAAC GACGCTCGTG TGAAAATGCT GCGCGGCGTA 
AACGTACTGG CAGATGCAGT GAAAGTAACC CTCGGTCCGA AAGGCCGTAA CGTGGTTCTG
GATAAATCTT TCGGTGCGCC GACTATCACT AAAGATGGTG TTTCCGTAGC GCGTGAAATC
GAACTGGAAG ACAAGTTTGA AAACATGGGC GCGCAGATGG TGAAAGAAGT TGCCTCTAAA
GCGAACGATG CTGCAGGCGA CGGCACCACC ACCGCGACCG TACTGGCGCA GTCCATCATT
ACCGAAGGCT TGAAAGCCGT TGCTGCGGGC ATGAACCCGA TGGACCTGAA ACGTGGTATC
GACAAAGCGG TTGCTGCGGC GGTTGAAGAG CTGAAAGCCC TGTCCGTACC GTGCTCCGAC
TCTAAAGCGA TTGCTCAGGT AGGTACTATC TCCGCTAACT CCGACGAAAC CGTAGGTAAA
CTGATTGCGG AAGCGATGGA TAAAGTCGGT AAAGAAGGCG TCATCACCGT TGAAGACGGT
ACCGGTCTGC AGGACGAACT GGACGTGGTT GAAGGTATGC AGTTTGACCG CGGCTACCTG
TCTCCTTACT TCATCAACAA GCCGGAAACT GGCGCAGTAG AGCTGGAAAG CCCGTTCATC
CTGCTGGCTG ATAAGAAAAT CTCCAACATC CGCGAAATGC TGCCGGTTCT GGAAGCCGTT
GCAAAAGCAG GCAAACCGCT GCTGATCATC GCTGAAGATG TTGAAGGCGA AGCGCTGGCT
ACCCTGGTAG TGAACACCAT GCGTGGCATC GTGAAAGTGG CTGCTGTTAA GGCACCGGGC
TTCGGCGATC GTCGTAAGGC GATGCTGCAG GATATCGCTA CCCTGACCGG CGGTACCGTA
ATCTCTGAAG AGATCGGTAT GGAGCTGGAA AAAGCAACCC TGGAAGACCT GGGTCAGGCG
AAACGTGTTG TGATCAACAA AGACACCACC ACCATCATCG ATGGCGTGGG TGAAGAAGCT
GCCATCCAGG GCCGTGTTGC TCAGATCCGT CAGCAGATTG AAGAAGCGAC CTCCGACTAC
GATCGTGAAA AACTGCAGGA GCGCGTAGCG AAACTGGCAG GCGGCGTTGC GGTTATCAAA
GTTGGCGCTG CGACCGAAGT TGAAATGAAA GAGAAGAAAG CCCGCGTTGA AGATGCCCTG
CACGCGACCC GTGCTGCGGT AGAAGAAGGC GTGGTTGCTG GTGGTGGCGT TGCGCTGATC
CGCGTTGCTT CTAAAATTGC TGACCTGAAA GGCCAGAACG AAGACCAGAA CGTGGGTATC
AAAGTTGCGC TGCGCGCAAT GGAAGCTCCG CTGCGTCAGA TCGTGCTGAA CTGCGGCGAA
GAGCCGTCTG TTGTCGCTAA CACCGTTAAA GGCGGCGACG GTAACTACGG TTACAACGCA
GCAACTGAAG AATACGGCAA CATGATCGAT ATGGGTATCC TGGACCCAAC CAAAGTTACC
CGTTCTGCGC TGCAGTACGC GGCTTCTGTG GCTGGTCTGA TGATCACTAC CGAGTGCATG
GTGACCGACC TGCCGAAAAG CGATGCTCCT GATTTAGGCG CTGCTGGCGG CATGGGTGGT
ATGGGTGGTA TGGGCGGCAT GATGTAA
 
Protein sequence
MAAKDVKFGN DARVKMLRGV NVLADAVKVT LGPKGRNVVL DKSFGAPTIT KDGVSVAREI 
ELEDKFENMG AQMVKEVASK ANDAAGDGTT TATVLAQSII TEGLKAVAAG MNPMDLKRGI
DKAVAAAVEE LKALSVPCSD SKAIAQVGTI SANSDETVGK LIAEAMDKVG KEGVITVEDG
TGLQDELDVV EGMQFDRGYL SPYFINKPET GAVELESPFI LLADKKISNI REMLPVLEAV
AKAGKPLLII AEDVEGEALA TLVVNTMRGI VKVAAVKAPG FGDRRKAMLQ DIATLTGGTV
ISEEIGMELE KATLEDLGQA KRVVINKDTT TIIDGVGEEA AIQGRVAQIR QQIEEATSDY
DREKLQERVA KLAGGVAVIK VGAATEVEMK EKKARVEDAL HATRAAVEEG VVAGGGVALI
RVASKIADLK GQNEDQNVGI KVALRAMEAP LRQIVLNCGE EPSVVANTVK GGDGNYGYNA
ATEEYGNMID MGILDPTKVT RSALQYAASV AGLMITTECM VTDLPKSDAP DLGAAGGMGG
MGGMGGMM