Gene SAG2074 details

Gene Information

Locus tag: SAG2074
Symbol: groEL
ID: 1014885
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 2056140
End bp: 2057762
Gene Length: 1623 bp
Protein Length: 540 aa
Translation table: 11
GC content: 41%
IMG OID: 637317240
Product: chaperonin GroEL
Protein accession: NP_689060
Protein GI: 22538209
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
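
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS that ends in a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2056140, 2057762)
print(length, protein_length(length))  # prints: 1623 540
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of this record.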


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.169869
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAG ATATTAAATT TTCAGCAGAT GCCCGCTCAG CAATGGTGCG TGGTGTTGAT 
ATTTTAGCTG ATACAGTCAA AGTAACATTA GGTCCTAAAG GCCGTAATGT TGTTCTTGAA
AAAGCATTTG GTTCTCCTTT AATTACAAAT GATGGTGTGA CAATTGCTAA AGAAATTGAG
CTAGAAGATC ACTTTGAAAA TATGGGAGCT AAACTTGTGT CAGAAGTGGC TTCAAAAACT
AATGATATTG CAGGGGATGG CACTACAACT GCTACTGTTT TGACCCAAGC TATTGTACGG
GAAGGTCTTA AAAATGTAAC TGCAGGGGCA AATCCGATTG GCATTCGTCG TGGTATTGAA
ACAGCTGTTT CAGCAGCAGT TGAAGAGCTA AAAGAGATTG CACAACCAGT TTCAGGCAAA
GAAGCTATTG CTCAAGTTGC GGCTGTGTCT TCACGTTCTG AAAAAGTTGG GGAATACATT
TCTGAAGCTA TGGAGCGCGT GGGTAATGAT GGTGTTATCA CTATTGAAGA ATCGCGAGGT
ATGGAAACAG AGCTTGAAGT TGTGGAAGGA ATGCAGTTTG ACCGTGGGTA CTTGTCACAG
TATATGGTAA CTGATAACGA GAAAATGGTC TCTGAACTTG AGAATCCGTA TATCCTTATT
ACAGATAAGA AAATTTCAAA TATCCAAGAA ATTTTACCAT TATTAGAAGA GGTTCTTAAA
ACAAATCGTC CGTTGCTAAT CATCGCTGAT GATGTTGATG GAGAAGCTCT CCCAACGCTT
GTTCTTAACA AAATTCGTGG AACTTTCAAT GTCGTAGCTG TTAAAGCGCC TGGATTTGGT
GATCGTCGTA AAGCCATGCT GGAAGATATT GCTATCCTAA CAGGAGGAAC TGTCGTTACT
GAAGACCTTG GTTTAGACTT AAAAGATGCT ACTATGCAAG TTTTAGGACA GTCTGCTAAA
GTAACAGTAG ATAAAGATTC TACTGTTATT GTCGAAGGTG CCGGTGACTC ATCAGCAATT
GCTAATCGCG TAGCTATCAT TAAGTCACAG ATGGAGGCTA CAACTTCTGA TTTTGATCGT
GAAAAATTAC AAGAACGACT TGCTAAGTTA GCCGGTGGTG TAGCAGTAAT TAAAGTTGGT
GCAGCGACTG AAACAGAATT AAAAGAGATG AAACTTCGCA TCGAAGATGC GTTAAATGCA
ACGCGTGCTG CAGTTGAAGA AGGTATTGTT TCAGGTGGAG GTACGGCTCT TGTGAACGTT
ATTGAAAAAG TAGCGGCACT GAAACTTAAT GGTGATGAGG AGACTGGACG TAATATTGTT
CTTCGTGCTC TCGAAGAGCC TGTTCGTCAA ATTGCTTACA ATGCTGGATA TGAAGGTTCA
GTTATTATTG AACGTTTAAA ACAGTCTGAA ATTGGTACAG GATTTAATGC GGCCAATGGA
GAATGGGTAG ATATGGTTAC CACAGGTATC ATTGACCCTG TCAAAGTAAC ACGTTCTGCA
CTTCAAAATG CGGCATCTGT AGCAAGTCTT ATCTTGACTA CAGAAGCAGT AGTAGCAAAT
AAACCTGAAC CAGAAGCTCC TACAGCTCCT GCAATGGATC CATCTATGAT GGGTGGCTTC
TAA
 
Protein sequence
MAKDIKFSAD ARSAMVRGVD ILADTVKVTL GPKGRNVVLE KAFGSPLITN DGVTIAKEIE 
LEDHFENMGA KLVSEVASKT NDIAGDGTTT ATVLTQAIVR EGLKNVTAGA NPIGIRRGIE
TAVSAAVEEL KEIAQPVSGK EAIAQVAAVS SRSEKVGEYI SEAMERVGND GVITIEESRG
METELEVVEG MQFDRGYLSQ YMVTDNEKMV SELENPYILI TDKKISNIQE ILPLLEEVLK
TNRPLLIIAD DVDGEALPTL VLNKIRGTFN VVAVKAPGFG DRRKAMLEDI AILTGGTVVT
EDLGLDLKDA TMQVLGQSAK VTVDKDSTVI VEGAGDSSAI ANRVAIIKSQ MEATTSDFDR
EKLQERLAKL AGGVAVIKVG AATETELKEM KLRIEDALNA TRAAVEEGIV SGGGTALVNV
IEKVAALKLN GDEETGRNIV LRALEEPVRQ IAYNAGYEGS VIIERLKQSE IGTGFNAANG
EWVDMVTTGI IDPVKVTRSA LQNAASVASL ILTTEAVVAN KPEPEAPTAP AMDPSMMGGF
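
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in their permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative, and only the first line of the gene sequence is used as input.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G, with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of 10.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_line = "ATGGCAAAAG ATATTAAATT TTCAGCAGAT GCCCGCTCAG CAATGGTGCG TGGTGTTGAT"
print(translate(first_line))  # prints: MAKDIKFSADARSAMVRGVD
```

The 20 residues produced match the first two blocks of the protein sequence listed above.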