Gene Acid345_1097 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1097 
SymbolgroEL 
ID4069557 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1373508 
End bp1375169 
Gene Length1662 bp 
Protein Length553 aa 
Translation table11 
GC content61% 
IMG OID637983106 
Productchaperonin GroEL 
Protein accessionYP_590174 
Protein GI94968126 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000134813 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000260243 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAAGC AAATCGTTCA CGGCGAAGAA TCCCGCCAGT CCATTCTGCG CGGCGTTAAC 
GTTCTCGCTG ACGCAGTGAA AGTTACCCTC GGCCCCAAGG GCCGCAACGT CGTTATCGAC
AAAAAGTTCG GTTCCCCGCT CATCACCAAG GACGGCGTTA CCGTCGCGAA GGAAATCGAA
CTGAAGGACA CCCTTGAGAA CATGGGCGCC CAGATGGTGA AGGAAGTCGC CAGCAAGACC
AGCGACATCG CCGGCGACGG CACCACCACC GCCACCGTTC TCGCCCAGGC CATCTACCGC
GAAGGCGTAA AGAACGTCGC CGCCGGTTCC AACCCCATGG CCCTCAAGCG CGGCATCGAC
AAGGCCGTCA CCGCCGTATG CGGTTACAAC GACGCCGAAG GCAACCGCAT CCCCGGCGCC
CTCGACAAGT TCAGCAAGCC CGTCACCGGC GAGATGATCG CCCAGGTCGG CACCATCTCC
GCCAACAACG ACGAGACCAT CGGCAAGATC ATTGCCGAAG CCATGAAGAA GGTCGGCAAA
GACGGTGTCA TCACCGTTGA AGAGTCGAAG ACCATGGAGA CCCAGCTCGA AGTCGTCGAA
GGCATGCAGT TCGACCGCGG CTATCTCTCC CCGTACTTCG TCACCGATCC TGAGCGCATG
GAAGCCGTGC TCGAGAACCC CTACATCCTC ATCCACGAAA AGAAAGTTTC CTCGATGAAG
GACCTGCTCC CGTTGCTCGA GCAGATCGCC AAGGGCGGAC GCCCATTGGT CATCATCGCG
GAAGACGTGG AAGGCGAAGC ACTCGCGACT TTGGTCGTGA ACAAGCTGCG CGGCACGCTC
AACGTTGCGG CCGTGAAGGC GCCTGGCTTC GGCGATCGCC GCAAGGCCAT GCTGCAAGAC
ATCGCGATCC TCACCGGCGG CAAGGCCATT ACCGAAGACC TCGGCATTAA GCTCGAGAAC
GTCCATATGG ACGATCTCGG TTCCGCCAAG AAGGTCACCA TCGACAAGGA CAACACCACG
ATTGTCGAAG GCAAGGGCAA GAGTTCCGAC ATCGAAGGCC GCGTGAAGGA AATTCGCAGC
CAGGTCGAAA AGACCACCAG CGACTACGAC CGCGAGAAGC TCCAGGAACG CCTGGCGAAG
CTCGTCGGCG GCGTTGCGGT GATCAAGGTC GGCGCAGCCA CCGAGACTGA AATGAAGGAA
AAGAAAGCTC GCGTGGAAGA CGCGATGCAC GCAACCCGCG CTGCCGTGGA AGAAGGCATC
GTCCCGGGCG GCGGCGTTGC GCTCATCCGT TGCGTCGAAG CCGTTGACGC CCTCAAGCTC
ACCGGCGACG AAGGCATCGG CGCCAACATC ATCAAGCGCG CGCTCGAAGA GCCCCTCCGC
CAGATCGTCG GCAACGCCGG CGAAGAAGGC GCCATTGTGG TCGGCAAGAT CCGCGACCAC
AAGGACCCGC ACTACGGATA CAACGCCCAG ACCAGCGAGT ACGTTGACCT GGTCAAGGCC
GGCGTCATCG ACCCGACCAA GGTAACCCGT ACCGCGCTCC AGAACGCCGG CTCCATCGCT
GGCCTCATGC TCACCACGGA AGCCCTCATC TCCGAGATCC CCGAAGAGAA GAAGTCTGAA
CCGGCCGGTG GACACGGCGG CGGCATGGGC GGCATGTACT AA
 
Protein sequence
MAKQIVHGEE SRQSILRGVN VLADAVKVTL GPKGRNVVID KKFGSPLITK DGVTVAKEIE 
LKDTLENMGA QMVKEVASKT SDIAGDGTTT ATVLAQAIYR EGVKNVAAGS NPMALKRGID
KAVTAVCGYN DAEGNRIPGA LDKFSKPVTG EMIAQVGTIS ANNDETIGKI IAEAMKKVGK
DGVITVEESK TMETQLEVVE GMQFDRGYLS PYFVTDPERM EAVLENPYIL IHEKKVSSMK
DLLPLLEQIA KGGRPLVIIA EDVEGEALAT LVVNKLRGTL NVAAVKAPGF GDRRKAMLQD
IAILTGGKAI TEDLGIKLEN VHMDDLGSAK KVTIDKDNTT IVEGKGKSSD IEGRVKEIRS
QVEKTTSDYD REKLQERLAK LVGGVAVIKV GAATETEMKE KKARVEDAMH ATRAAVEEGI
VPGGGVALIR CVEAVDALKL TGDEGIGANI IKRALEEPLR QIVGNAGEEG AIVVGKIRDH
KDPHYGYNAQ TSEYVDLVKA GVIDPTKVTR TALQNAGSIA GLMLTTEALI SEIPEEKKSE
PAGGHGGGMG GMY