Gene Jann_3358 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagJann_3358 
SymbolgroEL 
ID3935831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameJannaschia sp. CCS1 
KingdomBacteria 
Replicon accessionNC_007802 
Strand
Start bp3409571 
End bp3411211 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content61% 
IMG OID637905731 
Productchaperonin GroEL 
Protein accessionYP_511300 
Protein GI89055849 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAAAAG ACGTCCGTTT TGATACCGAC GCCCGCAATC GTATGCTGAA GGGTGTGAAC 
ACCCTCGCCG ATGCGGTCAA AGTCACGCTT GGCCCCAAAG GCCGTAACGT GGTCATCGAC
AAGTCCTTCG GCGCGCCGCG CATCACGAAG GACGGTGTAT CCGTCGCCAA AGAGATCGAG
CTGGAAGACA AGTTCGAGAA CATGGGCGCA CAGATGGTGA AAGAAGTCGC CAGCCGCACC
AATGATGAGG CCGGTGACGG CACCACGACG GCAACTGTGC TGGCCCAGGC CATCATCAAG
GAAGGCCTCA AGTCGGTTGC GGCAGGCATG AACCCGATGG ACCTCAAGCG CGGCATCGAC
CTGGCCGTGA CCAAGGTCAT CGCCGAGATC CAGGGCTCCG CTCGCGAAGT CGCGGACAGC
GATGAAGTCG CCCAGGTTGG CACCATTTCC GCCAACGGCG AAGCTGAAAT CGGTCGTCAG
ATCGCCGACG CGATGCAGAA AGTCGGCAAC GACGGCGTCA TCACCGTGGA AGAGAACAAG
GGCCTTGAGA CCGAGACCGA TGTTGTCGAA GGCATGCAGT TCGACCGTGG CTACCTGTCG
CCCTATTTCG TGACCAACCC TGACAAGATG ATCGCCGAGT TGGACGATTG CCTGATCCTG
CTGCACGAGA AGAAGTTGTC TTCCCTGCAG CCGATGGTCC CGCTGCTGGA GACTGTCATC
CAGTCCGGCA AGCCGCTTCT GATCATCGCT GAAGATGTCG AAGGGGAAGC CCTGGCCACG
CTCGTCGTCA ACAAGCTGCG TGGCGGCCTG AAGATCGCCG CCGTCAAAGC GCCCGGTTTC
GGGGATCGTC GTAAGGCGAT GCTGCAGGAT ATCGCCATCC TGACCGGTGG CCAGGTGATC
GCGGAAGACC TGGGCATGAA GCTCGAATCC GTGACGATGG ACATGCTCGG CACCGCCAAG
CGTCTGACCA TCTCCAAGGA CGAGACCACG ATTGTCGACG GTGCTGGCAA CAAGCCGGAG
ATCGAGGCGC GCGTCGCCCA GATCCGTCAG CAGATCGAGG AAAGCACCTC CGACTATGAC
CGTGAAAAGC TGCAAGAGCG TGTTGCCAAA CTGGCAGGCG GTGTTGCCGT GATCAAGGTC
GGCGGCATGT CCGAGATCGA AGTGAAAGAG CGTAAGGACC GCGTCGACGA CGCCCTGAAC
GCAACCCGCG CCGCTGTCCA GGAAGGCATC GTTGTGGGCG GTGGTGTTGC TCTGGTCCAG
GGTGGCAAGT CGCTGGCTGG TCTTGAAGGC GAGAATGCCG ACCAGAATGC CGGTATCGCC
ATCGTGCGCC GTGCATTGGA AGCGCCGCTG CGCCAGATCG CCGAAAACTC CGGCGTCGAC
GGGTCCGTCG TTGCGGGCAA GATCCGCGAA TCTGACGACA ACGCCTTCGG CTTCAACGCC
CAGACGGAAG AATATGGCGA CCTGTTCAAG TTCGGCGTCA TCGACCCGGC CAAGGTTGTC
CGCACGGCTC TGCAGGACGC GGCCTCTGTG GCTGGCCTGC TGATCACCAC GGAAGCCATG
GTGGCCGACA AGCCTGCCAA AGAAGGCGCA CCTGCCGGTG GCGGCATGCC CGACATGGGC
GGCATGGGCG GCATGATGTA A
 
Protein sequence
MAKDVRFDTD ARNRMLKGVN TLADAVKVTL GPKGRNVVID KSFGAPRITK DGVSVAKEIE 
LEDKFENMGA QMVKEVASRT NDEAGDGTTT ATVLAQAIIK EGLKSVAAGM NPMDLKRGID
LAVTKVIAEI QGSAREVADS DEVAQVGTIS ANGEAEIGRQ IADAMQKVGN DGVITVEENK
GLETETDVVE GMQFDRGYLS PYFVTNPDKM IAELDDCLIL LHEKKLSSLQ PMVPLLETVI
QSGKPLLIIA EDVEGEALAT LVVNKLRGGL KIAAVKAPGF GDRRKAMLQD IAILTGGQVI
AEDLGMKLES VTMDMLGTAK RLTISKDETT IVDGAGNKPE IEARVAQIRQ QIEESTSDYD
REKLQERVAK LAGGVAVIKV GGMSEIEVKE RKDRVDDALN ATRAAVQEGI VVGGGVALVQ
GGKSLAGLEG ENADQNAGIA IVRRALEAPL RQIAENSGVD GSVVAGKIRE SDDNAFGFNA
QTEEYGDLFK FGVIDPAKVV RTALQDAASV AGLLITTEAM VADKPAKEGA PAGGGMPDMG
GMGGMM