Gene Sala_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSala_2079 
SymbolgroEL 
ID4080016 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingopyxis alaskensis RB2256 
KingdomBacteria 
Replicon accessionNC_008048 
Strand
Start bp2183461 
End bp2185080 
Gene Length1620 bp 
Protein Length539 aa 
Translation table11 
GC content64% 
IMG OID638010454 
Productchaperonin GroEL 
Protein accessionYP_617121 
Protein GI103487560 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.131312 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.353745 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGCCA AGGAAGTGAA ATTTGCGTCG GACGCGCGTG ATCGCATGCT GCGCGGCGTG 
GATACGCTTG CGAATGCAGT GAAGGTCACG CTCGGCCCCA AGGGTCGCAA CGTGGTTATC
GAAAAGAGTT TCGGCGCGCC GCGCATCACC AAGGACGGCG TCACTGTCGC CAAGGAGATC
GAACTCGCCG ACAAGTTCGA GAACATGGGT GCGCAGATGC TGCGCGAGGT CGCCTCGAAA
CAGAATGACA AGGCCGGCGA CGGCACCACC ACCGCAACCG TGCTTGCGCA GGCGATCGTC
CGCGAGGGAT CGAAAGCGGT GGCCGCGGGC ATGAACCCGA TGGACGTGAA GCGCGGCATC
GACCTCGCCG TGAAGGCCGT GGTGAAGGAT CTCGAAACCC ATGCCAAGAA GGTCAGCGCC
AACAGCGAAA TCGCCCAGGT CGCAACCATC TCGGCGAACG GCGACGAGGA GGTGGGCCGC
ATCCTCGCCG AAGCGATGGA CAAGGTCGGC AACGAAGGCG TCATCACCGT CGAGGAGGCG
AAGAGCCTGG CGACCGAACT CGAAACGGTC GAAGGCATGC AGTTCGACCG CGGCTATCTG
TCGCCCTATT TCATCACCAA TGCCGAAAAG CTGAAGGTCG AACTCGACGA CCCCTATATC
CTGATCCACG AAAAGAAGCT TTCGAACCTG CAGGCGATGC TGCCGCTGCT GGAAGCGGTC
GTGCAGTCGG GCAAGCCGCT GCTCATCATC GCCGAAGATG TCGAGGGCGA GGCATTGGCA
ACGCTGGTCG TCAACCGCCT GCGCGGCGGG CTCAAGGTCG CCGCGGTCAA GGCGCCGGGC
TTTGGCGACC GTCGCAAGGC GATGCTGGAA GATATCGCGA TCCTCACCGG CGGCAATGTC
GTGAGCGAAG ACCTGGGCAT CAAGCTGGAG AATGTCACGG TCAACATGCT CGGGCGCGCC
AAGAAGGTCG TGATCGACAA GGATAATACG ACGATCGTCG ACGGCGTCGG CGCCAGGACC
GACATCGACG CACGCATCGC CCAGATTCGC CAGCAGATCG ACACGACGAC CAGCGACTAT
GACCGCGAGA AGCTGCAGGA ACGCCTCGCC AAGCTCGCGG GTGGCGTTGC GGTGATCCGC
GTCGGCGGCG CAACCGAGGT CGAAGTCAAG GAGCGCAAGG ATCGCGTCGA CGACGCGCTT
CACGCGACGC GTGCTGCGGT GGAAGAAGGC ATTCTTCCCG GCGGCGGCAT TGCACTGCTC
CGCGCGCTCA AGGCGCTCGA CGGCCTCAAG GCGGCGAACG ACGATCAGCA GTCCGGCATC
GACATCGTCC GCCGCGCGCT CCGCGCTCCG GCGCGCCAGA TCGCCGACAA CGCGGGCGAG
GATGGTGCGT GGATCGTCGG CAAACTGCTC GAAAGCAGCG ACTATAACTG GGGCTTCAAT
GCTGCCACCG GCGAATATGA AGACCTCGTC AAAGCGGGTG TGATCGACCC GGCGAAGGTC
GTCCGCACGG CGCTGCAGGA CGCGGCCTCA GTCGCCGCGC TGCTGATCAC GACCGAGGCA
CTCGTCGCCG AGCTGCCGAA GGAAGAGAAA GCCGCGCCGA TGCCGGCGAT GGACTTCTGA
 
Protein sequence
MAAKEVKFAS DARDRMLRGV DTLANAVKVT LGPKGRNVVI EKSFGAPRIT KDGVTVAKEI 
ELADKFENMG AQMLREVASK QNDKAGDGTT TATVLAQAIV REGSKAVAAG MNPMDVKRGI
DLAVKAVVKD LETHAKKVSA NSEIAQVATI SANGDEEVGR ILAEAMDKVG NEGVITVEEA
KSLATELETV EGMQFDRGYL SPYFITNAEK LKVELDDPYI LIHEKKLSNL QAMLPLLEAV
VQSGKPLLII AEDVEGEALA TLVVNRLRGG LKVAAVKAPG FGDRRKAMLE DIAILTGGNV
VSEDLGIKLE NVTVNMLGRA KKVVIDKDNT TIVDGVGART DIDARIAQIR QQIDTTTSDY
DREKLQERLA KLAGGVAVIR VGGATEVEVK ERKDRVDDAL HATRAAVEEG ILPGGGIALL
RALKALDGLK AANDDQQSGI DIVRRALRAP ARQIADNAGE DGAWIVGKLL ESSDYNWGFN
AATGEYEDLV KAGVIDPAKV VRTALQDAAS VAALLITTEA LVAELPKEEK AAPMPAMDF