Gene Strop_3841 details


Gene Information

Locus tag: Strop_3841
Symbol:
ID: 5060319
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora tropica CNB-440
Kingdom: Bacteria
Replicon accession: NC_009380
Strand:
Start bp: 4401307
End bp: 4402938
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 70%
IMG OID: 640476098
Product: chaperonin GroEL
Protein accession: YP_001160649
Protein GI: 145596352
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
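
The coordinates and lengths in this record agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the stop codon is not counted in the protein length. Both readings are assumptions made here; the record does not state them. A minimal Python check under those assumptions:

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4401307
end_bp = 4402938
reported_gene_length = 1632      # bp
reported_protein_length = 543    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```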


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAAGA TCCTGAGCTT CTCGGACGAC GCTCGGCACC AGCTGGAGCA CGGTGTCAAC 
GCCCTCGCGG ATGCGGTCAA GGTCACCCTC GGCCCCCGCG GGCGCAACGT CGTCCTGGAC
AAGAAGTTTG GTGCACCCAC GATCACCAAC GACGGCGTGA CGATCGCCAA GGAGATCGAG
CTCACCGACC CGCACGAGAA CCTCGGCGCG CAGCTGGTCA AGGAGGTGGC GACCAAGACC
AACGACGTCG CCGGCGACGG GACCACCACC GCCACCGTGC TGGCCCAGGC GTTGGTCCGG
GAGGGCCTGC GTAACGTGGC GGCCGGCGCC AACCCGACCG GCCTCAAGCG GGGTATCGAC
GCGGCGGCCA CCAAGGTCTC CGAGGCGCTG CTCGGCAAGG CCGTCGAGGT GTCGGACAAG
GCGGCGATCG CGCACGTCGC GACCGTCTCC GCGCAGGACT CCACGATCGG TGAGCTCATC
GCCGAGGCGA TGGAGCGGGT CGGCCGCGAC GGTGTCATCA CCGTCGAGGA GGGCTCCACC
CTCGCCACCG AGCTGGACGT GACCGAGGGT CTCCAGTTCG ACAAGGGCTT CATCTCGCCC
AACTTCGTCA CTGACGCGGA GGGGCAGGAG TCGGTCCTGG AGGACCCGTA CATCCTCATC
ACCACGCAGA AGATCTCGGC GATCGAGGAG CTGCTACCGC TGCTGGAGAA GGTCCTCCAG
GACAGCAAGC CGCTGCTCAT CATCGCCGAG GACGTCGAGG GCCAGGCGCT GTCCACGCTG
GTGGTCAACG CGCTCCGCAA GACCATGAAG GTCTGCGCGG TGAAGGCTCC CGGCTTCGGT
GACCGCCGCA AGGCGATGTT GCAGGACATG GCGATCCTGA CCGGTGCCGA GCTGGTCGCC
CCCGAGCTGG GCTACAAGCT TGACCAGGTC GGGCTGGAGG TGCTCGGCAC CGCTCGCCGG
GTGGTGGTCG ACAAGGAGAC CACCACCGTC GTCGACGGCG GCGGCCAGGC CGCCGACGCC
GCGGACCGGG TCGCCCAGAT CCGCAAGGAG ATCGAGGCTT CGGACTCCGA GTGGGACCGG
GAGAAGCTCG CCGAGCGGCT GGCCAAGCTC TCCGGTGGCG TTGCCGTGAT CCGGGCGGGC
GCGGCGACCG AGGTCGAGAT GAAGGAGCGC AAGCACCGCA TCGAGGACGC CATCGCCGCC
ACCAAGGCCG CGGTCGAGGA GGGCACGATC CCCGGCGGCG GTGCCGCCCT GGCCCAGGTC
CTGCCGGCGC TCGACGACGA CCTCGGCCTC GACGGGGACG AGAAGGTCGG CGTCTCGATC
GTGCGCAAGG CGCTGGTCGA GCCGCTGCGC TGGATCGCCC AGAACGCCGG CCACGACGGC
TACGTGGTGG TGCAGAAGGT CGTCGACAAG GACTGGGGCC ACGGCCTCGA CGCGGCTACC
GGCGAGTACG TCGACCTGGC AAAGGCTGGC ATCCTCGACC CGGTGAAGGT GACCCGCAAC
GCGGTCGCCA ACGCCGCGTC GATCGCGGGC CTGCTGCTCA CCACCGAGAG CCTCGTGGTG
GACAAGCCGC AGGAGCCGGA GCCGGCCGCG GGTGGCCACG GCCACGGTCA CCAGCACGGC
CCGGGTTTCT GA
 
Protein sequence
MAKILSFSDD ARHQLEHGVN ALADAVKVTL GPRGRNVVLD KKFGAPTITN DGVTIAKEIE 
LTDPHENLGA QLVKEVATKT NDVAGDGTTT ATVLAQALVR EGLRNVAAGA NPTGLKRGID
AAATKVSEAL LGKAVEVSDK AAIAHVATVS AQDSTIGELI AEAMERVGRD GVITVEEGST
LATELDVTEG LQFDKGFISP NFVTDAEGQE SVLEDPYILI TTQKISAIEE LLPLLEKVLQ
DSKPLLIIAE DVEGQALSTL VVNALRKTMK VCAVKAPGFG DRRKAMLQDM AILTGAELVA
PELGYKLDQV GLEVLGTARR VVVDKETTTV VDGGGQAADA ADRVAQIRKE IEASDSEWDR
EKLAERLAKL SGGVAVIRAG AATEVEMKER KHRIEDAIAA TKAAVEEGTI PGGGAALAQV
LPALDDDLGL DGDEKVGVSI VRKALVEPLR WIAQNAGHDG YVVVQKVVDK DWGHGLDAAT
GEYVDLAKAG ILDPVKVTRN AVANAASIAG LLLTTESLVV DKPQEPEPAA GGHGHGHQHG
PGF
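
The gene and protein sequences are printed in whitespace-separated blocks of ten. The sketch below shows one way to strip that layout, compute GC content, and translate the coding sequence so it can be compared with the listed protein. The function names are illustrative and not part of any database tooling. The codon table is the standard NCBI amino acid assignment, which translation table 11 shares for sense and stop codons; alternative start codons are not handled.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, and third base each cycle T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence as displayed in this record.
first_line = "ATGGCGAAGA TCCTGAGCTT CTCGGACGAC"
print(translate(first_line))  # MAKILSFSDD
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way allows a comparison against the protein sequence and the GC content field of this record.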