Gene Hore_02010 details


Gene Information

Locus tag: Hore_02010
Symbol:
ID: 7312520
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halothermothrix orenii H 168
Kingdom: Bacteria
Replicon accession: NC_011899
Strand:
Start bp: 206110
End bp: 207753
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 44%
IMG OID: 643610624
Product: chaperonin GroEL
Protein accession: YP_002507958
Protein GI: 220931050
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02348] chaperonin GroL
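
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 206110
end_bp = 207753


def gene_length(start: int, end: int) -> int:
    """Length in bp of a span with 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS whose span includes one stop codon (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length (1644 bp) and Protein Length (547 aa) fields listed above.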


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.000193134
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCAAAAG AATTAAAATT TAGTGAAGAT GCTCGCCGTG CTCTGGAACG CGGTGTTGAT 
ACTCTGGCAA ATGCTGTTAA AGTAACTTTA GGTCCAAAAG GACGAAATGT AGTTCTTGAA
AAGAGCTTTG GAGCTCCTAC TATCACCAAC GATGGTGTTA GTATTGCCCG TGAAATAGAA
CTTGAAAATC ACTACGAAAA CATGGGGGCT CAGACTGTAA AAGAGGTTGC TACCAAAACC
AATGATGTTG CCGGTGATGG TACAACCACT GCTACAGTAC TGGCTCAGGC TATTTTCAAG
GAAGGTTTAA AGAATGTGGC CGCCGGTGCC AACCCCATGA TCCTGAAAAG GGGTATTGAA
AAGGCCGTTC AGAAGCTGGT AGAAGAGATT AAGGAACTAA GCAAACCTGT TGAAGGAAAA
GAAGCAGTTT CCCAGGTTGC TGCTATTTCT GCCGGTAATG ATGAAGAAGT CGGTAAGCTT
ATTGCTGAAG CTATGGAGAA AGTTGGTCAG GATGGAGTTA TCTCTGTTGA AGAATCCAAG
AGTATGGGGA CTTCTTTAGA TGTAGTTGAA GGTATGCAGT TCGATAGAGG ATATCTCTCC
CCCTATATGG TAACCGATAC TGATGCTATG GAAGCTTCCC TTGAAGATCC CTATATCCTG
ATCACTGACA AGAAGATATC TAATATCCAG GAAATCTTAC CCCTGTTAGA AAAAGTAGCC
CAGAGTGGTA AACCTCTCTT AATAATTGCT GAAGATGTTG AAGGGGAAGC CCTGGCTACT
CTTGTTGTCA ACAAGATTCG TGGTACCTTT AACTGTGTTG CTGTTAAAGC ACCTGGCTTT
GGTGATCGTC GTAAGGCTAT GTTAGAAGAC ATTGCTATTC TGACCGGTGG TCAGGTAATC
ACTGAAGACC TGGGTCTCAA GCTCGAAAAT GCTGATATTA GTATGCTTGG TCGGGCCCAC
AAAGTAACAG TAACCAAAGA GGATACTACT ATTGTAGAAG GTGCTGGAGA TAGCAAAGAA
ATTCAGAATA GAATTAAGCA GATCAGGACT CAAATTGAAA ATACTGATTC TGATTTTGAC
AGGGAAAAAC TGCAGGAAAG ACTGGCTAAA CTGGCCGGTG GTGTGGCTGT AATTAAGGTT
GGTGCTGCTA CTGAAACTGA ATTAAAAGAA AAGAAACACC GTATTGAAGA TGCTCTCTCT
GCTACCAGGG CCGCTGTAGA AGAAGGACTG GTAGCCGGTG GTGGAACCAC CCTTATTGAT
GCCATTCCTG CCCTTGATGA ACTGAACCTT GAAGGTGACG AAGCTACCGG TGTTGACATT
GTTAGAAAAG CACTTGAAGC CCCGGTACGT CTCATAGCAG ACAATGCCGG TTATGAAGGT
TCAGTAATTG TTGAGAAGGT TAAGTCTGAA GATAAAGGTA TCGGTTTCGA TGCCTATAAC
GGTGAGTTTG TAAATATGAT TGAATCCGGT ATTGTAGACC CGGCTAAGGT AACCCGTTCT
GCCCTTCAGA ATGCTGCCAG TGCTGCTGCT ATGTTGCTGA CTACTGAATG CCTGGTGGCT
GATAAAGAAG AGGATAATGA CAGTAATGGT AATGCCGGAA TGCCCGGTGG CGGTATGCCC
GGCGGAATGG GTGGCATGAT GTAA
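
The gene sequence is displayed in blocks of 10 bases, so it has to be normalized before any calculation. The following is a minimal sketch, not the database's own method: the helper names are hypothetical, and the stop-codon set is the standard one for translation table 11. The demo uses only the last line of the sequence as an excerpt; pasting the full sequence into `excerpt` would give whole-gene values.

```python
# Stop codons under translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(text: str) -> str:
    """Drop the spaces and line breaks used for display blocks; uppercase the rest."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the final codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Excerpt only: the final line of the gene sequence above.
excerpt = "GGCGGAATGG GTGGCATGAT GTAA"
tail = clean_sequence(excerpt)
print(len(tail), round(gc_fraction(tail), 3), ends_like_cds(tail))
```

The start codon is deliberately not checked, since table 11 permits alternative initiation codons in addition to ATG.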
 
Protein sequence
MAKELKFSED ARRALERGVD TLANAVKVTL GPKGRNVVLE KSFGAPTITN DGVSIAREIE 
LENHYENMGA QTVKEVATKT NDVAGDGTTT ATVLAQAIFK EGLKNVAAGA NPMILKRGIE
KAVQKLVEEI KELSKPVEGK EAVSQVAAIS AGNDEEVGKL IAEAMEKVGQ DGVISVEESK
SMGTSLDVVE GMQFDRGYLS PYMVTDTDAM EASLEDPYIL ITDKKISNIQ EILPLLEKVA
QSGKPLLIIA EDVEGEALAT LVVNKIRGTF NCVAVKAPGF GDRRKAMLED IAILTGGQVI
TEDLGLKLEN ADISMLGRAH KVTVTKEDTT IVEGAGDSKE IQNRIKQIRT QIENTDSDFD
REKLQERLAK LAGGVAVIKV GAATETELKE KKHRIEDALS ATRAAVEEGL VAGGGTTLID
AIPALDELNL EGDEATGVDI VRKALEAPVR LIADNAGYEG SVIVEKVKSE DKGIGFDAYN
GEFVNMIESG IVDPAKVTRS ALQNAASAAA MLLTTECLVA DKEEDNDSNG NAGMPGGGMP
GGMGGMM