Gene Hoch_6589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6589 
Symbol 
ID8549006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9040275 
End bp9041939 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content66% 
IMG OID646391249 
Productchaperonin GroEL 
Protein accessionYP_003270948 
Protein GI262199739 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.931782 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGGTA AAGAGATTAT TTTCGAAGAG AACGCTCGCA ACAAAGTGAT GCGCGGTGTC 
GATACCCTGG CGAATGCCGT GAAAGTGACC CTCGGCCCCC GTGGTCGCAA CGTGGTCATT
GAGAAGTCGT GGGGCGCCCC CACGGTGACC AAGGACGGCG TCACCGTCGC CAAGGAGATC
GAGCTCGAGA ACAAGTTCGA GAACATGGGC GCGCAGATGG TCAAGGAGGT CGCCTCCAAG
ACCTCTGACA ACGCCGGTGA TGGCACCACC ACCGCCACCG TGCTGGCGCA GGCCATCTTC
CGTGAGGGCA GCAAGCTGGT CGCCGCGGGT CACAATCCGA TGGAGATCAA GCGCGGCATC
GACGCCGCCG TCGAGAGCAT CGTCGCCTCG CTCGGTGAGC TCGCCACCTC GACCAAGGAT
CACAAGGAGA TCGCTCAGGT CGGCACCATC AGCGCCAACG GCGACGCCAC CATCGGCGAC
ATGATCGCCG AGGCCATGGA GAAGGTCGGC AAAGAGGGCG TGATCACGGT CGAAGAGTCC
AAGACCATGC AGAGCGAGCT CGACGTGGTC GAGGGCATGC AGTTCGACCG CGGCTACCTG
TCGCCGTACT TCGTGACCGA CTCGGAGCGC ATGGAGGTCG TGCTCGAGGA TGCGCTGGTG
CTCATCCACG AGAAGAAGAT CTCGAACATG AAGGATCTCC TGCCGGTGCT CGAGCAGGTG
GCCAAGCAGG GTCGTCCGCT GCTCATCGTC GCCGAGGACG TCGACGGTGA GGCGCTGGCC
ACCCTGGTGG TGAACAAGCT GCGCGGCACC CTCCACGTGT GCGCGGTCAA GGCCCCGGGC
TTTGGCGACC GCCGCAAGGA GATGCTCAAG GACATCGCGG TGCTCACCGG CGGCACGGCC
GTCACCGATG ACCTCGGCCT CAAGCTCGAG AACATCACGG TCAACGACCT CGGCATCGCC
AAGCGCGTCA CGGTGGACAA GGACAACACC ACCATCGTCG ACGGCGCCGG CAAGAAAGAG
GACATCGACG CCCGCGTCAA GCAGATCCGC ATCCAGGTCG AGGAGACCAG CAGCGACTAC
GATCGCGAGA AGCTGCAGGA GCGCCTGGCC AAGCTGGTCG GCGGTGTCGC CGTCATCCGC
GTGGGTGCGG CCACCGAGGT CGAGATGAAG GAGAAGAAGG CGCGCGTGGA AGACGCCATG
CACGCCACCC GCGCGGCCGT CGAAGAGGGC ATCGTCCCCG GCGGCGGTGT GGCCCTGCTG
CGCTGCCTCA AGGGTCTCGA CAGCCTCAAT CTGGGCGAGG AGCAGAAGTT CGGCGTCTCG
ATCGTGCGTC GCGCGCTCGA GGAGCCGCTG CGCCAGATCT CGGCCAACGC CGGTTCGGAC
GGCTCGATCG TGGTCGAGAA GGTCAAGAAC GGCGAGGGCG CGTTCGGCTT CAACGCCGCC
AAGGGCGAGT TCGAGGACCT GCTCAAGGCC GGCGTCATCG ACCCCGCCAA GGTGGTTCGC
ACCGCGCTGC AGAACGCGGC TTCGGTGAGC GGCCTGCTGC TCACGACCGA GGCTCTCATC
GCCGAGAAGC CCAAGAAAGA GACCGCGCCG GCCGGTGGTC ACGACCACGG CGGCATGGGC
GGCATGGGCG GCATGGGCGG CATGGGCGGC ATGGGCGGCT TCTGA
 
Protein sequence
MAGKEIIFEE NARNKVMRGV DTLANAVKVT LGPRGRNVVI EKSWGAPTVT KDGVTVAKEI 
ELENKFENMG AQMVKEVASK TSDNAGDGTT TATVLAQAIF REGSKLVAAG HNPMEIKRGI
DAAVESIVAS LGELATSTKD HKEIAQVGTI SANGDATIGD MIAEAMEKVG KEGVITVEES
KTMQSELDVV EGMQFDRGYL SPYFVTDSER MEVVLEDALV LIHEKKISNM KDLLPVLEQV
AKQGRPLLIV AEDVDGEALA TLVVNKLRGT LHVCAVKAPG FGDRRKEMLK DIAVLTGGTA
VTDDLGLKLE NITVNDLGIA KRVTVDKDNT TIVDGAGKKE DIDARVKQIR IQVEETSSDY
DREKLQERLA KLVGGVAVIR VGAATEVEMK EKKARVEDAM HATRAAVEEG IVPGGGVALL
RCLKGLDSLN LGEEQKFGVS IVRRALEEPL RQISANAGSD GSIVVEKVKN GEGAFGFNAA
KGEFEDLLKA GVIDPAKVVR TALQNAASVS GLLLTTEALI AEKPKKETAP AGGHDHGGMG
GMGGMGGMGG MGGF