Gene Hoch_5367 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_5367 
Symbol 
ID8547779 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp7381629 
End bp7383245 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content67% 
IMG OID646390040 
Productchaperonin GroEL 
Protein accessionYP_003269744 
Protein GI262198535 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0459] Chaperonin GroEL (HSP60 family) 
TIGRFAM ID[TIGR02348] chaperonin GroL 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.475271 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.589855 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGCCA AAGAAATCGT ATTTTCCACC CAGGCCCGCG CGGAGATCGC CAAGGGTCTG 
AACATGCTCG CCAACGCAGT GAAGGTCACG CTTGGGCCCC GCGGTCGCAA CGTCGTGATC
GAGAAGTCCT GGGGCGCCCC GACGGTGACC AAGGACGGCG TGACCGTGGC CAAAGAGGTC
GAGGTCACCA ACAAGCTCCA GAACATGGGC GCGCAGATGA TGAAGGAGGT CGCTTCCAAG
ACCTCCGACA TCGCCGGTGA CGGCACCACC ACCGCCACCG TGCTGGCGCA GGCCATCTTC
ACCGAGGGCG CCAAGCTGGT CGCCGCCGGC GTCAACCCGA TGGACCTCAA GCGCGGCATC
GAGGCCGCGG TCGAGGAGAT CGTCGACGAG CTGTCCAAGC TCTCGACCCC GACCAAGGGC
AAGACCGACA TCGCCCAGGT CGGCACCATC AGCGCCAACG GCGACTCGAC CATCGGCGAC
ATGATCGCCG AGGCCATGGA GAAGGTCGGC AAAGAGGGTG TGATCACGGT CGAGGAGTCC
AAGACCATGC AGAGCGAGCT CGACGTGGTC GAGGGCATGC AGTTCGATCG CGGCTACCTC
TCGCCGTACT TCGTGACCGA CCCCGATCGC ATGGAGGTCG TGCTCAACGA CCCCTTCCTG
CTCATCTGCG AGAAGAAGAT CTCCAACATG AAGGATCTGC TTCCCGTGCT CGAGCAGGTG
GCCAAGTCGG GCCGTCCGCT GCTCATCCTC GCCGAGGACG TCGATGGCGA GGCGCTGGCC
ACCCTGGTGG TCAACAAGCT GCGCGGCACC CTGCAGGTGG CCGCGGTCAA GGCCCCGGGC
TTCGGTGACC GCCGCAAGGC CATGCTCACC GACATCGCCA CCCTCACCGG CGGTCAGGCC
GTCACCGAGG ACATCGGCGT CAAGCTCGAC GGCGTGACCC TCCAGGAGCT GGGCCAGGCC
AAGCGCGTTG TCATCACCAA GGACAACACC ACCATCGTCG AGGGCGCGGG CGAGACCAGC
GCCATCGAGG GCCGGGTCAA GCAGATCCGC CGCGAGGTCG AGGACACCAC CAGCGACTAC
GACCGCGAGA AGCTGCAGGA GCGCCTGGCC AAGCTGGTCG GCGGTGTCGC CGTCATCCGC
GTGGGTGCGG CAACCGAGGT CGAGATGAAG GAGAAGAAGG CGCGCGTGGA AGACGCCATG
CACGCCACCC GCGCGGCCGT CGAAGAGGGC ATCGTCCCCG GCGGCGGTGT CGCTCTCATC
CGCTCGGGCA GCCGTCTCGA CAAGCTCACC TTCGACGACG ACCGCCGCTT CGGCGTCAAC
ATCGTGCGCC AGGCCATCGA GGCGCCGCTG CGCCAGATCT CGCACAACGC GGGCGTGGAC
GGCTCGATCA TCGTCTCCAA GGTGCGCGAG GGCGAGGGCA ACTTCGGCTA CAACGCCGCC
ACCCTCGAGT ACCAGGACCT GGTCGAGAAC GGCGTCATCG ACCCGACCAA GGTCGTGCGC
TCGGCGCTGC AGAACGCGGC CTCGGTCGCC GGTCTGATGC TGACCACCGA GGCCCTCGTG
GCCGAGAAGG TCAAGGACGA GGACGACGCC GGCTCTCACG ACCACGGCGA CTACTGA
 
Protein sequence
MAAKEIVFST QARAEIAKGL NMLANAVKVT LGPRGRNVVI EKSWGAPTVT KDGVTVAKEV 
EVTNKLQNMG AQMMKEVASK TSDIAGDGTT TATVLAQAIF TEGAKLVAAG VNPMDLKRGI
EAAVEEIVDE LSKLSTPTKG KTDIAQVGTI SANGDSTIGD MIAEAMEKVG KEGVITVEES
KTMQSELDVV EGMQFDRGYL SPYFVTDPDR MEVVLNDPFL LICEKKISNM KDLLPVLEQV
AKSGRPLLIL AEDVDGEALA TLVVNKLRGT LQVAAVKAPG FGDRRKAMLT DIATLTGGQA
VTEDIGVKLD GVTLQELGQA KRVVITKDNT TIVEGAGETS AIEGRVKQIR REVEDTTSDY
DREKLQERLA KLVGGVAVIR VGAATEVEMK EKKARVEDAM HATRAAVEEG IVPGGGVALI
RSGSRLDKLT FDDDRRFGVN IVRQAIEAPL RQISHNAGVD GSIIVSKVRE GEGNFGYNAA
TLEYQDLVEN GVIDPTKVVR SALQNAASVA GLMLTTEALV AEKVKDEDDA GSHDHGDY