Gene Hoch_4986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_4986 
Symbol 
ID8547394 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp6872924 
End bp6874192 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content62% 
IMG OID646389660 
ProductATP-dependent Clp protease, ATP-binding subunit ClpX 
Protein accessionYP_003269368 
Protein GI262198159 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1219] ATP-dependent protease Clp, ATPase subunit 
TIGRFAM ID[TIGR00382] endopeptidase Clp ATP-binding regulatory subunit (clpX) 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.244142 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.534035 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCAGCG AAAAGCGAGA CAGTGGCCAA GCCAACCTCA CCTGCTCCTT CTGCGGTAAG 
TCGCAGAAGG AAGTGAAGAA ACTCATCGCC GGCCCCACTG TCTACATCTG TGACGAGTGC
ATCGGGCTGT GCAATGACAT CATCGCCGAG GAGATCGAGA AGGAAGATCA GGCCTACGGA
ACGGCCACGA TCCCCAAGCC CCAGCACATC AAGAAGATCC TCGACGAGTA CGTGATCGGT
CAGGAGCGCG CCAAGAAGAT CTTGGCGGTG GCGGTGCACA ACCACTACAA GCGCATCGAT
CACAAGGCCG GCGACGACGA GGAAGAGGTC GAGCTGCAGA AGTCGAACAT CCTGCTGCTC
GGCCCCACCG GCTCGGGCAA GACCTTGCTG GCGCAGACCC TGGCGCGCAT CCTCAATGTG
CCCTTCGCCA TCGCCGACGC CACCAACCTC ACCGAGGCCG GCTACGTCGG CGAGGACGTC
GAGAACATCA TCGTGAGCCT GCTGCAGAAC GCCGATCACG ACATCGAGCG GGCGCAGCGC
GGCATCGTGT ACATCGACGA GATCGACAAG ATCGCGCGCA AGAGCGACAA CCCGTCGATC
ACGCGCGATG TGAGCGGCGA GGGTGTGCAG CAGGCGCTGC TCAAGATCAT CGAGGGCACG
CTGGCCGCGG TGCCGCCCAA GGGCGGTCGC AAGCACCCGC AGCAGGAGTT TCTGCAGGTC
GATACCTCGA ACATCCTGTT CATCTGCGGC GGCGCGTTCA CGGGTCTCGA GGAGATCATC
GAGAACCGCA TCGGCCAGCG CATGATCGGC TTCGGCGCCA CGATGAAGCC CAAGAAGGCG
CTCGACCGCT GGGAGCTGAT CAAAGAGGTG CAGCCCGAGG ATCTGCTCAA GTACGGCATG
ATCCCCGAGT TCGTCGGCCG CCTGCCGATG ATCGCGCCGC TGCACGAGCT GTCTGAGGAC
GCCCTGGTGC AGATCCTTAC CCAGCCCAAG AACGCGCTGA TCAAGCAGTA TCAGAAGCTG
TTCGAGATGG ACGGGGTGAA GCTCAAGTTC ACCCACGGCG CGCTGTACAA GATCGCGTCG
CTGGCCCAGG CGCAGAAGAG CGGCGCCCGC GGTCTGCGCG CCATCCTCGA GTCGGCGTTG
CTCGACATCA TGTACGACAC CCCCAGCCAG CACAACATCA GCGAAGTGAT CATCAACGAG
GACGTGGTCG AGAAGCACTC CGAGCCGATG GTTACCTACG TCAAAGAGCC GGCCGTAGAG
TCGGCCTAA
 
Protein sequence
MPSEKRDSGQ ANLTCSFCGK SQKEVKKLIA GPTVYICDEC IGLCNDIIAE EIEKEDQAYG 
TATIPKPQHI KKILDEYVIG QERAKKILAV AVHNHYKRID HKAGDDEEEV ELQKSNILLL
GPTGSGKTLL AQTLARILNV PFAIADATNL TEAGYVGEDV ENIIVSLLQN ADHDIERAQR
GIVYIDEIDK IARKSDNPSI TRDVSGEGVQ QALLKIIEGT LAAVPPKGGR KHPQQEFLQV
DTSNILFICG GAFTGLEEII ENRIGQRMIG FGATMKPKKA LDRWELIKEV QPEDLLKYGM
IPEFVGRLPM IAPLHELSED ALVQILTQPK NALIKQYQKL FEMDGVKLKF THGALYKIAS
LAQAQKSGAR GLRAILESAL LDIMYDTPSQ HNISEVIINE DVVEKHSEPM VTYVKEPAVE
SA