Gene Hoch_6606 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6606 
Symbol 
ID8549023 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9060728 
End bp9061909 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content70% 
IMG OID646391266 
ProductHtrA2 peptidase 
Protein accessionYP_003270965 
Protein GI262199756 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.211294 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.127864 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAT CGCCGCTGCC GTCGCCGTCC CGCGTTTGCC CTCTGCCCGC CGCGTCCGCG 
CCGTCCCCGC ACACCGCCCG GCGGACGCGC GCACGCGCCC CGCTGCTGAG CCTGGCCCTG
AGCCTGGGCC TGGCGCTGAC CCTGGCGTCG GCCGCGCCCG GCGATGCCCG GGCGCAGCTC
CTGCGCGGCG ACGACACCAC CATCTCCGAC GTCACCGAAA AGGCGCTGCC CAGCGTGGTC
AATATCTCGA CCACCACCTC GCAGTCCGCG CGCGGGCCCT CGTTCTTCGA TCCCTTCTTC
AACGACGAAA ACTCACCCTT TCGCGGCCGC CCCGGCAAAC GCTACGGCCA GAGCCTGGGC
TCGGGCGTCA TCATCTCGGC CGACGGCTAC GTCATCACCA ACAGCCACGT GGTCGAAGAC
GCCAAAGACA TCCGCGTCTC ACTCTCAGAC GGCCGCGAGC TGAGCGCCAA GATCGTGGGC
AGCGACCCCA AGAGCGACCT GGCCGTGCTC AAGCTCGAGG GCGCGAGCGG GCTGCAGCCC
ATCCGCATCG GCCGCTCGAG CAACATCCGC CTGGGCGAGA TCGTGCTCGC CATCGGCAAC
CCCTTCGGCG TCGGCCAGAC CGTGACCATG GGCATCGTCT CGGCCAAGGG CCGCTCGGGC
ATGGGCATCG TCGACTACGA GGACTTCATC CAGACCGACG CCGCCATCAA CCCGGGCAAC
TCGGGCGGCG CGCTGATCAA CCTGCGCGGC GAGCTGATCG GTATCAACAC CGCGATCCTG
TCGCGCACCG GCGGCTACCA GGGCATCGGC TTCGCCATCC CCACGGACAT GGTCGCGCCC
ATCAAAGACA GCCTCATCCG CGACGGCGCC GTGGCCCGCG GCTTCCTCGG CGTCAACATC
CAGACCCTGA CCAGCGAGCA GGCGCGCGCC GCCGGCGTCC CCGACCTGCG CGGCGTCTTG
ATCACGCGCG TGGTCGAACG CAGTCCGGCC GCCCGCGCCG GCCTGCGCCG CGGCGACATC
ATCACCCGCG TCGGCGACCG CATCACGCTC ACGGCCGCGC ACGTGGTCAA CTCCGTGGGC
ATGAGCCGTC CCGACAAACG CCTGGCCCTG ACCATCATGC GCGACGGCAA GACGCGGCGC
GTCGCAGTAA AACTTGGCGA TTTATCGCAG GTGCCGGAAT AA
 
Protein sequence
MNSSPLPSPS RVCPLPAASA PSPHTARRTR ARAPLLSLAL SLGLALTLAS AAPGDARAQL 
LRGDDTTISD VTEKALPSVV NISTTTSQSA RGPSFFDPFF NDENSPFRGR PGKRYGQSLG
SGVIISADGY VITNSHVVED AKDIRVSLSD GRELSAKIVG SDPKSDLAVL KLEGASGLQP
IRIGRSSNIR LGEIVLAIGN PFGVGQTVTM GIVSAKGRSG MGIVDYEDFI QTDAAINPGN
SGGALINLRG ELIGINTAIL SRTGGYQGIG FAIPTDMVAP IKDSLIRDGA VARGFLGVNI
QTLTSEQARA AGVPDLRGVL ITRVVERSPA ARAGLRRGDI ITRVGDRITL TAAHVVNSVG
MSRPDKRLAL TIMRDGKTRR VAVKLGDLSQ VPE