Gene Hoch_6645 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_6645 
Symbol 
ID8549062 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp9106613 
End bp9108022 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content67% 
IMG OID646391305 
ProductHtrA2 peptidase 
Protein accessionYP_003271004 
Protein GI262199795 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00239269 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGGCATAG CCCGCGACCG AATGCGCATT TTTCCGCGCG TGGTCGATGT GTCCGAATGT 
GCGTTCGCGC GCGTTGCTGC GGGCACCGAC GAGGCGGCGA GGAGGCGAAG ATGCGGGCCC
GGCGAGTGCG ACCCGGCGGT GGCCTTGCCG TGTTCGGGCT TGCACCGCGG CGGGGCTTCG
GACCATCCTC GAAACGTGAA GGGCGAATCA CATCAATCAA TACACGTGCG AGATGTTGCC
GCGAAACCCA ACTCGCGCGT CCAGGGGAAC ACCGTGCAGC TACTCGCGTT GCTGCTGCTG
GTGGTCGCGT GTGCGCTCAT CTGGACGCTG CTTCGCGTGC GCGGCGTGTC CGAACACGGC
GAGGCTCGAG CGCCGGCGGT GACGCCCGTG AGCGACTCCT TGCCAGAGCC GCGCGCCATC
ACCGCGCGCG GCGATCTGGC CGCCGATGAG GAAGCCAATA TCGAGTTGTT CCGCCAGGTG
GCGCCGTCGG TCGTGCACAT CGAGAGCCTC AAGGCGCAGC GCCGCGATCG CCTCAGCCTC
AACGCGCTCG ACATTCCCCG GGGCACGGGC TCGGGGTTCA TCTGGGACGA CCGCGGTCAC
GTGGTGACCA ACTATCACGT CATCCAGCAG GCCGACCGCA TCTTCGTCAT CCTGCAAGAT
GGCACCAAGT GGCCGGCGAG CGTGGTCGGC GCGGCGCCGG ATAAGGATAT GGCCGTGCTC
GAGGTCGAGG CGCCGCGCGA GAAGCTGCGG CCGGTGTCGC TGGGCATCTC GAATGAGCTG
CAAGTCGGAC AGAAGGTCTT CGCCATCGGC AATCCCTTTG GCTTCGACCA CACCCTGACC
ACCGGCGTCA TCAGCGGTCT CAACCGCGAG ATCCGCTCGG TGACCGAGCG GACCATCTAC
GACGTGATCC AGACCGACGC AGCCATCAAC CCCGGCAACA GCGGCGGGCC TCTGCTCGAC
AGCGCCGGTC TGCTCATCGG CATCAACACC GCCATCTACA GCCCTTCCGG CGCCTACGCC
GGCATCGGCT TCGCGGTCCC GGTGGATACC GTCAATCGCA TCGTGCCGCA GCTCGTGTCC
AATGGCCGCG TGTTCAAACC CGGGCTCGGC ATCTACCCGC TCAACGCCTC GTTGGCGGCG
CGGAACAACA TCCAGGGCGT GGTCATCCGC GAGGTCGCCG AAGACTCGGC AGCCGCACGT
GCCGGGCTGC GCGGTCTGGT GCATACGCGC GCAGGCCCGT CCATGCTCGG CGACGTGATC
GTCGGTATCG ACGGCGCGCT GGTCGAAAAC ATCGACGACA TCTATCGCGT GCTCGACGAA
CGCAACGTGG GCGACGAGGT GGAGCTCACG GTCGTGCGCG AGGGCAAGGA GGTCGGGGTG
TCGATCGAGC TCCAGGACCT CGCCGACTAG
 
Protein sequence
MGIARDRMRI FPRVVDVSEC AFARVAAGTD EAARRRRCGP GECDPAVALP CSGLHRGGAS 
DHPRNVKGES HQSIHVRDVA AKPNSRVQGN TVQLLALLLL VVACALIWTL LRVRGVSEHG
EARAPAVTPV SDSLPEPRAI TARGDLAADE EANIELFRQV APSVVHIESL KAQRRDRLSL
NALDIPRGTG SGFIWDDRGH VVTNYHVIQQ ADRIFVILQD GTKWPASVVG AAPDKDMAVL
EVEAPREKLR PVSLGISNEL QVGQKVFAIG NPFGFDHTLT TGVISGLNRE IRSVTERTIY
DVIQTDAAIN PGNSGGPLLD SAGLLIGINT AIYSPSGAYA GIGFAVPVDT VNRIVPQLVS
NGRVFKPGLG IYPLNASLAA RNNIQGVVIR EVAEDSAAAR AGLRGLVHTR AGPSMLGDVI
VGIDGALVEN IDDIYRVLDE RNVGDEVELT VVREGKEVGV SIELQDLAD