Gene Hoch_6456 details


Gene Information

Locus tag: Hoch_6456
Symbol:
ID: 8548871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haliangium ochraceum DSM 14365
Kingdom: Bacteria
Replicon accession: NC_013440
Strand:
Start bp: 8864161
End bp: 8866425
Gene Length: 2265 bp
Protein Length: 754 aa
Translation table: 11
GC content: 69%
IMG OID: 646391117
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_003270818
Protein GI: 262199609
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
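
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 8864161
end_bp = 8866425
gene_length_bp = 2265
protein_length_aa = 754

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not add a residue.
codons = span // 3
expected_protein_length = codons - 1

print("span matches Gene Length:", span == gene_length_bp)
print("span is a whole number of codons:", span % 3 == 0)
print("codons minus stop matches Protein Length:", expected_protein_length == protein_length_aa)
```

Under these assumptions, all three checks agree with the values reported in the record.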


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.234127
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGACGAT ATTTTCCCGG CGTGGTGCGC GCCCTGTTCA TATCGATGTT GGCGGCCGTG 
GTCGGTTGCG GAGGCGAGAG CGGAACGATC ATCTCGGGCA AGCTCGAGAC GCTTTCGCTC
AGCGGTGCCT CGGAGGCGCC GCCCGTGGGC ACGGCGCTGA GCAACCGCGG CGACGCGGCA
CCCACGGCTG TGGCTGCTGC GGCCGTCGCC GAATTGCACA GCGCCGAAGC GCGCCTGCAG
GCGAGCCGCG ACGTCATGCT GGCGACGCTC GACGACGCCG AGTTCGTGCC CGGTGAGGTC
ATCGTCGCGT TCCGCGAGGA CGGGCTTTTC GGACCGCTAC AGGCGCAGGC CGAGATGCAG
GTGGCGGGTT CGCTGCTGCA GGCGGTGGAG CCGATGGCGG TGGCCAACGC GCACGTGTAT
CGCGCCGCGG ACAGCGACCG CAAACGCACC ATCGAGATGA TCCGCGAGCT CAACCAGCGC
GAGGACGTGC GTTACGCGCA GCCCAATTAC ATCTACCGCG CGCTGCGCAC GCCCAACGAT
CTCGACGCCA AGGGCCAGTG GCACTATCCA GCCATCAACC TGCCCGCGGC CTGGGACCTC
ACGATCGGCT CGTCCGACAC CGTGGTCGCC GTGGTCGACA CCGGCATCCT GTTCGACTCC
CGCAACGCGG GCGCCAACCA CCCCGAGCTG GTCGGCAAGG TGCTGCCCGG CTTCGACTTC
ATCGACGATC TCAACGTCGG CGGCGATGGC GACGAGCGCG ACGACAATCC CTTTGACGTC
GGCGACAATC CCGGCGGACA GTCGAGCTAC CACGGCAGCC ACGTGGCCGG CACCATCGCG
GCCGCGACCA ACAACGGCGT GGGCGTCGCC GGCGTCAACT GGTCGGCCAA CATCCTGCCC
GTGCGCGCGC TCGGCGTCGG CGGCGGCGGC AGCTCGCGCG ACATCCTCCA GGGCGCGCTG
TGGGCGGCCG GGTTCTCCAT CGCCGGGGTG CCCGACAATC GCAACCAGGC CGACGTCATC
AACCTCAGCC TGGGCGGCAA CTCCTTCTGT CCGCCGCTGG ATCAAGAGGT CTACGACGAC
GTCACCGCCC GGGGCACGAT CGTGGTCGTG GCCGCCGGCA ACGAGAACCA GAACGCCGCC
AACGTCACCC CGGCGAGCTG CGCCAACGTC ATCACCGTGG GCGCCACCGA CTTCTCCGGG
CGGCGCGCGC CGTACTCCAA TTTCGGCACC GTGGTCGACG TCATGGCTCC GGGCGGTGAC
CTGGGCCGCG ACGACAACGG CGACGGCGAC GGCGACGGCG TGGTCAGCCT CGGCTTCAAC
GATCTGACCC GGCAGTTCAG CGTGCAGAGC CTGCAGGGCA CATCCATGGC GGCGCCGCAC
GTGGCCGGCG TGGTCTCGCT GATGCGCGCG CTGCGCCCCG ACCTGAACAC CCAGGACGCG
GTCGCCATTC TCCGCGGCAC GGCCAACCAG GTGTCGGCCG TCGACTGCGG GCGCGCCTCC
AGCAGCGAGT GCGGGGCCGG CCTGATCGAC GCCGAGGCCG CCCTGATCAA CGTCGATGGC
GGCCTGCCGC CGCCCAGCAA CGGCCCGCTG GCCTTCAACC CCAACCCCGT GGACTTCGGC
TCGTCGGCAA GCGAGCTGAG CGTGACCATG ACCAACGTGT CCACGCAGCC GCTGAGCTGG
TCGATCAACT CGTTCGAGAC CTCGTCGAGC AACCCGGTCG CGCTGGCGCA GGGCACCTTC
TACTTCGCCG CGGGCGCGAC CACCAGCGGC AGCCTGGATG TCGGACAGTC GGCCCAATTT
ACCCTCGGCG TGGCCCGCGA TTCGGTCTCG GTGCCCGGCA ACTACGCGGC CGAGCTGATC
TTCGAGCTCG GCGGCGAGGA GCAGCGGCTG CTCACGCGCT TCAGCACCTT CCCCGAGGAC
ATCGAAGGCC CCAGCGGCCC CACCGTGGTC GGCGCGTTCA TCGCCGACGC CACGGGCAAC
CCGCAGCTCA TCGCCTCGAA GGAGGAGACG CAGTTCTTCT CGTCGTACAA GCTGTACACC
GAGCCCGGCG AAAACGTGCT CATCGCCTGG TCGGACGACA ACGGCAACTT CGAGATCGAC
GAGGGCGATC ACCTGGGCGT CGAGCCCAGC GTGCTCATCG CCGAGGACCA GCAGATCGGC
GGGGTCAACA TCGAGATGAG CCAGGTGCTG AACACCGCCG CGCTGCCGGC GCCGCTGTCG
CCGGGTGTGC TGCGCGCGCT CGAGGCCATG GCGCCGCGTC CCTGA
 
Protein sequence
MRRYFPGVVR ALFISMLAAV VGCGGESGTI ISGKLETLSL SGASEAPPVG TALSNRGDAA 
PTAVAAAAVA ELHSAEARLQ ASRDVMLATL DDAEFVPGEV IVAFREDGLF GPLQAQAEMQ
VAGSLLQAVE PMAVANAHVY RAADSDRKRT IEMIRELNQR EDVRYAQPNY IYRALRTPND
LDAKGQWHYP AINLPAAWDL TIGSSDTVVA VVDTGILFDS RNAGANHPEL VGKVLPGFDF
IDDLNVGGDG DERDDNPFDV GDNPGGQSSY HGSHVAGTIA AATNNGVGVA GVNWSANILP
VRALGVGGGG SSRDILQGAL WAAGFSIAGV PDNRNQADVI NLSLGGNSFC PPLDQEVYDD
VTARGTIVVV AAGNENQNAA NVTPASCANV ITVGATDFSG RRAPYSNFGT VVDVMAPGGD
LGRDDNGDGD GDGVVSLGFN DLTRQFSVQS LQGTSMAAPH VAGVVSLMRA LRPDLNTQDA
VAILRGTANQ VSAVDCGRAS SSECGAGLID AEAALINVDG GLPPPSNGPL AFNPNPVDFG
SSASELSVTM TNVSTQPLSW SINSFETSSS NPVALAQGTF YFAAGATTSG SLDVGQSAQF
TLGVARDSVS VPGNYAAELI FELGGEEQRL LTRFSTFPED IEGPSGPTVV GAFIADATGN
PQLIASKEET QFFSSYKLYT EPGENVLIAW SDDNGNFEID EGDHLGVEPS VLIAEDQQIG
GVNIEMSQVL NTAALPAPLS PGVLRALEAM APRP
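
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before use in most tools. The sketch below shows one way to do that, using the first and last protein lines as sample input. The helper name `join_blocks` is hypothetical, and the total assumes that each of the 12 lines before the last one holds six full blocks, as the first line does.

```python
def join_blocks(lines):
    """Concatenate sequence lines, dropping the whitespace between blocks."""
    return "".join("".join(line.split()) for line in lines)


# First and last lines of the protein sequence, copied from the record above.
first_line = "MRRYFPGVVR ALFISMLAAV VGCGGESGTI ISGKLETLSL SGASEAPPVG TALSNRGDAA "
last_line = "GVNIEMSQVL NTAALPAPLS PGVLRALEAM APRP"

residues_per_full_line = len(join_blocks([first_line]))
residues_in_last_line = len(join_blocks([last_line]))

# Assumption: the protein sequence has 12 full lines followed by the last line.
full_lines = 12
total_residues = full_lines * residues_per_full_line + residues_in_last_line

print(residues_per_full_line, residues_in_last_line, total_residues)
```

The resulting total can be compared against the Protein Length field in the Gene Information section.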