Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_6456 |
Symbol | |
ID | 8548871 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 8864161 |
End bp | 8866425 |
Gene Length | 2265 bp |
Protein Length | 754 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646391117 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_003270818 |
Protein GI | 262199609 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.234127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACGAT ATTTTCCCGG CGTGGTGCGC GCCCTGTTCA TATCGATGTT GGCGGCCGTG GTCGGTTGCG GAGGCGAGAG CGGAACGATC ATCTCGGGCA AGCTCGAGAC GCTTTCGCTC AGCGGTGCCT CGGAGGCGCC GCCCGTGGGC ACGGCGCTGA GCAACCGCGG CGACGCGGCA CCCACGGCTG TGGCTGCTGC GGCCGTCGCC GAATTGCACA GCGCCGAAGC GCGCCTGCAG GCGAGCCGCG ACGTCATGCT GGCGACGCTC GACGACGCCG AGTTCGTGCC CGGTGAGGTC ATCGTCGCGT TCCGCGAGGA CGGGCTTTTC GGACCGCTAC AGGCGCAGGC CGAGATGCAG GTGGCGGGTT CGCTGCTGCA GGCGGTGGAG CCGATGGCGG TGGCCAACGC GCACGTGTAT CGCGCCGCGG ACAGCGACCG CAAACGCACC ATCGAGATGA TCCGCGAGCT CAACCAGCGC GAGGACGTGC GTTACGCGCA GCCCAATTAC ATCTACCGCG CGCTGCGCAC GCCCAACGAT CTCGACGCCA AGGGCCAGTG GCACTATCCA GCCATCAACC TGCCCGCGGC CTGGGACCTC ACGATCGGCT CGTCCGACAC CGTGGTCGCC GTGGTCGACA CCGGCATCCT GTTCGACTCC CGCAACGCGG GCGCCAACCA CCCCGAGCTG GTCGGCAAGG TGCTGCCCGG CTTCGACTTC ATCGACGATC TCAACGTCGG CGGCGATGGC GACGAGCGCG ACGACAATCC CTTTGACGTC GGCGACAATC CCGGCGGACA GTCGAGCTAC CACGGCAGCC ACGTGGCCGG CACCATCGCG GCCGCGACCA ACAACGGCGT GGGCGTCGCC GGCGTCAACT GGTCGGCCAA CATCCTGCCC GTGCGCGCGC TCGGCGTCGG CGGCGGCGGC AGCTCGCGCG ACATCCTCCA GGGCGCGCTG TGGGCGGCCG GGTTCTCCAT CGCCGGGGTG CCCGACAATC GCAACCAGGC CGACGTCATC AACCTCAGCC TGGGCGGCAA CTCCTTCTGT CCGCCGCTGG ATCAAGAGGT CTACGACGAC GTCACCGCCC GGGGCACGAT CGTGGTCGTG GCCGCCGGCA ACGAGAACCA GAACGCCGCC AACGTCACCC CGGCGAGCTG CGCCAACGTC ATCACCGTGG GCGCCACCGA CTTCTCCGGG CGGCGCGCGC CGTACTCCAA TTTCGGCACC GTGGTCGACG TCATGGCTCC GGGCGGTGAC CTGGGCCGCG ACGACAACGG CGACGGCGAC GGCGACGGCG TGGTCAGCCT CGGCTTCAAC GATCTGACCC GGCAGTTCAG CGTGCAGAGC CTGCAGGGCA CATCCATGGC GGCGCCGCAC GTGGCCGGCG TGGTCTCGCT GATGCGCGCG CTGCGCCCCG ACCTGAACAC CCAGGACGCG GTCGCCATTC TCCGCGGCAC GGCCAACCAG GTGTCGGCCG TCGACTGCGG GCGCGCCTCC AGCAGCGAGT GCGGGGCCGG CCTGATCGAC GCCGAGGCCG CCCTGATCAA CGTCGATGGC GGCCTGCCGC CGCCCAGCAA CGGCCCGCTG GCCTTCAACC CCAACCCCGT GGACTTCGGC TCGTCGGCAA GCGAGCTGAG CGTGACCATG ACCAACGTGT CCACGCAGCC GCTGAGCTGG TCGATCAACT CGTTCGAGAC CTCGTCGAGC AACCCGGTCG CGCTGGCGCA GGGCACCTTC TACTTCGCCG CGGGCGCGAC CACCAGCGGC AGCCTGGATG TCGGACAGTC GGCCCAATTT ACCCTCGGCG TGGCCCGCGA TTCGGTCTCG GTGCCCGGCA ACTACGCGGC CGAGCTGATC TTCGAGCTCG GCGGCGAGGA GCAGCGGCTG CTCACGCGCT TCAGCACCTT CCCCGAGGAC ATCGAAGGCC CCAGCGGCCC CACCGTGGTC GGCGCGTTCA TCGCCGACGC CACGGGCAAC CCGCAGCTCA TCGCCTCGAA GGAGGAGACG CAGTTCTTCT CGTCGTACAA GCTGTACACC GAGCCCGGCG AAAACGTGCT CATCGCCTGG TCGGACGACA ACGGCAACTT CGAGATCGAC GAGGGCGATC ACCTGGGCGT CGAGCCCAGC GTGCTCATCG CCGAGGACCA GCAGATCGGC GGGGTCAACA TCGAGATGAG CCAGGTGCTG AACACCGCCG CGCTGCCGGC GCCGCTGTCG CCGGGTGTGC TGCGCGCGCT CGAGGCCATG GCGCCGCGTC CCTGA
|
Protein sequence | MRRYFPGVVR ALFISMLAAV VGCGGESGTI ISGKLETLSL SGASEAPPVG TALSNRGDAA PTAVAAAAVA ELHSAEARLQ ASRDVMLATL DDAEFVPGEV IVAFREDGLF GPLQAQAEMQ VAGSLLQAVE PMAVANAHVY RAADSDRKRT IEMIRELNQR EDVRYAQPNY IYRALRTPND LDAKGQWHYP AINLPAAWDL TIGSSDTVVA VVDTGILFDS RNAGANHPEL VGKVLPGFDF IDDLNVGGDG DERDDNPFDV GDNPGGQSSY HGSHVAGTIA AATNNGVGVA GVNWSANILP VRALGVGGGG SSRDILQGAL WAAGFSIAGV PDNRNQADVI NLSLGGNSFC PPLDQEVYDD VTARGTIVVV AAGNENQNAA NVTPASCANV ITVGATDFSG RRAPYSNFGT VVDVMAPGGD LGRDDNGDGD GDGVVSLGFN DLTRQFSVQS LQGTSMAAPH VAGVVSLMRA LRPDLNTQDA VAILRGTANQ VSAVDCGRAS SSECGAGLID AEAALINVDG GLPPPSNGPL AFNPNPVDFG SSASELSVTM TNVSTQPLSW SINSFETSSS NPVALAQGTF YFAAGATTSG SLDVGQSAQF TLGVARDSVS VPGNYAAELI FELGGEEQRL LTRFSTFPED IEGPSGPTVV GAFIADATGN PQLIASKEET QFFSSYKLYT EPGENVLIAW SDDNGNFEID EGDHLGVEPS VLIAEDQQIG GVNIEMSQVL NTAALPAPLS PGVLRALEAM APRP
|
| |