Gene Information |
Locus tag | Hoch_5061 |
Symbol | |
ID | 8547472 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 6979771 |
End bp | 6981738 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646389737 |
Product | Thimet oligopeptidase |
Protein accession | YP_003269442 |
Protein GI | 262198233 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0339] Zn-dependent oligopeptidases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCAG CGACTGATAC TCCCGCCGCG TCCCCGTCCG CCGAGGCATT CGCCGCGCGC TGCGACGAGG GCTTGCAGCG CGCCAAGGCG CTGCTGCCGA CGATCATCGA CGTCTCGGGT CCGCGCACCC TGGAGAACAC GCTCGAGCCG TACAACGACA TGCTGATGCA TGCCGAGCGC ACGGCCGGCC TGGCCGGTCT GATGTCATCC GTCCACCCGG ACGAAGTCAT CCGAGAGGCC GCGCGCGCCA GCGAACAGGC CGTGGACGCC TTCATGTCGG GCCTCAGCCG CGACCGCCGC CTGTACGAGG CCATCGCCGA GCTGGCCAAA TCGCCCGCGC ACGACTCCTT CGACGCCGAG ACCCAGCGGC TCATCGAGCG CGGCCTGCGC GACTTCCACC GCAGCGGCGT CGACCGCGAC GAGGCCACCC GCGAGCGCTT GCGCGCGATC GACGACCAGC TCACCAAGCT CGGACAGGCG TTCCAAAAAG CCATCGTCGA CGACGTCCGC CACGTCGATG TCAACGATGT CTCGGAGCTG GCCGGATTGC CCGAGGACTT CGTGCGCTCG CATCCGCCGC AGGAGGACGG CCGCATCCGC ATCTCGACCG ACTTCCCCGA CTACCTGCCG GTGATGGCCT ACGCCGAGAA CGGCGACCTG CGGCGCGCGC TGTACGTCTG CTACAAGGCC CGCGGCGGCG CGGCCAACGA GGCCACGCTC AAGCAGATCC TGGTGCTGCG CGCGGAAAAA GCCAAGCTGC TCGGCTACGA GAGCTGGGCC GACTACGCCA CCGAGAACAA GATGATGAAG AGCGGCGCCA ACGCCGCCGC GTTTATCGAC CGCGTGGCCA AGATCGCTCG CCCGCGCGCC GAGCGCGACT ATCAGGAGCT GCTCGCGCGC AAGCGCAAGC AGCACCCCGA TGCCCAGGGC GTGGACGACT TCGAGAAATA CTTCTACGAG ACCAAGGTCA AAACCGAGTC GTACGCCTTC GACCCGCAGT CGGTGCGCCC GTACTTCGAG TACGCTCGCG TCGAGCGCGG CCTGCTCGAT ATCACCGCGC GCATCTACGA CATCGCCTAC GAGCCGGTGG CGCCCGAGCA GGTCGCCGAG CTGTGGCACC CCGACGTCAA GGTCTTCGAC GTCACCCGCG AGGGCGAGGT GCTCGGCCGC ATCTTCCTCG ACATGCACCC GCGCGAGGGC AAATACAAAC ACGCGGCCCA GTTCCCCTAC CGCTCGGGCG TACGCGGCAA GCAGATGCCC GAGGGCGTGC TGGTGTGCAA CTTCCCCAAC CCGCGCACCA CCGACGGTCC GGCGCTGATG GAACACCCCG ACGTGGTCAC CATGTTCCAC GAATTCGGAC ACCTGATGCA CCACGTGCTC GGCGGCCACA AGCGCTGGAT CGAGCAGAGC GGCGTGGCCA CCGAGTGGGA CTTCGTCGAG GCGCCCTCGC AGATGTTCGA GGAGTGGGCG TGGTCCCACG AGACCCTGGC CACCTTTGCC CTGCACCACG AGACCGGCGA GCCCCTGCCC GAGGAGACCG TGCAGCGCAT GCGCCGGGCC GACACCTTCG GCCTGGGGCT CAACACCATG CAGCAGATGT TCTACGCGTC CATCTCGCTC GAGTTCCACA CCGTGGCTCC CGAAGAGCTC GACATGAACG CGGCCGTACA GCGGCTACAG AGCGCGTACA CGCCCTTCCC CTTCGTCGAG GGCACCTGCT TCCACGCCAA CTTCGGACAC CTGAACGGTT ACTCGGCCCT GTACTACACG TACATGTGGT CGCTGGTGAT CGCCAAGGAC 
CTGCTCACGC CATTCAAAGA GCACGGCCTG CTCAATCTGG AGTGGACGCA CCGCTATCGC GACCACGTCC TGGCGCCGGG CGGCAGCAAG GACGCCGCCG TGCTGGTCGA GGAGTTTCTC GGCCGCACGT ACAAATTCGA CGCCTTCGAA GCCTATCTCG CCGGCTGA |
Protein sequence | MPPATDTPAA SPSAEAFAAR CDEGLQRAKA LLPTIIDVSG PRTLENTLEP YNDMLMHAER TAGLAGLMSS VHPDEVIREA ARASEQAVDA FMSGLSRDRR LYEAIAELAK SPAHDSFDAE TQRLIERGLR DFHRSGVDRD EATRERLRAI DDQLTKLGQA FQKAIVDDVR HVDVNDVSEL AGLPEDFVRS HPPQEDGRIR ISTDFPDYLP VMAYAENGDL RRALYVCYKA RGGAANEATL KQILVLRAEK AKLLGYESWA DYATENKMMK SGANAAAFID RVAKIARPRA ERDYQELLAR KRKQHPDAQG VDDFEKYFYE TKVKTESYAF DPQSVRPYFE YARVERGLLD ITARIYDIAY EPVAPEQVAE LWHPDVKVFD VTREGEVLGR IFLDMHPREG KYKHAAQFPY RSGVRGKQMP EGVLVCNFPN PRTTDGPALM EHPDVVTMFH EFGHLMHHVL GGHKRWIEQS GVATEWDFVE APSQMFEEWA WSHETLATFA LHHETGEPLP EETVQRMRRA DTFGLGLNTM QQMFYASISL EFHTVAPEEL DMNAAVQRLQ SAYTPFPFVE GTCFHANFGH LNGYSALYYT YMWSLVIAKD LLTPFKEHGL LNLEWTHRYR DHVLAPGGSK DAAVLVEEFL GRTYKFDAFE AYLAG |