Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_2166 |
Symbol | |
ID | 8544552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 3011100 |
End bp | 3013964 |
Gene Length | 2865 bp |
Protein Length | 954 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 646386873 |
Product | Peptidoglycan-binding lysin domain protein |
Protein accession | YP_003266604 |
Protein GI | 262195395 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02594] conserved hypothetical protein TIGR02594 |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.16902 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTAACA GCGGCTACCA AAAACGCATT CAGAATCGCG GCAGCGATGC CGCCGATCCC CTTCAGCAGA AGGTCGCTCC CGGCAAGGTG ACGCGCACCA GCCGGATGGC GTCGCCCGTG CAGCGGAAGC CGCAACAAGA CGGACCGAGC GCCGGCCGAA CCGAAGTTGC CAACAGCGCG ACCGTGTTCC GCCACGGCGG CGGGGCGGTG GACCTGGGTG GTTCCTCGGC GGCTGAAGTC GCCGAGAGCG GTTTCTCCGG CGGCGCCTCG GATGTGCCCT ACCGGGCCGA GATGGAGCGC AGCTTCGGCA CCTCCTTCAG CGACGTCCAG GCCTACAGCG GCGGCGACTC GCAGGGCGCG GCGACGCAGC TCTCGGCGCA GGCGTACACG GTCGGCAACC GGGTTGCGTT CCGCGATTCC AATCCCAGCC GCGAGCTGGT CGCGCACGAG CTGGCGCACG TGGTCCAGGG CCGCGGCGGC GAGGTGCAGG CCAAATCGGA GATGAGCCAG CCGGGCGATT CGCTCGAGCG CGAGGCCGAC TCCGTGGCCG CTCGCGTCGC CAGCGGCGAG AGCGTCCAGG ACCACACCGC GCGCTACGAC GGCGCCGGCG GGCCGGCGCT CGGCTCCGCG CCCATGCGCC TGGTCGATTC CGACGCCGCG GCCAGCGAGA CCACGGCCGC GCCCGCTGCC GACGCCGCCG CCGACGCCGC GCCCGCCGCG CTGCTCACGC CCGAGCAGGT GCAGTCCGCC ATCGCCTTCT ACGGCGCCGA CCGCTGGCCC GCGCCGAAGA TCGCCCAGGT GTCCGAAAAA CTCGGCATCG CGGTCCGCGA GCAGGTCGAC GAGCAGTTCG TGCAGGCCGC GGCCGGGTTT CAGCAGCAGC GCAGCCTCAC CGTGGACGGC ATGCTCGGCA GCGCCTCGCT GATGCCGCTG TTCGAGGGCG AGGAGATGGA CCGCGCCCAC ACCATGGCCA ATCGCATCAC GGCCGAGTAC GAGAGCTCGG GCAACTATGG TGTGGTGCAG AACGCCGACG TCGGCATCAT CAGTTATGGC GCCCATCAGA GCACGCTGCA CTCGGGCAAC CTGGGCCGCA TGCTGCAGGA TTATCTCGAC CGCGTGGCCG CCGCCGAGGA GCCGACCCAG GCGTCGCAGA CCATCCAGAG CTACATGGGC CGCATCAACG ACCGCAGCCA GTGGGAGTCG CTGCGCAACG AGGGCCCGCT GCTGTCGGCG CTGCGCGCTG CCGGCTCCGA GCAGATCATG CAGGACGCGC AGAACGCGTT CTTCAGCGAG GACTTCTGGG TGCCCGCGGT CAAGGCCGCG CTCAATCACG GCATCACCTC GCAGCTCGGC TACGCGACGC TGTACGACGC CAAGATCCAG GGCGGCATGG AGGATTCGCT GCAGCGCGCG ACCAGCGCCA TGGGCGGCAT CGTCGGCGCC ACGGTCGAGC GCAACGGGCG CTCGCAGCAG GTCACCGAGG CCGAGTTCCT GGTCGCCTTC AACGAGGCTC GCGAGGGCCG TCTGGAGCGC ATCGCGGTGC GCCGCGACGG CCAGGGCAAG CGCCGCGACG CCGAGATGCT GCGCAACTCC AAGGTGCGGC CGCAGGCCTT CGAGGAGCTG GCCCGCGATG GCAACCTCGA CCTCTCGGCC AACGTCGACG GCGAGAACAG CCTGGAGTTC CGCACCTACG GCGGGCGCCG CACCGAGGTC GATGTCCCCG AGGAGGTCGG CACCACGCCG GCTACGGGCG ACACCGAGGC CGGCACGGGC ACCGCGCCGC CGACCACGCG TCCGACTCCC GAGGTGACGC CGACTCCCGA GGTGACGCCG CCGACCACGA CGCCCGAGGT GACGCCGCAA CCGCGGCCGA ATCAGCCCAG CGCCTCCGAG TACACGGTGA AGTCGGGTGA CACGCTGAGC GCGATCGCCG GCCGGCTGCT CGGCGACCAG GACCGCTGGC GCGAGATCGC CACGCTCAAC GGCATCACCA ACCCGCGCGC GCTGCGCGTC GGTCAGGTGC TGCAGGTGCC GTCGTCGAGC GAGTCCGCGG CGCCCGAAGG TGGCCAGAGT GAGCCCGAGG CGCCCGCGGC CGCACCGGTC GAGACCGCCT ACGTGGTCCG CTCGGGCGAC ACGCTGGGCT CCATCGCGGC GCGCTTCCTC GGCAGCACCA ACCGCTGGCG CGAGATCGCC ACGCTCAACG GCATCAGCGA CCCGCGTCGG CTGAGCGTCG GCCAGCGCCT GCGCATCCCC ACCGGCGGCG CGCAGCAGGC CACGCCCGAG CCCGAGCAGC AGCGGCCCGA GCAGCAGCGG CCGGAGCAGC AGCGGCCCGA GCAGGGCGGA GGCGGCGCCG GTGGTGGCAG CAGCCCCGAG GCCGGCAAGC CGTCGTGGAT CTCGGTCGCC GAGGGCGAGC TCGGTGTCCA GGAGATCGTC GGCAGCCGCC ACAACCCGCG CGTCATCGAG TACCACTCGA CCACCGGGCG GTTTTCGGAC GACGAGACGC CCTGGTGTGC ATCCTTCGTC AACTGGGTGC TTCAGCAGGC CGGCCAGTCC GGCACCGGCA GCGCCCGCGC GCTGTCCTTC GAGAGCTACG GCACCACGCT CGACCGTCCG GCTTACGGCA GCATCGCCGT GCTCGCCTAC GGTGGCGGCC GCGGCCACTG CGCCTTCGTG GTCGGCAAGC AGGGCGACCG CATGTTGCTG CTCGGCGGCA ACCAGAGCAA CGGCGTCAAC ATCAAGTCCT TCGGCACCTC GCAGATCGTC GCCTACGTGG TGCCGCCCGG CTACCAGCCG CCGCCGAGTG CGTTCGCGCT CGATGGTGCC ACCGGCGAGG TCGGTGAGGG CGGCGGACTC AGCGACACCC GCTGA
|
Protein sequence | MSNSGYQKRI QNRGSDAADP LQQKVAPGKV TRTSRMASPV QRKPQQDGPS AGRTEVANSA TVFRHGGGAV DLGGSSAAEV AESGFSGGAS DVPYRAEMER SFGTSFSDVQ AYSGGDSQGA ATQLSAQAYT VGNRVAFRDS NPSRELVAHE LAHVVQGRGG EVQAKSEMSQ PGDSLEREAD SVAARVASGE SVQDHTARYD GAGGPALGSA PMRLVDSDAA ASETTAAPAA DAAADAAPAA LLTPEQVQSA IAFYGADRWP APKIAQVSEK LGIAVREQVD EQFVQAAAGF QQQRSLTVDG MLGSASLMPL FEGEEMDRAH TMANRITAEY ESSGNYGVVQ NADVGIISYG AHQSTLHSGN LGRMLQDYLD RVAAAEEPTQ ASQTIQSYMG RINDRSQWES LRNEGPLLSA LRAAGSEQIM QDAQNAFFSE DFWVPAVKAA LNHGITSQLG YATLYDAKIQ GGMEDSLQRA TSAMGGIVGA TVERNGRSQQ VTEAEFLVAF NEAREGRLER IAVRRDGQGK RRDAEMLRNS KVRPQAFEEL ARDGNLDLSA NVDGENSLEF RTYGGRRTEV DVPEEVGTTP ATGDTEAGTG TAPPTTRPTP EVTPTPEVTP PTTTPEVTPQ PRPNQPSASE YTVKSGDTLS AIAGRLLGDQ DRWREIATLN GITNPRALRV GQVLQVPSSS ESAAPEGGQS EPEAPAAAPV ETAYVVRSGD TLGSIAARFL GSTNRWREIA TLNGISDPRR LSVGQRLRIP TGGAQQATPE PEQQRPEQQR PEQQRPEQGG GGAGGGSSPE AGKPSWISVA EGELGVQEIV GSRHNPRVIE YHSTTGRFSD DETPWCASFV NWVLQQAGQS GTGSARALSF ESYGTTLDRP AYGSIAVLAY GGGRGHCAFV VGKQGDRMLL LGGNQSNGVN IKSFGTSQIV AYVVPPGYQP PPSAFALDGA TGEVGEGGGL SDTR
|
| |