Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hoch_1475 |
Symbol | |
ID | 8543857 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | + |
Start bp | 1998027 |
End bp | 2000786 |
Gene Length | 2760 bp |
Protein Length | 919 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 646386186 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003265921 |
Protein GI | 262194712 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTAT CTACCAGCGT CTGTCCTCTG CTCTTGGCTG CGGCACTGTG TGGCTGTTCT GGCGATGACG GGCCGAGCGA GCCCGCTCCG ACCGTGGACG CCGGCGTGCC GGCCGCGCCC CTCACCCAGG AGGTGCTCGA CCGCATGGCG CGCGATATCG AGCTGCGCTA CGAGCTCCTC GAAACCCACG GCGCGCAGGC CGGCGTCGAC TGCAAGGCGC TCGCGGGCGA TTACGCGAGC TGCCACACCG CGCGCCTGGT CCTGAGCAAC CGCGGCGAGG AGGCGCTGCC GTCCGGCGGC TGGACGCTCT ACTTCCACAG CATCCGCCGC GTCCTGTCGG TCGAGGGCGA GGACTTCACG ATCTCGCACA TCAACGGCGA CCTGCACGCG CTGCGCCCGA CGGCCTCGTT TGCCGGTTTT GCCGCCGGCG AGGCGTTGAA CGTGGCCTTC GTGGCCGAAT ACTGGACGCT GTATCAATCG GATTTCATGC CGCGCTACTA CCTGAGCGCG GACGGGCTCG ACAGCCGCAT CATCGCCAAC ACCGACAGCG ACGACAGCGA CGCCTACGCC GCGCCGATCA CGGGCGAGAA CCGCAAGCGC ACACCGCAGG ACAACAACGT CTTCGCCACC GCGCAGACGC GCTACGACGA CAACGCCTCG CTGACATTGC TGGCGGCCGA GGAGGTGGCG CGGACGGTGA TTCCCACGCC GGTGTCGCAG GAGCACCCGG AGGCGGGCGC GCTCGACCTC TCGGTCGGCG TCGACATCGC GGCCAGCGAG GCGCTGAGCC AGGCCAGCGT GGACGCGCTG GCGCAGCGCC TGAGCCTGCT CGGCGTCGAG CGCGCGGATG GCGGCGTGAG CGTGAGCGTG AGCGTGGACG CCGCGCACTT CGAGGGCGAG GACGCGGCGT ACGCCAGCGC GGGCGGCTAC GAGCTGCGCG TGGGTCCCGC GGACGGCGGC GGTGGCGACG ACATCGAGAT CATCGGCTAC GACCAGGCCG GCGCGTTCTA CGGCGTGCAG TCGCTCATGG CCCTGTTGCC GGCCGGCGGG GACGGCGGCG ATGGCGCGCT CAGCGTGCCG CACGCGCTGA TCAAAGACGC GCCTCGCTTC GCGCATCGCG GCGTGCTGGT CGACGTCGCG CGCAACTTCC GCAGCAAGCA GGTGCTCCTG CGCTTCATCG AGCAGATGGC CGCGTACAAG CTCAACCGCC TGCACCTGCA CCTCAGCGAC GACGAGGGCT GGCGGCTGGC GATCGACGGG CTGCCCGAGC TCACCGAGGT CGGCGGCCGC CGCTGCCACG ACCTCGACGA GCGGCGCTGC CTGCTGCCGC AGCTCGGCTC GGGTCCCTTC GACGACAACG CCGGCAGCGG CTTCTACAGT CGCGCGGACT ACATCGACAT CGTCCGCCAC GCGCAGGCGC ATTTCGTCCA GGTCATCCCC GAGATCGACA TGCCCGCCCA CGCGCGCGCC GCCATCGTCG CCATGGAGGC CCGCTACGAG CTGCTGGCCG CCAGCGACCA GGACGCGGCC GCCGAGTACC GGCTGCGCGA TCCCGACGAC ACGTCCAACT ACACCTCGGT GCAGTTCTAC GACGACAGCT ATCTCAACCC CTGCCAGCCC TCGACCTACC GCTTCGCCGA CAAGGTCATC GGCGAGGTGC AGGCCATGCA TCTCGAGGCC GGGCAGCCGC TCGCGGCCTG GCACATGGGC GGCGACGAGG CCAAGAATAT CCACCTCGGC GGCGGCTACG AGGGCCTCGG CGGCACCACC GAGTGGAAAG GCAGTATCGA CCAGTCGGCC GAAGACCTGC CCTGGGCCAA GTCGCCCCGG TGTCAGCAAC TGCTCGCCGA CCAGCCCGCG ATCGGCGGCG TCGGCGATCT CGGCGCGTAC TTCGCGCGCC GCATGGGCGA GATCGTGGCC GCGCGCGAGA TCGCGACCCT GGTCGCCTGG CAAGACGGGC TCAAGGGCAT CGACTCGGCC GGCGAGCTGG CGGTCGCCGA GGTCATGATC AACTTCTGGG AGACCCTGTA CTGGGGCGGC TTCGAGAGCG TGCACGCGTG GCTCGAGAGC GGCTTCTCGG TCGTGCTGTC CAATCCCGAT TATCTCTATT TCGACTTCCC GTACGAAGTC GATCCGGCCG AGCGCGGCTA CTACTGGGCC GCGCGCGCCA TCGACGCCAA GAAGGTGTTC ACCTTCAGCC CCGGCAACCT GGCGCAGAAC GCCGAGACCA GCCGCGACCG CGACGGCAAC CCCTTCGCCG CCACCACGGC GGTCGCCGAG GCCGCGTTCA CCGGCATCCA GGGCCAGCTT TGGGGCGAGA CCATCCGCAG CGACGCCCAC TTCGAGTACA TGGCCTTTCC GCGCCTGCTG GCGCTGGCCG AGCGCGCCTG GCACCGCGCG AGCTGGGAGC TCGAGCGCGA TGTCGGACAG AGCTTCGAAG CCGGCGTGAG CGAGCACGTG GACACGGCCG CGCTCGCGTT CGACTGGAAC CGCTTTGCCA ACGCGCTGGG CCAGAAGGAG CTGGCCAAGC TCGACGCCGC CGGGGTCGCC TACCGCGTCC CGGTGCCCGG CGCGACCCTG GTCGAGGGCA AGCTGTCGGC CAATGTCGAG TTCCCCGGGC TGCGCATCCA GTACCGCGAC GCGGCCGGGA GCTGGCAGCT CTACGACGCC GAGGCCCAGC CCGCCGCGAG CGCGAGCACC GAGCTGCGCG CGCTGGCCAC CGACGAGCGC GCGGGCCGCG CTGTGACGGT CGCGCCGTGA
|
Protein sequence | MKLSTSVCPL LLAAALCGCS GDDGPSEPAP TVDAGVPAAP LTQEVLDRMA RDIELRYELL ETHGAQAGVD CKALAGDYAS CHTARLVLSN RGEEALPSGG WTLYFHSIRR VLSVEGEDFT ISHINGDLHA LRPTASFAGF AAGEALNVAF VAEYWTLYQS DFMPRYYLSA DGLDSRIIAN TDSDDSDAYA APITGENRKR TPQDNNVFAT AQTRYDDNAS LTLLAAEEVA RTVIPTPVSQ EHPEAGALDL SVGVDIAASE ALSQASVDAL AQRLSLLGVE RADGGVSVSV SVDAAHFEGE DAAYASAGGY ELRVGPADGG GGDDIEIIGY DQAGAFYGVQ SLMALLPAGG DGGDGALSVP HALIKDAPRF AHRGVLVDVA RNFRSKQVLL RFIEQMAAYK LNRLHLHLSD DEGWRLAIDG LPELTEVGGR RCHDLDERRC LLPQLGSGPF DDNAGSGFYS RADYIDIVRH AQAHFVQVIP EIDMPAHARA AIVAMEARYE LLAASDQDAA AEYRLRDPDD TSNYTSVQFY DDSYLNPCQP STYRFADKVI GEVQAMHLEA GQPLAAWHMG GDEAKNIHLG GGYEGLGGTT EWKGSIDQSA EDLPWAKSPR CQQLLADQPA IGGVGDLGAY FARRMGEIVA AREIATLVAW QDGLKGIDSA GELAVAEVMI NFWETLYWGG FESVHAWLES GFSVVLSNPD YLYFDFPYEV DPAERGYYWA ARAIDAKKVF TFSPGNLAQN AETSRDRDGN PFAATTAVAE AAFTGIQGQL WGETIRSDAH FEYMAFPRLL ALAERAWHRA SWELERDVGQ SFEAGVSEHV DTAALAFDWN RFANALGQKE LAKLDAAGVA YRVPVPGATL VEGKLSANVE FPGLRIQYRD AAGSWQLYDA EAQPAASAST ELRALATDER AGRAVTVAP
|
| |