Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caci_4810 |
Symbol | |
ID | 8336164 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 5479676 |
End bp | 5481277 |
Gene Length | 1602 bp |
Protein Length | 533 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957910 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003115512 |
Protein GI | 256393948 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.48578 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGCGGA CCGCAGCGGC TCTCAGCACC TGCGCCGCCC TCCTTGCTCT GGCCGCGTGC GGCGGCGGGA GCAGCGGGAC GAAGAGTTCC GTCCCGCCTG CCAGCACGAA CGGCAGTAGT TCCGCGCCGT CAGTCCCCTT GTCGCCGGCA GATCCGGCCG CGCTGGAGCG GGTCGTTCCC GAGCCGGCCG GCATCACGGC GGCTGCCGGG ACGTTCACCC TGACGAGTGC CACCGCGATC CACGCCTCGT CCGGCGCCGA GCCGGTCGCC GCCGACCTCG CCGCGTATCT CAAGCAGCAG ACCGGGCTGG CGCCGGCTGT ATCGCAAAGC CCTGATGCGG CGATCCAGCT TGTGCTGCAG CCCAGCGGCG GCGATCCGTC GCTGGGCACC GAGGGCTACA CGCTGGTGAT CGGTCCGAGC TCGGTCAAGC TCACGGCGGC TACTGACGCA GGGCTCTTCC ATGGTGTGCA GACCGTGCGG CAGCTGCTTG TCGGCGCCAA GCTCCAGGAC GGGACGATCA CCGACCACCC GCGATTCGCT TATCGCGGCG TGATGTTGGA TGTGGCGCGG CACTTCTACA GCGTCGCGGA CGTGAAGGCT TATATCGACG CCGCCGCGTT GTACAAGGTC AACGAGTTCC ACCTGCACCT GACCGACGAC CAGGGCTGGC GGTTCGCCGT GCCGGGGTGG CCGAAGCTGA CGTCGGTGGG CGCGGCGACG CAGGTCGGCG GCGGCGTCGG CGGGTCGTAT TCGGCGGCTG ATCTGAAGGA GATCGTCGAT TACGCGGCGT CGCGCTACAT GACCGTGATT CCGGAGATCG ACATGCCGGG GCACGTCGGC GCTGCGGTGT ACGCCTACCC TTCGCTGGCG TGCGACGGTC GGCACCACGG TCCGGTGACG AGCGTATCGC CGGCGTACGA CTCGCTGTGC ACGTCGAGCG AATCAACATA CAGATTTGTC GATACAGCGA CCAAAGCCGC CGCCGACGCC ACCCCCGGCG CGACCTACCT GCACATGGGC GGCGACGAGG CGCAGGCACT GAGCCTGACG CAGTACAACG CCTTCGTCGC GAAGACACAG AATCTCGTGG CAGGGCACGA TCGCACGCCG ATCGCCTGGG CCGAAGCCGG TACCGCAACC CTGCTGCCGC AGACGGTGCT GGAGTACTGG AACACCGCGC AGCCGCAGCC CTACGTCCTC CAGGCCGCCG CCAAGGGCAC CAAGCTCATC ATGGCGCCGG GCAACCACGC CTACCTGGAC CAGCAGCCGG TCGCCGGATT CCGCGTCGGC CTGCACTGGG CCGGCTACGT GCCGGTGTCG AAGGCCTACG ACTGGGATCC GGTGACCGTC CTGCCCGGCA TCGCGCCCTC GGCGGTACTC GGCGTCGAGG CACCGCTGTG GAGCGAGACC GTGAAGAACC TCGCCGACGC CGAAACCCTC GCCTACCCCC GCCTCCCCGC CATCGCCGAA ATCGGTTGGT CGGCACCGAA CACCCACGAC TGGCAGCGAT TCTCGAAGAG GCTGGCAGCG CAGGCTCCCC TGTGGGACAA GCTGGGGATC GCTTACTACA AGTCGCCGGA AGTGCCTTGG GGGTCGGGGT AG
|
Protein sequence | MPRTAAALST CAALLALAAC GGGSSGTKSS VPPASTNGSS SAPSVPLSPA DPAALERVVP EPAGITAAAG TFTLTSATAI HASSGAEPVA ADLAAYLKQQ TGLAPAVSQS PDAAIQLVLQ PSGGDPSLGT EGYTLVIGPS SVKLTAATDA GLFHGVQTVR QLLVGAKLQD GTITDHPRFA YRGVMLDVAR HFYSVADVKA YIDAAALYKV NEFHLHLTDD QGWRFAVPGW PKLTSVGAAT QVGGGVGGSY SAADLKEIVD YAASRYMTVI PEIDMPGHVG AAVYAYPSLA CDGRHHGPVT SVSPAYDSLC TSSESTYRFV DTATKAAADA TPGATYLHMG GDEAQALSLT QYNAFVAKTQ NLVAGHDRTP IAWAEAGTAT LLPQTVLEYW NTAQPQPYVL QAAAKGTKLI MAPGNHAYLD QQPVAGFRVG LHWAGYVPVS KAYDWDPVTV LPGIAPSAVL GVEAPLWSET VKNLADAETL AYPRLPAIAE IGWSAPNTHD WQRFSKRLAA QAPLWDKLGI AYYKSPEVPW GSG
|
| |