Gene Information |
Locus tag | Caci_4251 |
Symbol | |
ID | 8335605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Catenulispora acidiphila DSM 44928 |
Kingdom | Bacteria |
Replicon accession | NC_013131 |
Strand | + |
Start bp | 4825931 |
End bp | 4827517 |
Gene Length | 1587 bp |
Protein Length | 528 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644957354 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_003114956 |
Protein GI | 256393392 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
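The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the CDS ends in a stop codon (both assumptions are consistent with the stated lengths but are not stated by the record itself). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length(4825931, 4827517)
print(length, protein_length(length))  # 1587 528
```

Both results agree with the Gene Length (1587 bp) and Protein Length (528 aa) rows.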
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.115163 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.185001 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTTCCGGC GCGCGCGGCT GGCGCTGCTG GCGGCCGGGG TCGCGCTCGG CATGACCGTG ACCGGTGCGG CCGGCGCGGC CGGGGCTTCG TCCTCGGCCG CTGCCGCCGC CTCGTTCCCG AACGTCGTGG TCCCGGTGCC GGTGTCGGAG ACCGCGAACG GCTCGACCTT CACGCTCGCC TCGGCGGCGA CGCTGACGGC CGACGACGCG AACGTCGGCG GCTACCTGGC CGGGATCCTG CGGGCCTCGA CCGGGTACGC ACTGCCGCTT ACCGTCGGCG CCGCGGCGCC CGGCACGATC GCGCTGTCCC TGTCCGGTGC GCCGGCCACG GTCGGCGCCG AGGGGTATCA GCTCACGATC AAGGCGAGCT CGGTGCTGCT CCAGGCAAAC TCGGCGGCGG GGTTGTTCCA CGGCGTGCAG ACGTTGCTGC AACTGCTGCC GGCTCAGGTG ATGAGCCCGG CGAAGGTGAC TTCGGTGGCG TGGAAGGCGA CCGGCGGCAC GATCCTGGAC TATCCGCGCT TCGGGTATCG CGGGGCGATG CTGGACGTGG CGCGGCACTT CTTCACCGTC GCGCAGGTCG AGCACTACAT CGACGAACTG TCGCTGTACA AGGTGAACTA CCTGCATCTG CACCTGTCGG ACGACCAGGG ATGGCGCATC GCGATCAACT CCTGGCCGAA CCTGGCGACC ACCGGCGGCT CCACCGAGGT CGGAGGCGGC GCCGGCGGCT ACTACACGCA GGCGGACTAC ACCACGATCG TGAACTACGC CGCGTCGCAC TACATGACGC TGGTCCCCGA GATCGACACG CCGGGTCACA CGAACGCCGC GCTCGCCTCG TACGCGGCCT TGAACTGCAA CGGGGTCGCG CCGCCTCTGT ACACCGGGAC CGACGTCGGC TTCAGCTCGC TGTGCGTCTC GCTGCCGCTG ACGTACACGT TCCTGGACCA GGTCGTCGGC GAGCTCGCGG CACTGACTCC GGGCCCTTAC ATCCACATCG GCGGCGACGA GGCCAGCTCC ACGTCGCAGA GCGACTACAC GTCCTTCATC ACCAAGGCGC AGCAGATCGT GGGCAACCAC GGCAAGGCGG TCATGGGCTG GCACAACATC GCCGCGGCCA CCCTGGCGCC GTCCACGCTC GCGCAGTTCT GGGACACGAC GAAGTCGAAC TCCGCGCTGG CTGCCGCGGC GGCTAAGGGC ACGAAGATCG TCATGTCCCC GGCGAACCAC GCCTACCTGG ACATGAAGTA CACCAAGAAG ACGACGCTGG GCCAGAACTG GGCCGGCTAC GTCGACGTCA ACGCGGCCTA CGGCTGGGAC CCGGGGAACT ACCTGTCAGG CGTCAGCGCC TCGGCGATCG CCGGCGTCGA GGCGCCGCTG TGGTCCGAGA CGCTCGTCAC GTCGGCGAAC ATCGACTACA TGGCCTTCCC GCGCCTTCCC GCGCTGATGG AGCTCGGATG GTCGCCCGAA TCGACCCACA ACCAGACGTC GTTCGACGCC CGGCTCGGCG CGCAGGGACC CCGGTGGCAC GCGATGGGGG TGGATTACTA CAAGTCGACG CAGGTCAAGT GGCCGAGCGG GTCGTGA
Protein sequence | MFRRARLALL AAGVALGMTV TGAAGAAGAS SSAAAAASFP NVVVPVPVSE TANGSTFTLA SAATLTADDA NVGGYLAGIL RASTGYALPL TVGAAAPGTI ALSLSGAPAT VGAEGYQLTI KASSVLLQAN SAAGLFHGVQ TLLQLLPAQV MSPAKVTSVA WKATGGTILD YPRFGYRGAM LDVARHFFTV AQVEHYIDEL SLYKVNYLHL HLSDDQGWRI AINSWPNLAT TGGSTEVGGG AGGYYTQADY TTIVNYAASH YMTLVPEIDT PGHTNAALAS YAALNCNGVA PPLYTGTDVG FSSLCVSLPL TYTFLDQVVG ELAALTPGPY IHIGGDEASS TSQSDYTSFI TKAQQIVGNH GKAVMGWHNI AAATLAPSTL AQFWDTTKSN SALAAAAAKG TKIVMSPANH AYLDMKYTKK TTLGQNWAGY VDVNAAYGWD PGNYLSGVSA SAIAGVEAPL WSETLVTSAN IDYMAFPRLP ALMELGWSPE STHNQTSFDA RLGAQGPRWH AMGVDYYKST QVKWPSGS
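The sequences above are displayed in space-separated groups of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC fraction calculation. The helper names are hypothetical, and only the first three groups of the gene sequence are used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace used for 10-character display grouping."""
    return "".join(grouped.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base groups of the gene sequence in this record.
prefix = clean_sequence("ATGTTCCGGC GCGCGCGGCT GGCGCTGCTG")
print(len(prefix), prefix[:3])  # 30 ATG
print(gc_fraction(prefix))
```

Applied to the full Gene sequence field, `len(clean_sequence(...))` can be compared against the Gene Length row, and `gc_fraction(...)` against the GC content row.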