Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1990 |
Symbol | |
ID | 4070896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2384116 |
End bp | 2385234 |
Gene Length | 1119 bp |
Protein Length | 372 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984004 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_591065 |
Protein GI | 94969017 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.639682 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTCGGCTCCA GGAACTCCAG CAGCAAGTCG GCCAGCTAAT GATCGTCGGC TTCGACGGCA CCGAGATGTC TGCGCGCGTA CGCACGCTTC TCGCTACGAT CCAGCCGGCG GGTACCATCT TCTTCAAGCG CAACGTAGCG ACTGCCGAGC AGACATGGAA GCTCAACTAC GAGGCGCAGG CGGCCGTTTC CACACCGCTC TTCCGTTGCG TTGACCTCGA AGGCGGCACC GTCGATCGCC TTCGAGACGC AGTCGCTCCT GCGCCGTCGC TCTCTAATGT GGCGGCGACC GGATCAAAAA AAGTCATGCG TCGCTTTGCT CGGACGCTCG CAGCAGAAGC TCGCGCTCTC GGATTCAACA CTGACTTCGC TCCGGTCTTC GACCTGCGCA CAGTCGAATC AGTCAAGGTT CTCGCCGGCC GAACGATCGC AGCCGATCCC AAGCACATCA TCGAACTCGC CAGGGAGTTC TTGAAGGGCT TCAAAGACGA AAACGTTCTC GGTTGCGGCA AGCATTTTCC CGGCCTCGGC GCGGGTGCTG TCGATTCCCA CTACGAGCTG CCAACCATTA GCAAGCCCTG GAAGGCGTTA TGGGAAGAGG ACCTGCTTCC CTATCGCAAG CTTAAAGACG AGATCGCCTT TGCGATGGTC GCGCACTGCG TTTACCCGAA CGCTACGAAA GAAAAGGCCC CCGCTTCCAT CTCCCGTTTC TGGATGACAG ACATCCTGCG CAAGAAGATC GGATTTAAGC ACCTCATCTG TTCCGACGAC ATGGAGATGA AAGGTGTTCA AAAAGCGGTT TCGATCGAAG AAGCCTGCAT CCAGGCAGTC CGCGGCGGCG CCGATCTTTT TCTCGTCTGC AACAACGAAT CGCTTGTGTG GCGTTGCTTT CACGCTGTGC TGCGTGAAGC CGAACGCGAC AAATCCTTTG CGAAACAAAT CGCAGCCGCG TCTCGCCGCG TGTTCGAGTT CAAGAAACGT TCGCGGGCCG TGCGAGCCAA GTTCAACCCT GCGCCGACTC TCCGCACCGT AGACAAGCTT CGTCGCACGA TCTGGGAACT CACCGAAGAA GTTCGCTACA GCAGCCCCAA TCCGGAGCGG GCCCTTTGA
|
Protein sequence | MASTRLQELQ QQVGQLMIVG FDGTEMSARV RTLLATIQPA GTIFFKRNVA TAEQTWKLNY EAQAAVSTPL FRCVDLEGGT VDRLRDAVAP APSLSNVAAT GSKKVMRRFA RTLAAEARAL GFNTDFAPVF DLRTVESVKV LAGRTIAADP KHIIELAREF LKGFKDENVL GCGKHFPGLG AGAVDSHYEL PTISKPWKAL WEEDLLPYRK LKDEIAFAMV AHCVYPNATK EKAPASISRF WMTDILRKKI GFKHLICSDD MEMKGVQKAV SIEEACIQAV RGGADLFLVC NNESLVWRCF HAVLREAERD KSFAKQIAAA SRRVFEFKKR SRAVRAKFNP APTLRTVDKL RRTIWELTEE VRYSSPNPER AL
|
| |