Gene Information |
Locus tag | Caul_3122 |
Symbol | |
ID | 5900577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 3386385 |
End bp | 3387410 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641563625 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001684747 |
Protein GI | 167647084 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
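The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration that assumes the coordinates are 1-based and inclusive and that the CDS ends in a stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based, inclusive coordinates."""
    return abs(end_bp - start_bp) + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Caul_3122).
length = gene_length(3386385, 3387410)
print(length, protein_length(length))  # prints "1026 341"
```

The strand does not affect the length: a gene on the minus strand spans the same interval on the replicon.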
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 0.435185 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.194122 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCCAGCA TCTCCGCCGC CATCCTCGGC TGCGCCGGGA CCACCCTGAC GGCGGAAGAG GCCGCGTTCT TCCGGGACGT GAAGCCGTGG GGCTTTATCC TGTTCAAGCG CAACATCGCC GATCCCAATC AGGTCCGGGC CCTGACGGCG GCGTTGCGCG AGACAGTGGG GCGCCCCGAC GCGCCGATCC TGATCGACCA GGAGGGCGGC CGTGTCGCCC GCCTGCAGCC GCCGCACTGG AAGACCTATC CGCCCGGCCG AGCCTATGGC GAACTGGTGG CCAACGACCC GTTGGCGGCC CGCGAGATCA CCCGCCTGGG CGCGCGGCTG ATCGCCCACG ACCTGCTGGC GCTGGGGATC AATGTCGACT GCGTGCCGGT GCTGGACGTG CCCGATCCGC AGGGGCACGA GATCATCGGC GACCGCGCCT ATGGCGACAC GCCTGAGCAG GTGGCCACCC TGGGCCGCGC GGCGGCCGAG GGTCTGCTGG CCGGCGGCGT CCTGCCAATC ATCAAGCACA TCCCCGGCCA TGGCCGCGCC ATGAGCGACA GCCACCTGGA GCTGCCGGTC GTGAAGGCCA AGCTGGCCGA ACTGGACGCC CGGGACTTCG CGCCGTTCCG CGTGTTGTCC GACATGCCCA TGGCGATGAC CGCCCACGTC GTTTACACGG CCATCGACCG CCGCAATCCG GCGACGACGT CGCGCAAGGC GATCAAGAAA ATCATCCGCG AATCCATCGG CTTCGACGGA CTTCTGATGA GCGACGACCT GTCGATGAAG GCGCTGTCGG GCGACTTCAA GCAGCGCGCC AAGGCCAGTC TGTCGGCCGG CTGCGACGTC GTTCTGCACT GCAACGGCGA CATGGCCGAG ATGAAGGCGG TGATGTCGGG CGTCGGCAAG CTGTCGCGCG AGGCCAAGCG CCGGGTGCAG GCGGTCATGG GGCGGCTGGT CAAGGTTCCC GAGCCGCTGG ACGTGGCCGA GGCTCGCGCC CGCTTCGACG CGGCCTTCAA CGGCGAATTT GCGTGA
|
Protein sequence | MASISAAILG CAGTTLTAEE AAFFRDVKPW GFILFKRNIA DPNQVRALTA ALRETVGRPD APILIDQEGG RVARLQPPHW KTYPPGRAYG ELVANDPLAA REITRLGARL IAHDLLALGI NVDCVPVLDV PDPQGHEIIG DRAYGDTPEQ VATLGRAAAE GLLAGGVLPI IKHIPGHGRA MSDSHLELPV VKAKLAELDA RDFAPFRVLS DMPMAMTAHV VYTAIDRRNP ATTSRKAIKK IIRESIGFDG LLMSDDLSMK ALSGDFKQRA KASLSAGCDV VLHCNGDMAE MKAVMSGVGK LSREAKRRVQ AVMGRLVKVP EPLDVAEARA RFDAAFNGEF A
|
| |
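The protein sequence above can be checked against the gene sequence by translating it with translation table 11. The sketch below is a simplified, dependency-free illustration rather than the database's own pipeline: the names `translate`, `gc_fraction` and `START_CODONS` are hypothetical, the sequence is assumed to be given 5' to 3' on the coding strand, and only ATG, GTG and TTG are treated as initiator codons (table 11 permits a few more).

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base T, C, A, G;
# then second base; then third base). Table 11 shares these with table 1.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a reduced set of bacterial initiator codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spacing used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand CDS, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[pos:pos + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An alternative start codon such as TTG is read as Met when initiating.
        if pos == 0 and codon in START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence above.
print(translate("TTGGCCAGCA TCTCCGCCGC CATCCTCGGC"))  # prints "MASISAAILG"
```

The example shows why the protein begins with M even though the gene begins with TTG, which encodes leucine at internal positions.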