Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0293 |
Symbol | |
ID | 5897567 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 324940 |
End bp | 327285 |
Gene Length | 2346 bp |
Protein Length | 781 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 641560777 |
Product | Beta-N-acetylhexosaminidase |
Protein accession | YP_001681928 |
Protein GI | 167644265 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3525] N-acetyl-beta-hexosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.218419 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTTTCG TAGCCAAGCC CTTCGCCAAA CTCGCCGCCC TGGGCGCATC GGTCGCCGTC CTGGCCTTCG CCTCGGCCAG CGTCGCCGCG ACGCCGACCC TGACCTTGAC GCCGGCGCCC GCCCAGGCCG AGATGGGGCA AGGGGTCTTC GCCCTGACCG CCCGGACCCG GATCTTCGTC GCCAAGGGCG ATGTCGAGGC CAGGGTCGTG GCCAGCCAGC TTTCCGACAT GCTGTTCAAG GCCCGAGGCC TGAAGCCCGC CGTCGTCGAG GGCGCGCCGC CCGCAGGCGA GGCCGCGATC GTCCTGGTCC GGACCCAGGC CGCCCCTGAA GCAGGAATTG GCGACACGGC CGAGGCCTAC CTCCTCGACG TCGCCCCGAC CGGCGTCACC ATCACCGCTC CCAAGCGCGC CGGTCTGTTC TACGGCGCGG TCAGCGTCTG GCAACTGGCC GTGCAGGACG CCGCCAAGGG TCCCGCGGAC CTGCCGGCGG TCAGCATCGT CGACGCCCCG CGTTTCGCCT GGCGCGGCTT CATGCTCGAC AGCGCCCGCC ACGTCCAGAG CATCGACACC ATCAAGGCGA TCCTCGACGC CATGGCCGCC CACAAGCTCA ATGTCCTGCA TTGGCATCTG GTCGATGACC AAGGGTGGCG GCTGGAGATC AGGAAATATC CGAGGCTGAC GTCCGAAGGG GCCTGGCGCG CGCCGGCAGG GGCGGCGGGC AAGGACCCCA AGACCGGCAA GCCGATCCGC TACGGCGGCT TCTACACCCA GGACCAGGTG CGCGACCTAG TCGCCTACGC CGCCGCGCGC GGCGTCACCA TCGTGCCCGA GATCGAGATG CCGGGCCACG CTCTGGCGCC GCTGGTGGCC TATCCCCAGT TCGGCATGAC GAAGACTCCG CCGCGCGCCA GCATGGGCGA CTGGGGCGTG TTCCCCTATC TCTACAGGCC CAGCGAAGAG ACGTTCACGT TCCTCAACGA CGTGCTCGAC GAGGTGATGG ACTTGTTCCC CTCGCCCTAT ATCCATGTGG GCGGCGACGA GGCGGTCAAG GATCAGTGGA AGGCCAGCCC CGAGGTCCAG GCCCAGATCC AGGCCTTGGG CGTCAAGGAC GAGCACGGCC TGCAGAGCTG GTTCATCCAG CGGGCCGAGA AGCACATCAA CGCCCGCGGC CGGCGGATGA TCGGCTGGGA CGAGATCCTT GAAGGCGGCC TGGCCCCCAA CGCCACGGTG ATGTCCTGGC GCGGCGTCGA CGGCGCGGTC GCCGCCGCCT CGCAGGGTCA CGACGCCGTC CTGGCCCCAG ACAGCACGCT CTACATGGAC CGCCGGCAGA GCGCCTCGGC CGACGAGCCG CCTGGCCGGA TCAAGATCAC CAGCCTCAAG GACGTCTACG CGTTCGACGC CGCCCCGGCC GCGCTCACAC AGGCCCAGCG CGCCCACATC CTGGGCCTGC AGGCCACGTC GTTCACCGAG CACATGCGCA CCGACGAGCG ACTGGAAAGG ATGACCTTCC CCCGGCTGGT CGCGGTGGCC GAGAATGGCT GGACGCCTGA GGCCCAGCGC GACTGGACCC GCTTCGCCGC CCGCCTGCCG GCCGAGACCG CGCGGCTGGA CGCGCTGGGT GTCGCCCACG ACACCGTGCC CTACGAGCCC CAGGCGACCC TGACCCCGGC GGCGGACGGC GAGATCTCGG TCGCCCTGGC CTCGGGTCTG GGCCTGGGTG AGATCCGCTA CACCACCGAC GGCCAGGCGC CGACCAAGAC CTCCGCGCTG TATGATGCGC CTCTCGCGGT CGCGCCCGGC AAGACGCTCC GTGTCCGGAC GTTCCTCGAA GATGACGCCC TGGGCCGGAT CCGCGACTAC CCCATCAGCC TGGCGGCGGC CCGGACGCGC AACAGCCATC AGCTCGAGAC CTGCGGCAAC GGCATCAATC TCTCATTGGA GGACGATGCG CCGGTCACGG GTCCGCGCGC CGTGTTCGCG GTCGACCTGA TGAACCCCTG CTGGGTGTGG AAGGGCGCGG ACCTGTCCTC GGTCCTGAAG CTGACGGCTC GCATTGGCCA GGTCCCGTTC AACTTCCAGA TCGGCGCCGA CAAGGCCAGG ATCCCGTTGC GGGCGCCAGC GACGCCGGAC GGCGAGCTCG AAGTGCGCCT CGACGGGTGC GCGGGCGAGC GGATCGCCGT CCTCCCGCTA GGCGCCGCGG CGCGCGGACC GGCCATGGGA ACCGTTTCCG GCGTGGTCCC CGCCAAGGGT GGCGTCCACG ACCTCTGCCT CAGCTTTACA GCGCGCGGCG TCGAGCCGAC GCTGGTTCTC GACCAGGTCA CGCTGACCCC AACCCACAAG AACTAG
|
Protein sequence | MTFVAKPFAK LAALGASVAV LAFASASVAA TPTLTLTPAP AQAEMGQGVF ALTARTRIFV AKGDVEARVV ASQLSDMLFK ARGLKPAVVE GAPPAGEAAI VLVRTQAAPE AGIGDTAEAY LLDVAPTGVT ITAPKRAGLF YGAVSVWQLA VQDAAKGPAD LPAVSIVDAP RFAWRGFMLD SARHVQSIDT IKAILDAMAA HKLNVLHWHL VDDQGWRLEI RKYPRLTSEG AWRAPAGAAG KDPKTGKPIR YGGFYTQDQV RDLVAYAAAR GVTIVPEIEM PGHALAPLVA YPQFGMTKTP PRASMGDWGV FPYLYRPSEE TFTFLNDVLD EVMDLFPSPY IHVGGDEAVK DQWKASPEVQ AQIQALGVKD EHGLQSWFIQ RAEKHINARG RRMIGWDEIL EGGLAPNATV MSWRGVDGAV AAASQGHDAV LAPDSTLYMD RRQSASADEP PGRIKITSLK DVYAFDAAPA ALTQAQRAHI LGLQATSFTE HMRTDERLER MTFPRLVAVA ENGWTPEAQR DWTRFAARLP AETARLDALG VAHDTVPYEP QATLTPAADG EISVALASGL GLGEIRYTTD GQAPTKTSAL YDAPLAVAPG KTLRVRTFLE DDALGRIRDY PISLAAARTR NSHQLETCGN GINLSLEDDA PVTGPRAVFA VDLMNPCWVW KGADLSSVLK LTARIGQVPF NFQIGADKAR IPLRAPATPD GELEVRLDGC AGERIAVLPL GAAARGPAMG TVSGVVPAKG GVHDLCLSFT ARGVEPTLVL DQVTLTPTHK N
|
| |