Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3622 |
Symbol | |
ID | 5901077 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3910163 |
End bp | 3912409 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564133 |
Product | Beta-glucosidase |
Protein accession | YP_001685247 |
Protein GI | 167647584 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.642494 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGAC AGCAACTTCG GGCCCTCGCC CTGGCCCTGA TGCTGACCAC GGCGCTGCCG ACAGCCGCCG CTCACGCCCA GGCCGCGTCT GCGAGCACGG CGAAACCCTG GATGAACACC AAGCTGAGCG CCGATCAGCG CGCCGAGCTG GTCGTCGCGC AAATGACCCA GGACGAGAAA TTGGCGCTGG TGTTCGGCTT CTTCGGTTCG AACCAGAAGA CGCCGCAGTT CACCCCCTCG CCGGAGGCCC GCATGGGCTC GGCCGGCTAC ATCCCCGGCA TTCCGCGCCT CGGCGTGCCG CCGCTGTGGG AGACCGACGC CGGCGTCGGC GTCGCCACCC AGCGCGAAAC CAGCGACCCG TACCGCGAGC GCACCTCCCT GCCCTCGGGC CTGGCCACCG CCGCGACCTG GAATCCCGAG CTGGCCTACA AGGGCGGCGC GATGATCGGC TCGGAGGCTC GCGATTCGGG CTTCAACGTG CAGCTGGCCG GCGGCGTGAA CCTGGCTCGC GAGCCGCGCA ACGGCCGCAA CTTCGAATAT GGCGGCGAGG ATCCGCTGCT GGCCGGCACG ATCGTCGGCG CGCAGATCCG CGGCATCCAG TCCAACAAGA TCATCTCGAC CATCAAGCAC TGGGCGCTGA ACGGCCAGGA GACCGGCCGC ATGACCGTCA GCGCCAATAT TGCCGACGAC GCGGCCCGCG CGTCGGACTT CCTGGCCTTC GAACTGGCCA TCGAGCAGTC GGACCCCGGC GCGGTGATGT GCGCCTATAA CCGCATCAAC AGCACGTACG CCTGCGAGAG CAACTACCTG CTCAACGAGG TCCTGAAGAC CGACTGGGGC TACAAGGGGT TCGTGATGTC CGACTGGGGC GGGGTGCACT CCACCCCCAA GGCCGCCAAG GCGGGCCTGG ACCAGGAGTC GGCCTACACC TTCGACAAGC AGCCCTTCTT CGGCGCGCCT CTGAAGGCCG CCGTGGCCGA TGGCTCGGCG CCGCAGGCTC GCCTGGACGA CATGGCCAAG CGCATCACCC GCTCGATGTT CGCCCACGGC CTGTTCGACC ACCCTGTCGC CATCAAGCCG ATCGACTTCG CCGCCCACGC CAAGATCACC CAGGCCGACG CCGAAGAGGC CATCGTCCTG CTGAAGAACG ACAAGGGCCT GCTTCCGCTG GCCAGGACCG CCAGGAAGAT CGTCGTCATC GGCTCGCACG CCGATGTCGG CGTGCTGTCG GGCGGCGGCT CGTCGCAGGT GTTCCCGATC GGCGGCATGG CGGTGAAGGG TCTGGGTCCC AAGGGCTTCC CCGGCCCGAT CGTCTACCAC CCCTCCTCGC CGCTGAAGGC GCTGCAGGTC CGCAATCCCG GCGCGACCTT CGCCTATGAC GACGGGACCG ATGCGGCCGC CGCCGCCAAG CTGGCCGCCG GCGCCGACCT CGTGATCGTC TTCGCCCACC AGTGGGCCGC CGAGTCGCAG GACTATTCCC TGACCCTGGC CGACAATCAG GACGCCCTGA TCGACGCCGT CGCCTCGGCC AATCCCAAGA CCGCCGTGGT TCTGGAAACG GGCGGTCCGG TGCTGATGCC CTGGCTCGAC AAGGTCGGCG CCGTGGTCGA GGCCTGGTAT CCAGGCACCC ATGGCGGCGA GGCCATCGCC CGGGTGCTGA CCGGCGAGGT CAACCCGTCC GGGCGCTTGC CGATCACCTT CCCGAAAAGC GTCGACCAGT TGCCGCGTCA GACGATCGAC TGCGACCCGG CCAAGCCCGA GGACTTCTGC GACGTCAACT ACGACATCGA GGGCGCGGCG GTCGGCTACA AGTGGTTCGA CCAGAAGGGC CACGCCCCGC TGTTCGCCTT CGGCCACGGC CTGTCCTACT CGACCTTCGC CTACAGCGGC CTGAAGACCG AGGTGGTCGG CGACACGCTG AGGGTCAGCT TCACCGTCAA GAACGCCGGA AAGGCTGCTG GCAAGGACGT GCCGCAGGTC TATGTCGGCC CGAAGGCTGG GGGTTGGGAA GCCCCGCGCC GCTTGGCCGG CTTCAAGAAG GTCGATCTGG TTCCGGGCGC GACCACCAAG GTCAGCGTCA CCGTCGATCC GCGCCTGCTG GCCACCTTCG ACTCCAAGGC CAAGACCTGG AACATCGCCG CCGGCGCCTA CGAGGTGTCG CTGGGCGCCT CGTCGCGCGA CCTGACGGCG AAGTCCGATG TGGCGATGGC GGCCAAGACA CTCCCGGTGT CGTACGACGG GAAGTAG
|
Protein sequence | MKRQQLRALA LALMLTTALP TAAAHAQAAS ASTAKPWMNT KLSADQRAEL VVAQMTQDEK LALVFGFFGS NQKTPQFTPS PEARMGSAGY IPGIPRLGVP PLWETDAGVG VATQRETSDP YRERTSLPSG LATAATWNPE LAYKGGAMIG SEARDSGFNV QLAGGVNLAR EPRNGRNFEY GGEDPLLAGT IVGAQIRGIQ SNKIISTIKH WALNGQETGR MTVSANIADD AARASDFLAF ELAIEQSDPG AVMCAYNRIN STYACESNYL LNEVLKTDWG YKGFVMSDWG GVHSTPKAAK AGLDQESAYT FDKQPFFGAP LKAAVADGSA PQARLDDMAK RITRSMFAHG LFDHPVAIKP IDFAAHAKIT QADAEEAIVL LKNDKGLLPL ARTARKIVVI GSHADVGVLS GGGSSQVFPI GGMAVKGLGP KGFPGPIVYH PSSPLKALQV RNPGATFAYD DGTDAAAAAK LAAGADLVIV FAHQWAAESQ DYSLTLADNQ DALIDAVASA NPKTAVVLET GGPVLMPWLD KVGAVVEAWY PGTHGGEAIA RVLTGEVNPS GRLPITFPKS VDQLPRQTID CDPAKPEDFC DVNYDIEGAA VGYKWFDQKG HAPLFAFGHG LSYSTFAYSG LKTEVVGDTL RVSFTVKNAG KAAGKDVPQV YVGPKAGGWE APRRLAGFKK VDLVPGATTK VSVTVDPRLL ATFDSKAKTW NIAAGAYEVS LGASSRDLTA KSDVAMAAKT LPVSYDGK
|
| |