Gene Information |
Locus tag | Caul_2140 |
Symbol | |
ID | 5902586 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2313953 |
End bp | 2316199 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562630 |
Product | Beta-glucosidase |
Protein accession | YP_001683766 |
Protein GI | 167646103 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1472] Beta-glucosidase-related glycosidases |
TIGRFAM ID | |
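The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes one terminal stop codon. The function names are hypothetical and serve only as illustration.

```python
# Values copied from the Gene Information table of this record.
start_bp = 2313953
end_bp = 2316199
gene_length_bp = 2247
protein_length_aa = 748


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons in the CDS minus one, assuming a single stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Both comparisons should hold if the assumptions above match the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the computed values agree with the reported Gene Length and Protein Length. The strand field does not affect the check, because the span length is the same on either strand.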
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.0303148 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0291608 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACCAAC AGACCTGGCG GGGCGTGACC CTGGCCCTGA TGCTCGGCGC ATCCTCCTGC GCCCTGGCGC CCGCCGCCTG CGCCCAGGCG CCCGCGCCCG CCGCCCGTCC GTGGCTTGAT CCCAAGCTGG GCGCCGACAC GCGCGCCGAC CTGGCGCTCA AGGCCATGAC CCAGGACGAG AAGCTGACGA TCATCTTCGG CTATTTCGGC GCCGACATGG CCCCCAAGTA TAAGCGCGTG GCCGACGCCC TGCCCGGCTC GGCCGGCTAT GTGCCGGGGA TCGCGCGCCT GGGCATTCCC GCCCAGTTCC AGACCGACGC CGGCGTCGGC GTGGCCACCC AGGGGGGCGA GCCCAACAAG CGCGAACGCA CCGCCCTGCC CTCCGGCATG GCCACCGCCG CGACCTGGAA TCCGAAACTG GCCCAGGCCG GCGGCGCGAT GATCGGCGCC GAGGCCCGCT CGTCCGGCTT CAACGTCATG CTGGCCGGCG GCGTGAACCT GGTCCGTGAA CCGCGCAACG GCCGCAACTT CGAATATGGC GGCGAGGATC CGTGGCTGGC CGGCCAGATG GTCGGGGCCC AGATCAAGGG CATCCAGTCC AACGCCATCA TCTCGACCAT CAAGCACTAC GCCCTGAACG GCCAGGAGAC CGGCCGCTTC GTGCTGGACG CCAAGATCGG CGAAGGCGAG GCCCGCACCT CCGACCTGCT GGCCTTCCAG TTCGCCACCG AGATCGCGGA CCCCCACTCG GTGATGTGCG CCTACAACAA GGTCAATGGC GACTACGCCT GCGAGAACGA TTTCCTGCTC AACAAGGTTC TCAAGCAGGA CTGGGCCTAC AAGGGCTATG TGATGTCGGA TTGGGGCGCG CACCATTCCA GCGCCAAGGC CGCCAATGCG GGCCTGGACC AGGAATCGGC CGGCGACGCC TTCGACAAGC AGCCCTTCTT CAAGGGTCCG CTGAAGGACG CCCTGGCCAA GGGCGAGGTG TCCCAGGCCC GGCTCGACGA CATGGCCCGC CGCATTCTGC GCAGCCTGTT CGCCAGCGGC GTGGTCGAAA AGCCGGTGAA GATCGAGACC ATCGACTACG CCGCCCACGC CAAGGTCACG CAGGCCGACG CCGAGGAAGG CATTGTCCTG CTGAAGAATG ACAAGGGCCT GCTGCCGCTC GTCGCCAGCG CCAAGAAGAT CGTCGTGATC GGCGGCCACG CCGATGTCGG CGTGCTGTCG GGCGGCGGCT CCTCGCAGGT CTATCCGATC GGCGGCCGCG CGGTGCAGGG CGAAGGTCCC GCCACTTGGC CGGGTCCGAT GATCTACTTC CCCTCCTCGC CCCTGAAGGC GCTGAAGGCC CGCCTGCCGG GCGCCGACAT CCAGTACATC AACGGCAAGG ACAAGGCCGC CGCCGCCAAG CTGGCGACCG GCGCCGACGT GGTGCTGGTG TTCGCCACCC AATGGAACGG CGAGTCGTTC GACAGCCCGC TGACTCTGGA AAACGACCAG GACGCCCTGA TCGACGCCGT CGCCTCAGCC AACGCCAAGA CCGTGGTGGT GCTGGAGACC GGCGGCCCGG TGCTGATGCC CTGGCTCGAC AAGGTGGGCG GCGTGGTCGA GGCCTGGTAT CCCGGTTCGG AAGGCGGCGA GGCCATCGCC CGCGTGCTCA CCGGCGAGGT CGACGCCTCG GGCCGCCTGC CCGTCACCTT CCCCGCCGCC CTGGCCCAGT TGCCGCGTCC GGTGCTGGAC GGCGACCCCA AGAAGCCCGA CGACAGCTTC CCGGTCGACT ATACGATCGA GGGCGCGACG 
GTCGGCTACA AGTGGTTCGA CAAGAAGGGC CAGCAACCGC TGTTCGCGTT CGGCCACGGC CTGTCCTACA CCAGCTTCGC CTACGCCAAC CTCAAGGCCG AGGCCCGGAA CGGCGCCCTG ACCGTCAGCT TCGACGTCAG GAACACCGGC CGGCGAACCG GCAAGGCCGT GCCGCAGGTC TATGTCTCGC CCAAGGCCGG GGGATGGGAA GCGCCTCAGC GTCTGGCCGC CTTCAGCAAG GTCGAGCTGG CGCCCGGCGC GACCCAGAAC GTCACCCTGA CCATCGATCC GCGCCTGCTG GCCGCCTGGG ACGACAAGGC CCACGGCTGG TCGATCGCGG CTGGCGACTA CACCGTCACC CTGGGCGCTT CGTCACGCGA CACCGCCGCC AAGGCGGACG TCGCGGTGGC GGCCCGGACC GTTCCTGTCG GCCTGATGAA GCCCTGA
|
Protein sequence | MNQQTWRGVT LALMLGASSC ALAPAACAQA PAPAARPWLD PKLGADTRAD LALKAMTQDE KLTIIFGYFG ADMAPKYKRV ADALPGSAGY VPGIARLGIP AQFQTDAGVG VATQGGEPNK RERTALPSGM ATAATWNPKL AQAGGAMIGA EARSSGFNVM LAGGVNLVRE PRNGRNFEYG GEDPWLAGQM VGAQIKGIQS NAIISTIKHY ALNGQETGRF VLDAKIGEGE ARTSDLLAFQ FATEIADPHS VMCAYNKVNG DYACENDFLL NKVLKQDWAY KGYVMSDWGA HHSSAKAANA GLDQESAGDA FDKQPFFKGP LKDALAKGEV SQARLDDMAR RILRSLFASG VVEKPVKIET IDYAAHAKVT QADAEEGIVL LKNDKGLLPL VASAKKIVVI GGHADVGVLS GGGSSQVYPI GGRAVQGEGP ATWPGPMIYF PSSPLKALKA RLPGADIQYI NGKDKAAAAK LATGADVVLV FATQWNGESF DSPLTLENDQ DALIDAVASA NAKTVVVLET GGPVLMPWLD KVGGVVEAWY PGSEGGEAIA RVLTGEVDAS GRLPVTFPAA LAQLPRPVLD GDPKKPDDSF PVDYTIEGAT VGYKWFDKKG QQPLFAFGHG LSYTSFAYAN LKAEARNGAL TVSFDVRNTG RRTGKAVPQV YVSPKAGGWE APQRLAAFSK VELAPGATQN VTLTIDPRLL AAWDDKAHGW SIAAGDYTVT LGASSRDTAA KADVAVAART VPVGLMKP
|
| |