Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | TM1040_3309 |
Symbol | |
ID | 4075713 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008043 |
Strand | + |
Start bp | 317847 |
End bp | 319181 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 638004817 |
Product | Beta-glucosidase |
Protein accession | YP_611543 |
Protein GI | 99078285 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR03356] beta-galactosidase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.581632 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.566627 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATT TCACTTTTAC ACGCCGCGAC TTTCCAGGGG ATTTCCTGTT CGGCTGCGCG ACGTCTTCCT ATCAGATTGA AGGCCATCAA TATGGCGGCG CAGGGCCGAC CCATTGGGAT AGTTTTGCCG CAACACCCGG CAATGTGGTG CGTTCTGAGG ATGGCGCACG CGCCTGTGAC CACTACCACC GCTTTGAGGA GGACCTCGAT CTCGCCGCTG CAGCGGGGTT TGAGTGCTAT CGGTTCTCGA CCAGTTGGGC ACGCGTGCTG CCCGAGGGGC GTGGCACGCC CAATGCCGAG GGGCTCGATT TCTACGACCG GCTGACCGAC GCGATGCTCG AACGCGGCCT GAAGCCCTGC GCGACGCTCT ACCACTGGGA GCTGCCGCAG CCACTGGCCG ACATGGGCGG CTGGCGCAAT CGAGATGTAA GCAACTGGTT TGCCGAATTC ACCGAAGTCA TCATGTCCCG GATCGGCGAT CGGATGTATT CCGTGGCACC CATCAACGAA CCCTGGTGCG TTGGGTGGCT TTCGCATTTT CTGGGCCATC ACGCGCCCGG TCTGCGCGAC ATCCGCGCCA CCGCACGGGC CATGCACCAT GTCCTGTTGT CCCATGGCCG CGCCATCGAA GTGATGCGGG GCCTTGGAAT GAACAACCTT GGTGCGGTGT TCAATTTTGA ATGGGCGGAG CCGCTGGATC AGAGTGCCCA AGCTCAGGCC GCGGCAGAAA CCTATGATGC GATCTACAAC CGCTTTTTCC TCGGCGGGGT CTTTAAAGGC GCTTATCCCG AAGCAGCCTT GCGCGGGCTC GAACCCCATT TGCCGCAGGG CTGGCAGGAC GATTTCGCCA CCATCACCCA AAAGGTCGAT TGGTGCGGTT TGAACTATTA CACTCGCAAG GTGATCGGTC CAGACAATGG CCCCTGGCCC CATTACGCAG AGCTGGTGGG CGAACTGCCC ACGACCCAGA TGGGGTGGGA GATTTATCCA GATGGGCTTT ACAAGTTTCT GAAGCGCACA GCCGAAGACT ACACCGGAGG TCTGCCGCTC ATCGTGACCG AGAACGGCAT GGCAAACCCG GATGTGCTTC TGGAGGGCGA GGTGCCGGAC GCAGCCCGCA TCGCCTACGT TGAGGCACAC CTTGCGCGGG TACGCCAAGC GATTGCAGAG GGCGTCCCGG TGAAGGGTTA CTTTCTGTGG TCACTTCTCG ACAATTACGA ATGGGCGCTC GGGTACGAGA AACGCTTTGG TCTGGTGCAT GTCGACTTTG AGACGCTGAA GCGCACGCCG AAGGCCTCTT ATCGTGCCTT GCAACGTGCG CTTACCGCAT CCTGA
|
Protein sequence | MTDFTFTRRD FPGDFLFGCA TSSYQIEGHQ YGGAGPTHWD SFAATPGNVV RSEDGARACD HYHRFEEDLD LAAAAGFECY RFSTSWARVL PEGRGTPNAE GLDFYDRLTD AMLERGLKPC ATLYHWELPQ PLADMGGWRN RDVSNWFAEF TEVIMSRIGD RMYSVAPINE PWCVGWLSHF LGHHAPGLRD IRATARAMHH VLLSHGRAIE VMRGLGMNNL GAVFNFEWAE PLDQSAQAQA AAETYDAIYN RFFLGGVFKG AYPEAALRGL EPHLPQGWQD DFATITQKVD WCGLNYYTRK VIGPDNGPWP HYAELVGELP TTQMGWEIYP DGLYKFLKRT AEDYTGGLPL IVTENGMANP DVLLEGEVPD AARIAYVEAH LARVRQAIAE GVPVKGYFLW SLLDNYEWAL GYEKRFGLVH VDFETLKRTP KASYRALQRA LTAS
|
| |