Gene Information |
Locus tag | BBta_4086 |
Symbol | |
ID | 5155436 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4282696 |
End bp | 4284156 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640558919 |
Product | beta-galactosidase |
Protein accession | YP_001240058 |
Protein GI | 148255473 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR03356] beta-galactosidase |
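The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed lengths) and that the final codon is a stop codon that contributes no amino acid. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4282696
end_bp = 4284156

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# One codon is three bases; the last codon is a stop and encodes no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(f"gene length: {gene_length} bp")
print(f"leftover bases after splitting into codons: {remainder}")
print(f"protein length: {protein_length} aa")
```

Under these assumptions the result agrees with the Gene Length (1461 bp) and Protein Length (486 aa) rows.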
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0681352 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCGAACG ACGTCTCCCG CCGTGATCTT GCGAAGCTGG CCGGACTGGC TGCCATGGGG GCGGCGGCCG GACCCGCGCA CGCGGAGGAG GCTGCGGTGA CGGATGATGC CGGGCGTCGT TTTCCCTCTG ATTTCGTTTG GGGGACGGCC ACCTCGTCCT ACCAGATCGA AGGTGGCGCC ACCGCCGATG GCCGCGGACC ATCGATCTGG GATGTCTTCA CCCACACCCC CGGCAAGATC GAGGACGGCA GCACCGGCGA CGTCGCTTGC GATCATTACG ATCGCTACAA GCACGATGTG CGGCTGATCA AGGAGCTTGG CTGCCGCGCC TATCGCTTCT CGATTGCGTG GCCGCGGCTG TTTCCCGATG GCGGGTTGAC CCCCAATCCG AAGGGGCTCG ACTTTTACAG CCGCCTCGTC GACGAGCTCC TGGCGAACGG CATCGAGCCC TATGCGACAT TGTATCATTG GGATCTGCCG CAGGCGCTGC AGGACCGCGT CGGCGGCTGG CGCTCGGCGG AGACCGCAGC GGCGTTCGCG CATTATGCCG GATATGTGGC CCAGACTCTG AGCGACCGGG TCAAGACCAT CTTCACGATC AACGAATGCG GCCGGTTCAT TCCGTTCGGC TATGGTCTCG GCATCGATGC GCCCGGGCTG AAACTGCCGC AGCAGGAGGT CAACCAGGCG CGCCATCACG TGGCGCTGGC GCATGGCCTC GCGGTGCAGG CGATCCGTGC CAAGGGGAGG GCAGGTACGC GCGTCGGCAT GGCCGAGAAC ATCACCGCCT GCCTGCCCGC GATCGATACG CCAGAGAACA TCCGCGCCGC CGAGATCGCC ACGCGCGAGA TGAATGCGGG CTTCCTCAAC GTGATCCTCG AGGGCCGCTA CACCGACGCG TTCCTGGCCT GGTCGGGCAA GGATGCGCCG ACATTCACCG CGGACGAACT CAAGACGATC TCCACGCCGG TCGATTTCGT CGGCCTCAAC ATCTACGCGC CGCAGGCCTA TGTCGTGGCG TCTGAGCGCG CGCCGGGGTT CGACGTGTTG CCGATGCCGT CCTCGTTCCC GCATATGAGC TCGCCCTGGC TGCTGGTCGG ACCCGAGACC GCTTATTGGG TGCCGAAGCT CGCGGCCAAG ATCTGGAACC TCAAGACCAT CTACATTACC GAAAACGGCA CCTCGTCGGA TGACAAGGTG ACGGCGGACG GCAAGGTTCA TGACCTCGAT CGCGTGATGT ATCTGCGCAA CTATCTCGCG CAGCTGCAGC GCGCAACCTC CGAAGGCGTG CCGGTGAAGG GCTATTTCCT CTGGAGCCTG ATGGACAATT TCGAATGGGT GTTCGGCTAT AAGCAGCGCT TCGGTGTTTA TCATGTCGAT TTCGACACCC AGCTGCGTAC ACCCAAGCTC AGCGCGTCCT ATTATCGTCA CGTCATCACG CGCAATGCCG TGAGTGCGTG A
Protein sequence | MPNDVSRRDL AKLAGLAAMG AAAGPAHAEE AAVTDDAGRR FPSDFVWGTA TSSYQIEGGA TADGRGPSIW DVFTHTPGKI EDGSTGDVAC DHYDRYKHDV RLIKELGCRA YRFSIAWPRL FPDGGLTPNP KGLDFYSRLV DELLANGIEP YATLYHWDLP QALQDRVGGW RSAETAAAFA HYAGYVAQTL SDRVKTIFTI NECGRFIPFG YGLGIDAPGL KLPQQEVNQA RHHVALAHGL AVQAIRAKGR AGTRVGMAEN ITACLPAIDT PENIRAAEIA TREMNAGFLN VILEGRYTDA FLAWSGKDAP TFTADELKTI STPVDFVGLN IYAPQAYVVA SERAPGFDVL PMPSSFPHMS SPWLLVGPET AYWVPKLAAK IWNLKTIYIT ENGTSSDDKV TADGKVHDLD RVMYLRNYLA QLQRATSEGV PVKGYFLWSL MDNFEWVFGY KQRFGVYHVD FDTQLRTPKL SASYYRHVIT RNAVSA
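Both sequences are displayed in space-separated blocks of ten characters, so the spacing has to be removed before the sequence is used elsewhere. The following is a minimal sketch of that cleanup plus a GC-fraction calculation of the kind behind the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the demo uses only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(grouped: str) -> str:
    """Remove the whitespace that separates the 10-character blocks."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short demonstration.
demo = clean_sequence("ATGCCGAACG ACGTCTCCCG")
print(len(demo), gc_fraction(demo))
```

Passing the full gene sequence through the same two helpers should give a value comparable to the GC content row, subject to how that field was rounded.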