Gene BBta_4086 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_4086 
Symbol 
ID5155436 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp4282696 
End bp4284156 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content64% 
IMG OID640558919 
Productbeta-galactosidase 
Protein accessionYP_001240058 
Protein GI148255473 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2723] Beta-glucosidase/6-phospho-beta-glucosidase/beta-galactosidase 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
[TIGR03356] beta-galactosidase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0681352 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACG ACGTCTCCCG CCGTGATCTT GCGAAGCTGG CCGGACTGGC TGCCATGGGG 
GCGGCGGCCG GACCCGCGCA CGCGGAGGAG GCTGCGGTGA CGGATGATGC CGGGCGTCGT
TTTCCCTCTG ATTTCGTTTG GGGGACGGCC ACCTCGTCCT ACCAGATCGA AGGTGGCGCC
ACCGCCGATG GCCGCGGACC ATCGATCTGG GATGTCTTCA CCCACACCCC CGGCAAGATC
GAGGACGGCA GCACCGGCGA CGTCGCTTGC GATCATTACG ATCGCTACAA GCACGATGTG
CGGCTGATCA AGGAGCTTGG CTGCCGCGCC TATCGCTTCT CGATTGCGTG GCCGCGGCTG
TTTCCCGATG GCGGGTTGAC CCCCAATCCG AAGGGGCTCG ACTTTTACAG CCGCCTCGTC
GACGAGCTCC TGGCGAACGG CATCGAGCCC TATGCGACAT TGTATCATTG GGATCTGCCG
CAGGCGCTGC AGGACCGCGT CGGCGGCTGG CGCTCGGCGG AGACCGCAGC GGCGTTCGCG
CATTATGCCG GATATGTGGC CCAGACTCTG AGCGACCGGG TCAAGACCAT CTTCACGATC
AACGAATGCG GCCGGTTCAT TCCGTTCGGC TATGGTCTCG GCATCGATGC GCCCGGGCTG
AAACTGCCGC AGCAGGAGGT CAACCAGGCG CGCCATCACG TGGCGCTGGC GCATGGCCTC
GCGGTGCAGG CGATCCGTGC CAAGGGGAGG GCAGGTACGC GCGTCGGCAT GGCCGAGAAC
ATCACCGCCT GCCTGCCCGC GATCGATACG CCAGAGAACA TCCGCGCCGC CGAGATCGCC
ACGCGCGAGA TGAATGCGGG CTTCCTCAAC GTGATCCTCG AGGGCCGCTA CACCGACGCG
TTCCTGGCCT GGTCGGGCAA GGATGCGCCG ACATTCACCG CGGACGAACT CAAGACGATC
TCCACGCCGG TCGATTTCGT CGGCCTCAAC ATCTACGCGC CGCAGGCCTA TGTCGTGGCG
TCTGAGCGCG CGCCGGGGTT CGACGTGTTG CCGATGCCGT CCTCGTTCCC GCATATGAGC
TCGCCCTGGC TGCTGGTCGG ACCCGAGACC GCTTATTGGG TGCCGAAGCT CGCGGCCAAG
ATCTGGAACC TCAAGACCAT CTACATTACC GAAAACGGCA CCTCGTCGGA TGACAAGGTG
ACGGCGGACG GCAAGGTTCA TGACCTCGAT CGCGTGATGT ATCTGCGCAA CTATCTCGCG
CAGCTGCAGC GCGCAACCTC CGAAGGCGTG CCGGTGAAGG GCTATTTCCT CTGGAGCCTG
ATGGACAATT TCGAATGGGT GTTCGGCTAT AAGCAGCGCT TCGGTGTTTA TCATGTCGAT
TTCGACACCC AGCTGCGTAC ACCCAAGCTC AGCGCGTCCT ATTATCGTCA CGTCATCACG
CGCAATGCCG TGAGTGCGTG A
 
Protein sequence
MPNDVSRRDL AKLAGLAAMG AAAGPAHAEE AAVTDDAGRR FPSDFVWGTA TSSYQIEGGA 
TADGRGPSIW DVFTHTPGKI EDGSTGDVAC DHYDRYKHDV RLIKELGCRA YRFSIAWPRL
FPDGGLTPNP KGLDFYSRLV DELLANGIEP YATLYHWDLP QALQDRVGGW RSAETAAAFA
HYAGYVAQTL SDRVKTIFTI NECGRFIPFG YGLGIDAPGL KLPQQEVNQA RHHVALAHGL
AVQAIRAKGR AGTRVGMAEN ITACLPAIDT PENIRAAEIA TREMNAGFLN VILEGRYTDA
FLAWSGKDAP TFTADELKTI STPVDFVGLN IYAPQAYVVA SERAPGFDVL PMPSSFPHMS
SPWLLVGPET AYWVPKLAAK IWNLKTIYIT ENGTSSDDKV TADGKVHDLD RVMYLRNYLA
QLQRATSEGV PVKGYFLWSL MDNFEWVFGY KQRFGVYHVD FDTQLRTPKL SASYYRHVIT
RNAVSA