Gene BBta_6640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBBta_6640 
Symbol 
ID5150079 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBradyrhizobium sp. BTAi1 
KingdomBacteria 
Replicon accessionNC_009485 
Strand
Start bp6907656 
End bp6909344 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content66% 
IMG OID640561332 
Productbenzoyl-CoA-dihydrodiol lyase 
Protein accessionYP_001242446 
Protein GI148257861 
COG category[I] Lipid transport and metabolism 
COG ID[COG1024] Enoyl-CoA hydratase/carnithine racemase 
TIGRFAM ID[TIGR03222] benzoyl-CoA-dihydrodiol lyase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.0742487 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGGCG CACCGCGGAA GCTCGCAAAC GGAAAAGCGT TCATCGATTT CCAGACGGAA 
CCGTCGCGCT ACCGCCACTG GAAGCTGTCG GTCCAAGGCG AGACCGCGAC GCTCGCGATG
GATGTCGACG AGAATGCCGG GCTGTTCGAG GGCTATCAGC TCAAGCTGAA CTCCTACGAT
CTCGGCGTCG ACATCGAGCT TGCCGACGCG TTGATGCGGC TCCGCTTCGA GCATCCCAAC
GTCAAGGTCG TGCTGCTGCG CTCGGCCAAA GACCGCGTGT TCTGCGCGGG CGCCAACATC
CGCATGCTGG CCGGTTCGGA ACATCCGCAC AAGGTCAACT TCTGCAAGTT CACCAATGAG
ACGCGCAACG CGATGGAGGA TGCGAGCCAA TATTCCGGCC AGCGCTTCGT CACGGTGGTG
AACGGCACGG CCGCCGGCGG CGGCTACGAG CTGGCGCTGG CGACCGACCA CATCATGATG
GTCGATGATG GCTCCACCGC GGTGTCGCTG CCCGAAGTGC CGTTGCTCGC CGTGCTGCCG
GGCACCGGAG GCCTGACGCG CGTTGTCGAC AAGCGGATGG TGCGCCGCGA CCACGCCGAC
GTGTTCTGCA CCATCGAAGA GGGGATCAAG GGGCGCCGCG CCGTGCAATG GCGCCTGATC
GACGAACTGG TGCCTCCGTC GAAGCTCGAA GAGCGTGTCA AGGCGACGGC GTCCGATCTG
GCGACCAAGT CGCCGCGCAG CGGTCCCGCC GAGGGCGTGA AGCTCACGCA GCTCGATCGC
AAATTCCGCG CCGACGGCAT CGACTACGGC TTCGTCCGGC TCGACATCGA GGCGGCGCAG
CGGCTCGCCA CCATCACGGT GCGTGTGGCC GAGGCGGCGT CAGCCAAGTC CGCCGCGGAG
ATCGTGGCGC TCGGAGCTGC GTTCTGGCCG TTGCAATGCG CCCGCGAGCT CGACGACGCC
ATTCTTCACT TGCGCAACAA TGCCTATGAG ATCGGAACGG TCATCTTCAA GACCGAGGGC
GATCCCGAGG TCGCCCGCGC CTATGATGAT CTGCTCGAAG CGCATCGCGA CAATTGGTTC
GTGCACGAGA TCCGCCATTA CTGGAAGCGC GTCCTGAAGC GCATCGATGT GACCTCGCGG
ACCCTGGTGG CGCTGATCGA GCCCGGCTCC TGCTTCGTCG GCACGCTCGC CGAGCTCGCT
TTCGCCGCTG ACCGCAGCTA CATGCTGCTC GGCCAGTTCC AGGGCGACAA TCGCCCGGCA
GCGACGATCG AGCTGACGCG CGCCAATTTC GGCGCCTATC CGATGTCGCA TGGCCTCACC
CGCTTGCAGT CGCGCTTCCT GGCCACGCCC GACAAGCCGG CGCAGCTCGA AGCCAAAATC
GCAACGGCGC TGGACGCCGA CGAGGCCGAG GCGCTGGGGC TCGTCACCTT CGCATTGGAC
GACATCGACT GGGCGGACGA GGTGCGCGTG TTCCTCGAAG AGCGCGCGAG CTTCTCGCCC
GATGCGATGA CCGGTCTCGA AGCCAATCTG CGCTTCGCCG GGCCCGAGAC GATGGAGTCC
AAGATCTTCG CGCGGCTCAC CGCCTGGCAG AACTGGATCT TCCAGCGGCC TAATGCGGTC
GGCGAGGAGG GCGCGCTGGC GCGCTACGGC ACCGGGCAAC GCCCGACCTA CGATCTGAGG
CGCGTCTGA
 
Protein sequence
MAGAPRKLAN GKAFIDFQTE PSRYRHWKLS VQGETATLAM DVDENAGLFE GYQLKLNSYD 
LGVDIELADA LMRLRFEHPN VKVVLLRSAK DRVFCAGANI RMLAGSEHPH KVNFCKFTNE
TRNAMEDASQ YSGQRFVTVV NGTAAGGGYE LALATDHIMM VDDGSTAVSL PEVPLLAVLP
GTGGLTRVVD KRMVRRDHAD VFCTIEEGIK GRRAVQWRLI DELVPPSKLE ERVKATASDL
ATKSPRSGPA EGVKLTQLDR KFRADGIDYG FVRLDIEAAQ RLATITVRVA EAASAKSAAE
IVALGAAFWP LQCARELDDA ILHLRNNAYE IGTVIFKTEG DPEVARAYDD LLEAHRDNWF
VHEIRHYWKR VLKRIDVTSR TLVALIEPGS CFVGTLAELA FAADRSYMLL GQFQGDNRPA
ATIELTRANF GAYPMSHGLT RLQSRFLATP DKPAQLEAKI ATALDADEAE ALGLVTFALD
DIDWADEVRV FLEERASFSP DAMTGLEANL RFAGPETMES KIFARLTAWQ NWIFQRPNAV
GEEGALARYG TGQRPTYDLR RV