Gene BBta_3235 details


Gene Information

Locus tag: BBta_3235
Symbol:
ID: 5153470
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bradyrhizobium sp. BTAi1
Kingdom: Bacteria
Replicon accession: NC_009485
Strand:
Start bp: 3389448
End bp: 3390788
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 59%
IMG OID: 640558101
Product: putative toluate 1,2-dioxygenase alpha subunit
Protein accession: YP_001239248
Protein GI: 148254663
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit
TIGRFAM ID:
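
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch that assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS ending in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3389448, 3390788)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```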


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.358123
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAAAAT ACGGGAATGA TGCCGGCGCC ATCATGGCGC TGGTCCGCAA GGCCGAGGTT 
CATCGCGACG TCTATGTCGA TCCCGAGATC TTCGATCTCG AGATGGAGCA CCTGTTTCCG
AACAGCTGGA TCTATATCGG GCATGCCAGC CAGCTTTCGA AACCAGGAGA TTTCATCACC
GCCAATGTCG GCCGCCAGCC GCTGATGGCC AGCCGCCATA GCGATGGCGC CATCCATGTC
TTCTACAACC GGTGTCCGCA TAAGGGCGTC AAGATCGCCT CGGAGCCGTG CGGCAACACT
GGAAAATTCT TCCGCTGTCC CTACCACGCC TGGTCGTTCA AGACCGACGG TTCGCTGCTC
GCCATCCCTT TGAAGAAGGG ATATGAGGGC ACCGGCTTTC CCGACACGCA AGCCAGCGAA
GGATTGGCGA AGGTCAAGAA CGTCGTGGTC TATCGCGACT TCATCTTTGT CCGTCTGAGC
GATGCAGGCC TGTCCTTCGA GGACTATTTC GGCGAAAGCC TCTCGACCAT CGACAACATG
GTTGATCGCT CTCCGGAAGG AAAGCTCGCG GTCTTGGCCG CGCCGATCCG CTACATGCAT
AACTGCAATT GGAAGATGCT GGTCGAGAAC CAGACCGATA CGTGTCATCC GATGGTTGCG
CATGAGAGTT CAGCCGGTAC CGCGGTCAAG GTCTGGAAGC GCGAGCAGGG CGACTCGACG
GAGACGCCGA TGGCGGTGCA GCTCTATGCT CCGTTCATGA GCCCTTACGA GTTCTACGAG
CAGAGCGGCA TCAGGATCTG GCCGAACGGC CACGGCCATA CCGGCGTCGC GAATTCCATC
CACTCGAACT ACTCGGATGT CGAGGGTTAT GTCGAGCAGA TGGTCGCGAC CTATGGCGAC
AAGCGCGCCC ATGAGATTCT GGGCGAAGTC AGGCACAACA CGATCTACTT CCCGAACATC
ATGGTGAAGG GGGCCGTGCA GATCTTGCGC AACTTCATCC CGATCGCGGT CGACAAGACC
CTGGTCGAGA GCTGGGTGTA CCGTCTTGTC GGCGCTCCCG ACAAGCTCTA CGAGCGCGCG
CTGATGTACA ACCGCTTCAT CAATGCACCG ACCTCGATCG TCGGGCATGA TGATCTCGAA
ATGTATGAGC GGGCTCAGGA GGGCCTGACA TCGAATGGCA ACCAGTGGGT CAATCTGCAG
CGGCTGCACG AGCCCGGCGA AGCGGATGAC GTCACCGCCG TCATCAATGG CACCTCGGAG
CGCCAGATGC GCAACCAGTT TCATGCCTGG GCCAAGTTTA TGACCATGAA CATGGACAAG
CGCGTCGAGG CCGCCGAATG A
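
A GC content figure like the one reported in Gene Information can be recomputed from the gene sequence. The sketch below is one plausible way to do it, assuming the value is the plain percentage of G and C bases over the whole CDS; `gc_content` is a hypothetical helper, and the database's exact rounding is not stated in this record.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases; whitespace between 10-mer blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage on the first two 10-base blocks of the gene sequence only;
# paste the full sequence to compare against the reported value.
print(gc_content("ATGGGAAAAT ACGGGAATGA"))
```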
 
Protein sequence
MGKYGNDAGA IMALVRKAEV HRDVYVDPEI FDLEMEHLFP NSWIYIGHAS QLSKPGDFIT 
ANVGRQPLMA SRHSDGAIHV FYNRCPHKGV KIASEPCGNT GKFFRCPYHA WSFKTDGSLL
AIPLKKGYEG TGFPDTQASE GLAKVKNVVV YRDFIFVRLS DAGLSFEDYF GESLSTIDNM
VDRSPEGKLA VLAAPIRYMH NCNWKMLVEN QTDTCHPMVA HESSAGTAVK VWKREQGDST
ETPMAVQLYA PFMSPYEFYE QSGIRIWPNG HGHTGVANSI HSNYSDVEGY VEQMVATYGD
KRAHEILGEV RHNTIYFPNI MVKGAVQILR NFIPIAVDKT LVESWVYRLV GAPDKLYERA
LMYNRFINAP TSIVGHDDLE MYERAQEGLT SNGNQWVNLQ RLHEPGEADD VTAVINGTSE
RQMRNQFHAW AKFMTMNMDK RVEAAE
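
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The following is a minimal sketch, not the database's own procedure: it builds the codon table from the standard NCBI ordering, which translation table 11 shares for amino-acid assignments (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). The `translate` helper is hypothetical.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence.
print(translate("ATGGGAAAAT ACGGGAATGA TGCCGGCGCC"))  # MGKYGNDAGA
print(translate("CGCGTCGAGG CCGCCGAATG A"))           # RVEAAE
```

Both outputs match the start and end of the protein sequence listed above.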