Gene Information |
Locus tag | BBta_3235 |
Symbol | |
ID | 5153470 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 3389448 |
End bp | 3390788 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 640558101 |
Product | putative toluate 1,2-dioxygenase alpha subunit |
Protein accession | YP_001239248 |
Protein GI | 148254663 |
COG category | [P] Inorganic ion transport and metabolism [R] General function prediction only |
COG ID | [COG4638] Phenylpropionate dioxygenase and related ring-hydroxylating dioxygenases, large terminal subunit |
TIGRFAM ID | |
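The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (the record does not state this, but the numbers are consistent with it) and that the CDS ends in a single stop codon. The function names are illustrative only and do not come from any database API.

```python
# Values copied from the record above.
START_BP = 3389448
END_BP = 3390788
GENE_LENGTH_BP = 1341
PROTEIN_LENGTH_AA = 446


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1341
print(expected_protein_length(GENE_LENGTH_BP))  # 446
```

Both results agree with the Gene Length and Protein Length rows, which supports the inclusive-coordinate reading.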
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.358123 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAAAAT ACGGGAATGA TGCCGGCGCC ATCATGGCGC TGGTCCGCAA GGCCGAGGTT CATCGCGACG TCTATGTCGA TCCCGAGATC TTCGATCTCG AGATGGAGCA CCTGTTTCCG AACAGCTGGA TCTATATCGG GCATGCCAGC CAGCTTTCGA AACCAGGAGA TTTCATCACC GCCAATGTCG GCCGCCAGCC GCTGATGGCC AGCCGCCATA GCGATGGCGC CATCCATGTC TTCTACAACC GGTGTCCGCA TAAGGGCGTC AAGATCGCCT CGGAGCCGTG CGGCAACACT GGAAAATTCT TCCGCTGTCC CTACCACGCC TGGTCGTTCA AGACCGACGG TTCGCTGCTC GCCATCCCTT TGAAGAAGGG ATATGAGGGC ACCGGCTTTC CCGACACGCA AGCCAGCGAA GGATTGGCGA AGGTCAAGAA CGTCGTGGTC TATCGCGACT TCATCTTTGT CCGTCTGAGC GATGCAGGCC TGTCCTTCGA GGACTATTTC GGCGAAAGCC TCTCGACCAT CGACAACATG GTTGATCGCT CTCCGGAAGG AAAGCTCGCG GTCTTGGCCG CGCCGATCCG CTACATGCAT AACTGCAATT GGAAGATGCT GGTCGAGAAC CAGACCGATA CGTGTCATCC GATGGTTGCG CATGAGAGTT CAGCCGGTAC CGCGGTCAAG GTCTGGAAGC GCGAGCAGGG CGACTCGACG GAGACGCCGA TGGCGGTGCA GCTCTATGCT CCGTTCATGA GCCCTTACGA GTTCTACGAG CAGAGCGGCA TCAGGATCTG GCCGAACGGC CACGGCCATA CCGGCGTCGC GAATTCCATC CACTCGAACT ACTCGGATGT CGAGGGTTAT GTCGAGCAGA TGGTCGCGAC CTATGGCGAC AAGCGCGCCC ATGAGATTCT GGGCGAAGTC AGGCACAACA CGATCTACTT CCCGAACATC ATGGTGAAGG GGGCCGTGCA GATCTTGCGC AACTTCATCC CGATCGCGGT CGACAAGACC CTGGTCGAGA GCTGGGTGTA CCGTCTTGTC GGCGCTCCCG ACAAGCTCTA CGAGCGCGCG CTGATGTACA ACCGCTTCAT CAATGCACCG ACCTCGATCG TCGGGCATGA TGATCTCGAA ATGTATGAGC GGGCTCAGGA GGGCCTGACA TCGAATGGCA ACCAGTGGGT CAATCTGCAG CGGCTGCACG AGCCCGGCGA AGCGGATGAC GTCACCGCCG TCATCAATGG CACCTCGGAG CGCCAGATGC GCAACCAGTT TCATGCCTGG GCCAAGTTTA TGACCATGAA CATGGACAAG CGCGTCGAGG CCGCCGAATG A
Protein sequence | MGKYGNDAGA IMALVRKAEV HRDVYVDPEI FDLEMEHLFP NSWIYIGHAS QLSKPGDFIT ANVGRQPLMA SRHSDGAIHV FYNRCPHKGV KIASEPCGNT GKFFRCPYHA WSFKTDGSLL AIPLKKGYEG TGFPDTQASE GLAKVKNVVV YRDFIFVRLS DAGLSFEDYF GESLSTIDNM VDRSPEGKLA VLAAPIRYMH NCNWKMLVEN QTDTCHPMVA HESSAGTAVK VWKREQGDST ETPMAVQLYA PFMSPYEFYE QSGIRIWPNG HGHTGVANSI HSNYSDVEGY VEQMVATYGD KRAHEILGEV RHNTIYFPNI MVKGAVQILR NFIPIAVDKT LVESWVYRLV GAPDKLYERA LMYNRFINAP TSIVGHDDLE MYERAQEGLT SNGNQWVNLQ RLHEPGEADD VTAVINGTSE RQMRNQFHAW AKFMTMNMDK RVEAAE
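The relationship between the two sequences can be illustrated with a minimal translation sketch. It assumes the space-separated 10-character groups are purely a display convention and strips them before processing. Translation table 11 assigns the same amino acids to codons as the standard code (it differs in the permitted start codons), so the standard assignment string is used here. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical and written for this example.

```python
# Codon table built from the standard NCBI layout (base order T, C, A, G).
# Table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the display whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence above.
head = "ATGGGAAAAT ACGGGAATGA TGCCGGCGCC"
tail = "GCCGCCGAATGA"

print(translate(head))  # MGKYGNDAGA
print(translate(tail))  # AAE
```

The two outputs match the first ten and last three residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content row.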