Gene Information |
Locus tag | BBta_4401 |
Symbol | |
ID | 5153106 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | + |
Start bp | 4614325 |
End bp | 4615215 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640559209 |
Product | TauD/TfdA family dioxygenase |
Protein accession | YP_001240346 |
Protein GI | 148255761 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
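The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive (the convention that reproduces the listed 891 bp) and that the protein length excludes the terminal stop codon. The function names `gene_length` and `protein_length` are illustrative and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 4614325
END_BP = 4615215


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 891 296
```

The result matches the Gene Length (891 bp) and Protein Length (296 aa) fields.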
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.878457 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0785293 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGTCG AGACCCGCAT GCCCCCGGCC GCTCGGTCAG CCAATATCGA GATTGTTCCG ACGGACCGGT CGTTGGGAGC CGAGATCCGC AATGTCGATC TCCGGCAACT GGACGACGCG GCCTTCGCCG CCGTGCTTCG CGCCTTCCAC ACTCATTCCG TACTGCTGGT CCGCGGACAA CACCTGTCTG ACCAGGATCT GATCGCCTTC AGCCGCCGCT TCGGCGATCT CGACTGGGCG CCAGTGCAGG AGAATGGCCG GCGCTTCGTC GAGGGCCTCC CGGAAATCTA TATCGTCTCG AATGTGAAGG TGAACGGCGA GGCGATCGGC AGCCTCGGCG CAGGCGAGGC CGTGTGGCAC ACTGACATGT CCTATCTCGA GACGCCGCCG ATCGCGAGCG CGCTCTATGC GCTGGAGATT CCTCCCGTCG GCGGCAACAC CTCGTTCTGC AGCATGTACG CGGTCTACGA CGCGCTGCCG ACCGAACTGA AGCATCGCAT CGCGGATCTC AAGATCAAGC ACGACGGCAC CTACAACAGC GGCGGCTTCG TGCGGCAGGG CGTGACGCCG ACGGATGATC CGCGGAGCTC GCCGGGCGCT GTGCATCCGC TGGTCTGTAC GCATCCGGAT TCCGGCCGGC AGATGCTGTA TCTCGGCCGC CGGCGCAACG CCTATCTGGT CGGCCTGGAG CTCGCCGAGT CGGAAGCGCT GCTTGATGAA TTATGGACCT ATGTCGCGCG CCCGGAGTTC GCCTGGGAGC ACGTCTGGCA GGTTGGCGAT CTCGTCATCT GGGACAACCG CTCCACGATG CATCGACGCG ATCCGTTCGA CGATCAGGCG CGGCGAATCA TGCACCGAAC CCAGATCAAG GGAACAGAGC GCCCGCAGTG A
|
Protein sequence | MNVETRMPPA ARSANIEIVP TDRSLGAEIR NVDLRQLDDA AFAAVLRAFH THSVLLVRGQ HLSDQDLIAF SRRFGDLDWA PVQENGRRFV EGLPEIYIVS NVKVNGEAIG SLGAGEAVWH TDMSYLETPP IASALYALEI PPVGGNTSFC SMYAVYDALP TELKHRIADL KIKHDGTYNS GGFVRQGVTP TDDPRSSPGA VHPLVCTHPD SGRQMLYLGR RRNAYLVGLE LAESEALLDE LWTYVARPEF AWEHVWQVGD LVIWDNRSTM HRRDPFDDQA RRIMHRTQIK GTERPQ
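The sequences above are shown in space-separated blocks of ten characters. The following minimal sketch shows one way to strip that formatting, compute GC content, and translate a coding sequence. It assumes plain A/C/G/T input and uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ only in the permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 uses the same
# amino-acid assignments and differs only in allowed start codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG, so that does not matter for this record.
    """
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
print(translate("ATGAATGTCG AGACCCGCAT GCCCCCGGCC"))  # prints: MNVETRMPPA
```

The translated prefix agrees with the first block of the protein sequence. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein Length fields to be checked the same way.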