Gene Information |
Locus tag | BBta_4299 |
Symbol | |
ID | 5149396 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bradyrhizobium sp. BTAi1 |
Kingdom | Bacteria |
Replicon accession | NC_009485 |
Strand | - |
Start bp | 4503452 |
End bp | 4504552 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 640559118 |
Product | putative alkanesulfonate monooxygenase (FMNH2-dependent aliphatic sulfonate monooxygenase) |
Protein accession | YP_001240255 |
Protein GI | 148255670 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | |
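The coordinate and length fields above are consistent with each other, and the check is simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 4503452
END_BP = 4504552


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    n = gene_length(START_BP, END_BP)
    print(n, protein_length(n))  # prints: 1101 366
```

Under these assumptions the result matches the listed Gene Length (1101 bp) and Protein Length (366 aa). The strand does not affect the length calculation.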
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0448772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00193589 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACGGCCA AGGATTCGTC CATCAAGTTC GCCTATTGGG TGCCCAACGT TTCGGGCGGA TTGGTGATCT CCAACATCGA GCAGCGGACG AGCTGGGATA TCGGCTACAA CAAGCGGCTG GCACAGATCG CCGAGGAGAG CGGGTTCGAC TACGCGCTGA GCCAGATCCG TTTCACGGCT GGCTATGGCG CCGAATATCA ACATGAATCG GTGTCGTTCA GCCATGCGCT GCTCGAAGCG ACGACGAAGC TGAAGGTGAT CGCCGCGATC CTGCCCGGAC CGTGGCATCC GGCGGTCGTC GCCAAGCAGA TCGCCACCAT CAATCATCTC ACCCAGGGGC GCGTTGCGGT GAACATCGTC AGCGGCTGGT TCCGCGGCGA GTTCGCCGCG ATTGGCGAGC CCTGGCTGGA TCACGATGAG CGCTATCGCC GCTCGGAGGA GTTCATCCGT GCGCTCCGGG GCATCTGGAC CGAGGACAAC TTCAATCTCC GCGGCGACTT CTATCGTTTC ACGAACTACT CGCTGAAGCC GAAGCCGATC GACCCGCAGC CGGAGATCTT CCAGGGCGGC AGCTCGCGTG CCGCCCGCGA CATGGCCTCG CGCGTCTCTG ACTGGTACTT CACCAACGGC AACTCGGTGG AGGGCATCAA GGCCCAGGTC GACGATATCA GGGCCAAGGC GAAGGCCAAC AATCACCAGG TCAAGATCGG CGTCAACGCG TTCGCGATCG CGCGCGACAC CGAGGCCGAG GCGCAGGCCG TGCTCAAGGA AATCATCGAC AACGCCAATC CCGAGGCCGT CCACGCCTTC GCGCACGAGG TGGCCAATGC CGGCAAGGCG TCGCCGGAGC GCGAGGGCAA TTGGGCGAAG TCGACCTTCG AGGATCTCGT CCAATACAAT GACGGCTTCA AGACCAACCT GATCGGCACG CCGCGGCAGA TTGCCGAGCG CATCGTCGCA CTGAAGGAGG TCGGCGTTGA CCTCGTGCTG CTTGGCTTCC TGCACTTCCA GGAGGAGGTC GCCTATTTCG GCAAGCACGT GATCCCGCTG GTGCGCGAGC TCGAGGCCAA GGCCGGCCTT CGCGTGCAGG CTGCCGAGTA G
Protein sequence | MTAKDSSIKF AYWVPNVSGG LVISNIEQRT SWDIGYNKRL AQIAEESGFD YALSQIRFTA GYGAEYQHES VSFSHALLEA TTKLKVIAAI LPGPWHPAVV AKQIATINHL TQGRVAVNIV SGWFRGEFAA IGEPWLDHDE RYRRSEEFIR ALRGIWTEDN FNLRGDFYRF TNYSLKPKPI DPQPEIFQGG SSRAARDMAS RVSDWYFTNG NSVEGIKAQV DDIRAKAKAN NHQVKIGVNA FAIARDTEAE AQAVLKEIID NANPEAVHAF AHEVANAGKA SPEREGNWAK STFEDLVQYN DGFKTNLIGT PRQIAERIVA LKEVGVDLVL LGFLHFQEEV AYFGKHVIPL VRELEAKAGL RVQAAE
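The gene sequence is listed in coding orientation (it starts with ATG and ends with TAG), so it can be translated directly without reverse-complementing. The following is a minimal sketch of how the GC content and the translation could be checked. It assumes the codon-to-residue mapping of NCBI translation table 11, which is the same as the standard code for internal codons. The helper names and the 30-base test fragment (the first three blocks of the gene sequence) are chosen for illustration only.

```python
# Codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Caveat: alternative start codons (e.g. GTG, TTG) are not converted to M
    here; this gene starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed above.
FRAGMENT = "ATGACGGCCA AGGATTCGTC CATCAAGTTC"

if __name__ == "__main__":
    print(translate(FRAGMENT))   # MTAKDSSIKF
    print(gc_content(FRAGMENT))  # 0.5
```

The translated fragment matches the first ten residues of the listed protein sequence. The 0.5 GC fraction applies only to this short fragment. Passing the full gene sequence to `gc_content` would be the way to compare against the 63% reported for the whole gene.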