Gene Information |
Locus tag | BTH_I2501 |
Symbol | |
ID | 3849421 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007651 |
Strand | - |
Start bp | 2856712 |
End bp | 2858142 |
Gene Length | 1431 bp |
Protein Length | 476 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637842170 |
Product | alkanesulfonate monooxygenase |
Protein accession | YP_443021 |
Protein GI | 83720974 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03565] alkanesulfonate monooxygenase, FMNH(2)-dependent |
|
|
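The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function name `check_cds_lengths` and its dictionary keys are hypothetical and not part of the source database.

```python
def check_cds_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinate, gene-length, and protein-length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a gene length that counts the stop codon.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(span, 3)   # whole codons, leftover bases
    expected_protein = codons - 1         # the stop codon encodes no residue
    return {
        "span_bp": span,
        "span_matches_gene_length": span == gene_length_bp,
        "whole_codons": remainder == 0,
        "expected_protein_aa": expected_protein,
        "protein_length_matches": expected_protein == protein_length_aa,
    }


# Values copied from the record for BTH_I2501.
report = check_cds_lengths(2856712, 2858142, 1431, 476)
print(report)
```

Under these assumptions the record is internally consistent: the span is 1431 bp, which is 477 codons, or 476 residues plus one stop codon.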
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGAGCAAC ATTATAGGGG CCCGTGCGCG CATATTTGCC GCGGCGGCAC GCTATGCACG AAAGCGGAAA ACCTAAGCTC GAAAAAATCG TTCCTATACG AAACCGCGCT CCCTAGACTG ATTTCCCGAG AGACGGCCGG GCCGCCCGCC GCGCGACAGC GCGAGCGGAT TGCGCGATCC GCCGCGGCGC GTATTTCCAC CGCTCATCCG CGAGCGACGC GCGCGCCGCG TCGCCGCCGT TCATTTTTTG CACGAACAGT CAGGCAGGAG CTTCGCATGA ATGTGTTCTG GTTCATCCCC ACGCACGGCG ACAGCCGCTA TCTCGGCACG GCCGAGGGCG CGCGCGCCGC GGATTACGAC TACTTCAGGC AGGTCGCGGT GGCGGCCGAT ACGCTCGGCT ACGACGGCGT GCTGCTGCCG ACGGGCCGTT CGTGCGAGGA TGCGTGGGTC GTCGCGTCGA GCCTGATTCC GGCGACGAAG CGCCTGAAAT TCCTCGTCGC GATCCGGCCC GGCCTGTCGT CGCCGGGGCT GTCCGCGCGG ATGGCGTCGA CGTTCGATCG GCTCTCCGGC GGGCGTCTGC TGATCAACGT CGTGACGGGC GGCGATTCGG CCGAGCTCGA AGGCGACGGT CTCTTTGCCG ATCACGACAC GCGCTACGCG CTCACCGACG ACTTCCTGCA CATCTGGCGC AAGCTGCTCG CCGAATCGCA CGAGAACGGC AGCGTCGATT TCGACGGCGA GCATCTGCGC GCGAAGGGCG GCAAGCTGCT GTATCCGCCC ATCCAGCATC CGCATCCGCC GCTGTGGTTC GGCGGCTCGT CGCCCGCCGC GCACGCGATC GCGGCCGATC ACATCGAGAC TTATCTGACC TGGGGCGAAC CGCCCGCGGC GGTCGCGAAG AAGATCGCCG ACATACGCGC GCGCGCGGCC GAGCGCGGCC GCGAGATCAG GTTCGGAATT CGCCTGCACG TGATCGTGCG CGAGACCGAG GAGGAGGCGT GGCGCGACGC GGATCGCCTC ATCAGCCGGC TCGACGACGA CACGATCGCG CGCGCGCAGC AGGCGTTTGC GAAGATGGAC TCCGAAGGGC AGCGCCGGAT GGCCGCGCTG CACGGCGGCA AGCGCGGCTC GCGCCAGGAG CTCGAGATCT ATCCGAACCT GTGGGCCGGC GTCGGGCTCG TGCGCGGCGG CGCGGGGACG GCGCTCGTCG GGAATCCCGA GCAAGTCGCC GCGCGCATGC GCGAGTATGC GGCGCTCGGT ATCGAGACGT TCATCCTGTC CGGCTATCCG CATCTCGAGG AATCGTACCG CTTCGCCGAG CTCGTGTTTC CGCTCGTCAA GGGCGGCGAC GCGCGGCGCG CGGGGCCGCT GTCGGGGCCG TTCGGCGAAG TCGTCGGCAA CGGGTATTTG CCGAAGGTGA GCCAGAGCTG A
|
Protein sequence | MEQHYRGPCA HICRGGTLCT KAENLSSKKS FLYETALPRL ISRETAGPPA ARQRERIARS AAARISTAHP RATRAPRRRR SFFARTVRQE LRMNVFWFIP THGDSRYLGT AEGARAADYD YFRQVAVAAD TLGYDGVLLP TGRSCEDAWV VASSLIPATK RLKFLVAIRP GLSSPGLSAR MASTFDRLSG GRLLINVVTG GDSAELEGDG LFADHDTRYA LTDDFLHIWR KLLAESHENG SVDFDGEHLR AKGGKLLYPP IQHPHPPLWF GGSSPAAHAI AADHIETYLT WGEPPAAVAK KIADIRARAA ERGREIRFGI RLHVIVRETE EEAWRDADRL ISRLDDDTIA RAQQAFAKMD SEGQRRMAAL HGGKRGSRQE LEIYPNLWAG VGLVRGGAGT ALVGNPEQVA ARMREYAALG IETFILSGYP HLEESYRFAE LVFPLVKGGD ARRAGPLSGP FGEVVGNGYL PKVSQS
|
| |
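The sequences above are displayed in space-separated blocks of ten characters, so they need to be rejoined before any analysis. The sketch below shows one way to do this and to run two basic checks: the GC fraction, and whether the string has the outward shape of a coding sequence. The helper names are hypothetical, and the start-codon set is an assumption covering only the common bacterial initiators (the gene in this record begins with TTG and ends with TGA). The demo uses a short toy sequence, not the full gene.

```python
# Assumption: common bacterial initiators only; translation table 11 allows a few more.
BACTERIAL_STARTS = {"ATG", "GTG", "TTG"}
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(text):
    """Remove all whitespace (block separators, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq):
    """Check length divisible by 3, a recognised start codon, and a terminal stop codon."""
    return len(seq) % 3 == 0 and seq[:3] in BACTERIAL_STARTS and seq[-3:] in STOPS


# Toy example in the same blocked layout as the record.
toy = clean_sequence("TTGGAG caatga\n")
print(toy, looks_like_cds(toy), round(gc_fraction(toy), 3))
```

Applying `clean_sequence` to the Gene sequence field and passing the result to `gc_fraction` would allow a comparison with the GC content value listed in the record.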