Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BMA10247_A1428 |
Symbol | betA |
ID | 4891088 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia mallei NCTC 10247 |
Kingdom | Bacteria |
Replicon accession | NC_009079 |
Strand | + |
Start bp | 1389993 |
End bp | 1391690 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640147695 |
Product | choline dehydrogenase |
Protein accession | YP_001078613 |
Protein GI | 126446210 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.783226 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC GCGAATTCGA CTACATCATC TGCGGCGCCG GCTCCGCGGG CAACGTGCTC GCGACGCGCC TCACCGAGGA TCCGGGCGTC ACCGTGCTGC TCCTCGAGGC GGGCGGCCCC GACTACCGTT TCGACTTCCG CACGCAGATG CCGGCGGCGC TCGCCTACCC GCTGCAAGGC CGCCGCTACA ACTGGGCGTA CGAAACCGAT CCCGAGCCGC ACATGAACCA CCGCCGGATG GAATGCGGGC GCGGCAAGGG GCTCGGCGGC TCGTCGCTGA TCAATGGGAT GTGCTACATC CGCGGCAATG CGCTCGATTA CGACAACTGG GCGACGCACA AGGGGCTCGA GGACTGGGCG TATCTCGATT GCCTGCCGTA CTTCAGGAAG GCCGAGACGC GCGACGTCGG CCCGAACGAC TATCACGGCG GTGACGGCCC GGTCTCCGTG ACGACGAGCA AGCCCGGCGT GAATCCGCTG TTCGAGGCGA TGGTCGAGGC CGGCGTGCAG GCCGGCTATC CGCGCACCGA CGATCTCAAC GGCTATCAGC AGGAAGGCTT CGGCCCGATG GACCGCACGG TCACGCCGCG CGGCCGCCGC GCCTCGACCG CGCGCGGCTA TCTCGACCAG GCGCGCGCGC GGCCGAATCT CGAAATCGTC ACGCACGCGC TTGCCGATCG CATCCTGTTC TCCGGCAAGC GCGCAACGGG CGTCACGTTC CTGCACGGCA GCGCGCGCGT CACCGCGCAC GCGCGCCGCG AAGTGCTCGT GTGCAGCGGC GCGATCGCAT CGCCGCAACT GCTGCAGCGC TCGGGCGTCG GCCCCGGCGA ATGGCTGCGC GAGCTCGACA TTCCGGTCGT GCTCGACCTG CCCGGCGTCG GCCGCAATCT GCAGGATCAC CTGGAGATGT ACATCCAGTT CGAATGCAAG GAGCCGGTAT CGCTATATCC GGCGCTCAAG TGGTGGAACC AGCCGAAGAT CGGCCTCGAA TGGATGCTCA ACGGCACCGG GCTCGGCGCG AGCAACCACT TCGAGGCGGG CGGCTTCATT CGCACCCGCG ACGACGATCC GTGGCCGAAC ATCCAATATC ACTTCCTGCC CGTCGCGATC AATTACAACG GCTCGAACGC GATCGAGATG CACGGCTTCC AGGCGCACGT CGGCTCGATG CGCTCGCCGA GCTGCGGGCG CGTGAAGCTG AAGTCGCGCG ACCCGCACGC GCATCCGAGC ATCCTGTTCA ATTACATGGC CGAGGCGCTC GACTGGCGCG AGTTCCGCGA CGCGATCCGC GCGACGCGCG AGATCATGCG GCAGCCCGCG CTCGACCGCT TCCGCGGCCG CGAGCTGAAC CCGGGCGCGG ATCTGAAAAG CGACAACGAG CTCGATACGT TCGTACGCGC GCGCGCAGAA ACGGCATTCC ATCCGTCATG CTCGTGCAAG ATGGGCTACG ACGACATGGC GGTGGTCGAC AACGAAGGCC GCGTGCACGG GATCGACGGA TTGCGGGTCG TCGACGCGTC GATCATGCCG ATCATCACGA CCGGCAATCT GAACGCACCG ACGATCATGA TCGCCGAGAA GATCGCCGAC CGGATCCGCA AGCACAAGCC GCTCGAACGC TCGAACGCGC AATACTACGT CGCGAACGGC GCGCCCGCGC GCGGCGGCAA GCCCGCGCGG GCGCCCGCCG TCGTATAG
|
Protein sequence | MTTREFDYII CGAGSAGNVL ATRLTEDPGV TVLLLEAGGP DYRFDFRTQM PAALAYPLQG RRYNWAYETD PEPHMNHRRM ECGRGKGLGG SSLINGMCYI RGNALDYDNW ATHKGLEDWA YLDCLPYFRK AETRDVGPND YHGGDGPVSV TTSKPGVNPL FEAMVEAGVQ AGYPRTDDLN GYQQEGFGPM DRTVTPRGRR ASTARGYLDQ ARARPNLEIV THALADRILF SGKRATGVTF LHGSARVTAH ARREVLVCSG AIASPQLLQR SGVGPGEWLR ELDIPVVLDL PGVGRNLQDH LEMYIQFECK EPVSLYPALK WWNQPKIGLE WMLNGTGLGA SNHFEAGGFI RTRDDDPWPN IQYHFLPVAI NYNGSNAIEM HGFQAHVGSM RSPSCGRVKL KSRDPHAHPS ILFNYMAEAL DWREFRDAIR ATREIMRQPA LDRFRGRELN PGADLKSDNE LDTFVRARAE TAFHPSCSCK MGYDDMAVVD NEGRVHGIDG LRVVDASIMP IITTGNLNAP TIMIAEKIAD RIRKHKPLER SNAQYYVANG APARGGKPAR APAVV
|
| |