Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BTH_II1072 |
Symbol | betA |
ID | 3846715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | - |
Start bp | 1247285 |
End bp | 1248982 |
Gene Length | 1698 bp |
Protein Length | 565 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637838375 |
Product | choline dehydrogenase |
Protein accession | YP_439269 |
Protein GI | 83717344 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2303] Choline dehydrogenase and related flavoproteins |
TIGRFAM ID | [TIGR01810] choline dehydrogenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00240234 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGACAC GCGAATTCGA CTACATCATC TGCGGCGCGG GCTCCGCGGG CAACGTGCTC GCGACGCGCC TCACCGAGGA CCCGGACGTC ACCGTGCTGC TCCTCGAGGC GGGCGGCCCC GACTACCGGT TCGACTTCCG CACGCAGATG CCGGCGGCGC TCGCGTACCC GCTGCAGGGC CGCCGCTACA ACTGGGCATA CGAAACCGAT CCCGAGCCGC ACATGAACAA CCGCCGGATG GAGTGCGGGC GCGGCAAGGG GCTCGGCGGC TCGTCGCTGA TCAACGGGAT GTGCTATATC CGCGGCAACG CGCTCGATTA CGACAACTGG TCGACGCACA AGGGCCTCGA GGACTGGACG TATCTCGACT GCCTGCCGTA CTTCAGGAAG GCCGAGACGC GCGACGTCGG CCCGAACGAT TATCACGGCG GCGACGGCCC GGTCTCGGTG ACGACGAGCA AGCCCGGCGT GAATCCGCTG TTCGAGGCGA TGGTCGAGGC CGGCGTGCAG GCCGGCTATC CGCGCACCGA CGACCTCAAC GGCTACCAGC AGGAAGGCTT CGGCCCGATG GACCGCACGG TCACGCCGCG CGGCCGCCGC GCCTCGACCG CGCGCGGCTA TCTCGACCAG GCGCGGGCGC GGCCGAATCT CGAAATCGTC ACGCACGCGC TCGCCGATCG CATCCTGTTC TCCGGCAAGC GCGCGACGGG CGTCACGTTC CTGCACGGCA GCGCGCGCGT CACCGCGCAC GCGCGCCGCG AGGTGCTCGT GTGCAGCGGC GCGATCGCCT CGCCGCAACT GCTGCAGCGC TCGGGCGTCG GCCCCGGCGA ATGGCTGCGC GAGCTCGACA TTCCCGTCGT GCTCGATCTG CCCGGCGTCG GCCGCAATCT GCAGGACCAC CTGGAGATGT ACATCCAGTT CGAGTGCAAG GAGCCGGTAT CGCTGTATCC GGCGCTCAAG TGGTGGAACC AGCCGAAGAT CGGTCTCGAT TGGATGATCA ACGGCACAGG CCTCGGCGCG AGCAACCACT TCGAAGCGGG CGGTTTCATC CGCACGCGCG ACGACGATCT GTGGCCGAAC ATCCAATATC ACTTCCTGCC CGTCGCGATC AACTACAACG GCTCGAATGC GATCGAGATG CACGGCTTCC AGGCGCACGT CGGCTCGATG CGCTCGCCGA GCCGCGGCCG CGTGAAGCTG AAATCGCGCG ATCCGAACGC GCATCCGAGC ATCCTGTTCA ACTACATGGC CGAAGCGCTC GACTGGCGCG AATTCCGCGA CGCGATTCGC GCGACGCGCG AGATCATGCA CCAGCCGGCG CTCGACCGCT TCCGCGGCCG CGAGCTGAAC CCGGGCGCGG ACCTGAAGAG CGACAACGAG CTCGACGCGT TCGTGCGCGC GCGCGCGGAA ACGGCGTTCC ATCCGTCGTG CTCGTGCAAG ATGGGCTACG ACGACATGGC GGTCGTCGAC AATGAAGGCC GCGTGCACGG GATCGACGGA TTGCGGGTTG TCGATGCGTC GATCATGCCG ATCATCACGA CCGGCAATCT GAACGCGCCG ACGATCATGA TCGCCGAGAA GATCGCCGAC AAGATCCGCA AGCGAAAGCC GCTCGAACGG TCGAACGCGC GATATTACGT CGCGAACGGC GCGCCCGCGC GCGGCGGCAA GCCTGCCCGA GCGCCCGCCG CCGTGTAA
|
Protein sequence | MTTREFDYII CGAGSAGNVL ATRLTEDPDV TVLLLEAGGP DYRFDFRTQM PAALAYPLQG RRYNWAYETD PEPHMNNRRM ECGRGKGLGG SSLINGMCYI RGNALDYDNW STHKGLEDWT YLDCLPYFRK AETRDVGPND YHGGDGPVSV TTSKPGVNPL FEAMVEAGVQ AGYPRTDDLN GYQQEGFGPM DRTVTPRGRR ASTARGYLDQ ARARPNLEIV THALADRILF SGKRATGVTF LHGSARVTAH ARREVLVCSG AIASPQLLQR SGVGPGEWLR ELDIPVVLDL PGVGRNLQDH LEMYIQFECK EPVSLYPALK WWNQPKIGLD WMINGTGLGA SNHFEAGGFI RTRDDDLWPN IQYHFLPVAI NYNGSNAIEM HGFQAHVGSM RSPSRGRVKL KSRDPNAHPS ILFNYMAEAL DWREFRDAIR ATREIMHQPA LDRFRGRELN PGADLKSDNE LDAFVRARAE TAFHPSCSCK MGYDDMAVVD NEGRVHGIDG LRVVDASIMP IITTGNLNAP TIMIAEKIAD KIRKRKPLER SNARYYVANG APARGGKPAR APAAV
|
| |