Gene BURPS1106A_A1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A1837 
SymbolbetA 
ID4903949 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp1805115 
End bp1806812 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content67% 
IMG OID640144943 
Productcholine dehydrogenase 
Protein accessionYP_001075871 
Protein GI126457859 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID[TIGR01810] choline dehydrogenase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGACAC GCGAATTCGA CTACATCATC TGCGGCGCCG GCTCCGCGGG CAACGTGCTC 
GCGACGCGCC TCACCGAGGA TCCGGGCGTC ACCGTGCTGC TCCTCGAGGC GGGCGGCCCC
GACTACCGTT TCGACTTCCG CACGCAGATG CCGGCGGCGC TCGCCTACCC GCTGCAAGGC
CGCCGCTACA ACTGGGCGTA CGAAACCGAT CCCGAGCCGC ACATGAACCA CCGCCGGATG
GAATGCGGGC GCGGCAAGGG GCTCGGCGGC TCGTCGCTGA TCAATGGGAT GTGCTACATC
CGCGGCAATG CGCTCGATTA CGACAACTGG GCGACGCACA AGGGGCTCGA GGACTGGGCG
TATCTCGATT GCCTGCCGTA CTTCAGGAAG GCCGAGACGC GCGATGTCGG CCCGAACGAC
TATCACGGCG GTGACGGCCC GGTCTCCGTG ACGACGAGCA AGCCCGGCGT GAATCCGCTG
TTCGAGGCGA TGGTCGAGGC CGGCGTGCAG GCCGGCTATC CGCGCACCGA CGATCTCAAC
GGCTATCAGC AGGAAGGCTT CGGCCCGATG GACCGCACGG TCACGCCGCG CGGCCGCCGC
GCCTCGACCG CGCGCGGCTA TCTCGACCAG GCGCGCGCGC GGCCGAATCT CGAAATCGTC
ACGCACGCGC TTGCCGATCG CATCCTGTTC TCCGGCAAGC GCGCAACGGG CGTCACGTTC
CTGCACGGCA GCGCGCGCGT CACCGCGCAC GCGCGCCGCG AAGTGCTCGT GTGCAGCGGC
GCGATCGCAT CGCCGCAACT GCTGCAGCGC TCGGGCGTCG GCCCCGGCGA ATGGCTGCGC
GAGCTCGACA TTCCGGTCGT GCTCGACCTG CCCGGCGTCG GCCGCAATCT GCAGGATCAC
CTGGAGATGT ACATCCAGTT CGAATGCAAG GAGCCGGTAT CGCTATATCC GGCGCTCAAG
TGGTGGAACC AGCCGAAGAT CGGCCTCGAA TGGATGCTCA ACGGCACCGG GCTCGGCGCG
AGCAACCACT TCGAGGCGGG CGGCTTCATT CGCACCCGCG ACGACGATCC GTGGCCGAAC
ATCCAATATC ACTTCCTGCC CGTCGCGATC AATTACAACG GCTCGAACGC GATCGAGATG
CACGGCTTCC AGGCGCACGT CGGCTCGATG CGCTCGCCGA GCCGCGGGCG CGTGAAGCTG
AAGTCGCGCG ACCCGCACGC GCATCCGAGC ATCCTGTTCA ATTACATGGC CGAGGCGCTC
GACTGGCGCG AGTTCCGCGA CGCGATCCGC GCGACGCGCG AGATCATGCG GCAGCCCGCG
CTCGACCGCT TCCGCGGCCG CGAGCTGAAC CCGGGCGCGG ATCTGAAAAG CGACAACGAG
CTCGATACGT TCGTACGCGC GCGCGCGGAA ACGGCATTCC ATCCGTCATG CTCGTGCAAG
ATGGGCTACG ACGACATGGC GGTGGTCGAC AACGAAGGCC GCGTGCACGG GATCGACGGA
TTGCGGGTCG TCGACGCGTC GATCATGCCG ATCATCACGA CCGGCAATCT GAACGCACCG
ACGATCATGA TCGCCGAGAA GATCGCCGAC CGGATCCGCC AGCACAAGCC GCTCGAACGC
TCGAACGCGC AATACTACGT CGCGAACGGC GCGCCCGCGC GCGGCGGCAA GCCCGCGCGG
GCGCCCGCCG TCGTATAG
 
Protein sequence
MTTREFDYII CGAGSAGNVL ATRLTEDPGV TVLLLEAGGP DYRFDFRTQM PAALAYPLQG 
RRYNWAYETD PEPHMNHRRM ECGRGKGLGG SSLINGMCYI RGNALDYDNW ATHKGLEDWA
YLDCLPYFRK AETRDVGPND YHGGDGPVSV TTSKPGVNPL FEAMVEAGVQ AGYPRTDDLN
GYQQEGFGPM DRTVTPRGRR ASTARGYLDQ ARARPNLEIV THALADRILF SGKRATGVTF
LHGSARVTAH ARREVLVCSG AIASPQLLQR SGVGPGEWLR ELDIPVVLDL PGVGRNLQDH
LEMYIQFECK EPVSLYPALK WWNQPKIGLE WMLNGTGLGA SNHFEAGGFI RTRDDDPWPN
IQYHFLPVAI NYNGSNAIEM HGFQAHVGSM RSPSRGRVKL KSRDPHAHPS ILFNYMAEAL
DWREFRDAIR ATREIMRQPA LDRFRGRELN PGADLKSDNE LDTFVRARAE TAFHPSCSCK
MGYDDMAVVD NEGRVHGIDG LRVVDASIMP IITTGNLNAP TIMIAEKIAD RIRQHKPLER
SNAQYYVANG APARGGKPAR APAVV