Gene BURPS668_0216 details


Gene Information

Locus tag: BURPS668_0216
Symbol:
ID: 4885370
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 206607
End bp: 208250
Gene Length: 1644 bp
Protein Length: 547 aa
Translation table: 11
GC content: 69%
IMG OID: 640126144
Product: GMC family oxidoreductase
Protein accession: YP_001057269
Protein GI: 126440292
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2303] Choline dehydrogenase and related flavoproteins
TIGRFAM ID:
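
The coordinates and lengths listed above agree with each other, and a short sketch makes the arithmetic explicit. It assumes that the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative only and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp, End bp.
length_bp = gene_length(206607, 208250)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's coordinates this reproduces the listed gene length (1644 bp) and protein length (547 aa).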


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0487725
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTACG ACTACATCAT CGTCGGAGGC GGCTCGGGGG GCGCGAGTCT CGCGGGGCGT 
CTCGCCGACG CGTGCCCGGA CGCGACGATC GCGCTGATCG AGGCGGGCGG CCACACCGAA
CGCAATCTGC TCGTCAACAT GCCGGTGGGG ATCGCCGCGC TCGTGCCGTT CAAGCTCGGC
ACGAACTACG GCTACGAAAC GGTGCCGCAG CCCGGCCTCG GCGGGCGCCG CGGCTATCAG
CCCCGCGGCC GCGGGCTCGG CGGCTCGAGC GCGATCAACG CGATGATCTA CACGCGCGGC
CATCCGCTCG ATTATGACGA ATGGGAGCAG CTCGGCTGCA CCGGCTGGGG CTGGCGCGAC
GTGCTGCCGT ATTTCCGGCG CGCCGAAGGC AACGCGCGCG GCGCGAACGA ATGGCACGGC
GCCGACGGCC CGCTCACGGT ATCCGATCTG CGCTTTCGTA ATCCGTTCTC CGAACGATTC
ATCGCGGCCG CGCATGAGGC CGGCTATCCG CTGAACGACG ATTTCAACGG CGAGCATCAG
GAGGGCGTGG GCTTCTACCA GGTCACGCAT CGCGACGGCT CGCGCTGCAG CGTCGCGCGC
GCCTACGTGT ACGGCCGCAC GCGGCCGAAC CTGCACGTGA TCGTCGACGC GACGGTGCTG
CGCGTCGTGT TCGACGGCAA GCGCGCGACG GGCGTCGAGT TCGCGCGCGC CGGGCGCACC
GAGCAGCTTG CCGCGCGCGC GGAAGTGATT CTGTCCGCCG GCGCGTTCAA TACGCCGCAA
TTGCTGATGT GCTCGGGCGT CGGCCCCGCC GCGCAACTGC GCCGGCACGG CGTCGCGCTC
GTGCACGATG CGCCCGACGT CGGCGAGAAC CTGATCGATC ACATCGATTT CATCATCAAC
AAGCGCGTGA ATTCGTCGGA GCTCGTCGGC ATCTGCATGC GCGGCATCGC GAAGATGACG
CCCGCGCTGT TCAGCTATCT GTCCGGGCGT CGCGGAATGA TGACGAGCAA TGTCGCGGAG
GCGGGCGGCT TCATCAAGAG CGAACCGGGG CTCGATCGTC CCGATCTGCA ATTGCATTTC
TGCACCGCGC TCGTCGACGA TCACAACCGC AACATGCACT GGGGCTTCGG CTATTCGCTG
CACGTGTGCG CGCTGCGGCC GAAGAGCCGC GGCAACGTCG CGCTCGCAAG CGGCGACGCG
CGCGTCGCGC CGCTCATCGA TCCGCGCTTC TTCAGCGACG AACGCGATCT CGACCTGCTC
GTGACGGGCG CGAAGGCGAT GCGCAGAATC CTCTGCGCCG CGCCGCTCGC GTCGCAGGGC
GGGCGCGAGC TGTATACCGA TCCGGGCGAT ACCGATGCGC AATTGCGCGC GGCGATCGTC
GCGCATGCGG ACACGATCTA CCACCCGGTC GGCACGTGCC GGATGGGCAC CGATGCGCGC
GCGGTCGTCG ATCCGCAATT GCGCGTGAAA GGGGTGGACG GGCTGCGGGT GGTCGATGCT
TCGGTGATGC CGACGCTCAT CGGCGGCAAC ACGAACGCGC CGACCGTGAT GATCGCCGAG
CGCGCGGCCG ATTTCATCGT GGCCGCGCGC AACGGCCAGG CCGCGCCCAT GCGCGAGCGA
ATCGCGGCGA CGCACGGCGG CTGA
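
The gene sequence is printed in blocks of ten bases, so it has to be joined before values such as the GC content or the start and stop codons can be checked. The following is a minimal sketch of those checks. The helper names are hypothetical, and the start codon set is only a common subset of those permitted by translation table 11.

```python
def clean_sequence(blocked: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def has_start_and_stop(
    seq: str,
    starts=("ATG", "GTG", "TTG"),  # assumed subset of table 11 start codons
    stops=("TAA", "TAG", "TGA"),   # stop codons in table 11
) -> bool:
    """True if the sequence begins with a start codon and ends with a stop codon."""
    return seq[:3] in starts and seq[-3:] in stops


# Illustration using only the first block and the last line of the record;
# paste the full sequence here to check the whole gene.
fragment = clean_sequence("ATGCAGTACG ATCGCGGCGA CGCACGGCGG CTGA")
print(len(fragment), has_start_and_stop(fragment))
```

Applied to the complete sequence, `len(clean_sequence(...))` should match the listed gene length and `gc_fraction(...)` should be close to the listed GC content.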
 
Protein sequence
MQYDYIIVGG GSGGASLAGR LADACPDATI ALIEAGGHTE RNLLVNMPVG IAALVPFKLG 
TNYGYETVPQ PGLGGRRGYQ PRGRGLGGSS AINAMIYTRG HPLDYDEWEQ LGCTGWGWRD
VLPYFRRAEG NARGANEWHG ADGPLTVSDL RFRNPFSERF IAAAHEAGYP LNDDFNGEHQ
EGVGFYQVTH RDGSRCSVAR AYVYGRTRPN LHVIVDATVL RVVFDGKRAT GVEFARAGRT
EQLAARAEVI LSAGAFNTPQ LLMCSGVGPA AQLRRHGVAL VHDAPDVGEN LIDHIDFIIN
KRVNSSELVG ICMRGIAKMT PALFSYLSGR RGMMTSNVAE AGGFIKSEPG LDRPDLQLHF
CTALVDDHNR NMHWGFGYSL HVCALRPKSR GNVALASGDA RVAPLIDPRF FSDERDLDLL
VTGAKAMRRI LCAAPLASQG GRELYTDPGD TDAQLRAAIV AHADTIYHPV GTCRMGTDAR
AVVDPQLRVK GVDGLRVVDA SVMPTLIGGN TNAPTVMIAE RAADFIVAAR NGQAAPMRER
IAATHGG