Gene BURPS1710b_3023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_3023 
SymbolguaD 
ID3690857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp3333262 
End bp3334572 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID637729479 
Productguanine deaminase 
Protein accessionYP_334402 
Protein GI76811969 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.494486 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA CCGCTTTCCG CGCCCGCCTG CTGAGGTTCG ACGGCGACCC CGCGCAATCG 
GACGATGCGC TCGCGTACGA CGAGGACGGC CTGCTGATCG TCGAGAACGG GCGCGTCGTC
GCGGCGGGCG CCCATGCGGC GCTCGCCGCG CGCCTCGCGC CCGGCGCGAC GCTCGTCGAG
ATGCGCGACA AGCTGATCGC GCCCGGCTTC ATCGACACGC ACGTGCACTA TCCGCAGACC
GAAATGATCG CCTCGCCGGC GCCGGGCCTG TTGCCGTGGC TCGACCGCTA CACGTTCCCG
ACCGAGCGGC GCTTCGCCGA TCCCGCGCAT GCGCGCGACG TCGCCGAGTT CTTCCTCGAT
ACGCTGCTCG CGTGCGGCAC GACGACGGCG CTCGTCTACT GCACGGTGCA CAAGCAGTCG
GCCGACGCGC TGTTCGGCGC GAGCGAGGCG CGCGGCTTGC GGATGATCGC GGGCAAGGTG
CTGATGGACC GCCACTGCCC CGAGTTCCTG CGCGACACCG CGCAATCGGG CTACGAAGAC
AGCGCCGAGC TGATCGCCCG CTGGCACGGC CACGGCCGGC AGTCGTACGC GCTCACGCCG
CGCTTCGCGC CGACATCGAC GCACGCACAG CTCGAAGCGT GCGGCGCGCT CGCCCGGCTT
CATCCGGACG TGTTCATCCA GAGCCACGTC GCGGAGAATC TCGACGAGCT CCGCTGGGCG
GCCGAGCTGT TTCCCGAGCG GCGCAGCTAT CTCGATGTCT ACGATCACTA CGGGCTGCTG
CGCCGTCGCG CCGTGTACGG CCACTGCATC CATCTCGACG ACGACGACCG CCGGCGCTTC
GCCGAAACGG GCGCGATCGC CGCGCACTGC CCGACGTCGA ACCTGTTCCT CGGCAGCGGC
CTGTTCGATT TCGAGCGCGC GAACGCGCGG CACATGGCCG TCACGCTCGC GACCGACGTC
GGCGGCGGCA CATCGTTCTC GATGCTCCAA ACGATGAACG AAGCGCACAA GATCGCGCGG
ATGACGGGCC ATCACCTGAG CGCGACGCGC ATGTTCTGGC TCGCGACGGC AGGCGCCGCG
CACGCGCTCG ATCTCGCGGA CACGATCGGC ACGCTCGCGC CGCACGCGGA AGCCGACTTC
GTCGTGCTCG ATCCTGCCGC GACGCCGCTG CTCGCGCGCC GCACCGCGCG CGCGGAATCG
CTCGAGGAGC TGCTGTTCGC GCTCGCGCTG CTCGGCGACG ATCGCGCGGT CTATCGCACG
TATGCCGCCG GCCGCTGCGT GCACCGGCGC GACATCGCCG ACGCGGGCTG A
 
Protein sequence
MTQTAFRARL LRFDGDPAQS DDALAYDEDG LLIVENGRVV AAGAHAALAA RLAPGATLVE 
MRDKLIAPGF IDTHVHYPQT EMIASPAPGL LPWLDRYTFP TERRFADPAH ARDVAEFFLD
TLLACGTTTA LVYCTVHKQS ADALFGASEA RGLRMIAGKV LMDRHCPEFL RDTAQSGYED
SAELIARWHG HGRQSYALTP RFAPTSTHAQ LEACGALARL HPDVFIQSHV AENLDELRWA
AELFPERRSY LDVYDHYGLL RRRAVYGHCI HLDDDDRRRF AETGAIAAHC PTSNLFLGSG
LFDFERANAR HMAVTLATDV GGGTSFSMLQ TMNEAHKIAR MTGHHLSATR MFWLATAGAA
HALDLADTIG TLAPHAEADF VVLDPAATPL LARRTARAES LEELLFALAL LGDDRAVYRT
YAAGRCVHRR DIADAG