Gene BURPS668_2918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2918 
SymbolguaD 
ID4885602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp2875722 
End bp2877032 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID640128846 
Productguanine deaminase 
Protein accessionYP_001059936 
Protein GI126441339 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.495945 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA CCGCTTTCCG CGCCCGCCTG CTGAGGTTCG ACGGCGACCC CGCGCAATCG 
GACGATGCGC TCGCGTACGA CGAGGACGGC CTGCTGATCG TCGAGAACGG ACGCGTCGTC
GCGGCGGGCG CCCATGCGGC GCTCGCCGCG CGCCTCGCGC CCGGCGCGAC GCTCGTCGAG
ATGCGCGACA AGCTGATCGC GCCCGGCTTC ATCGACACGC ACGTGCACTA TCCGCAGACC
GAAATGATCG CCTCGCCGGC GCCGGGCCTG CTGCCGTGGC TCGACCGCTA CACGTTCCCG
ACCGAGCGGC GCTTCGCCGA TCCCGCGCAT GCGCGCGACG TCGCCGAGTT CTTCCTCGAT
ACGCTGCTCG CGTGCGGCAC GACGACGGCG CTCGTCTACT GCACGGTGCA CAAGCAGTCG
GCCGACGCGC TGTTCGGCGC GAGCGAGGCG CGCGGCTTGC GGATGATCGC GGGCAAGGTG
CTGATGGACC GCCACTGCCC CGAGTTCCTG CGCGACACCG CGCAATCGGG CTACGACGAC
AGCGCCGAGC TGATCGCCCG CTGGCACGGC CACGGCCGGC AGTCGTACGC GCTCACGCCG
CGCTTCGCGC CGACATCGAC GCACGCGCAG CTCGAAGCGT GCGGCGCGCT CGCCCGGCTT
CATCCGGACG TGTTCATCCA GAGCCACGTC GCGGAGAATC TCGACGAGCT CCGCTGGGCG
GCCGAGCTGT TTCCCGAACG GCGCAGCTAT CTCGACGTCT ACGATCACTA CGGGCTGCTG
CGCCGTCGCG CCGTGTACGG CCACTGCATC CATCTCGACG ACGACGACCG CCGGCGCTTC
GCCGAAACGG GCGCGATCGC CGCGCACTGC CCGACATCGA ACCTGTTCCT CGGCAGCGGC
CTGTTCGATT TCGAGCGCGC GAACGCGCGG CACATGGCCG TCACGCTCGC GACCGACGTC
GGCGGCGGCA CATCGTTCTC GATGCTCCAA ACGATGAACG AAGCGCACAA GATCGCGCGG
ATGACGGGCC ATCACCTGAG CGCGACGCGC ATGTTCTGGC TCGCGACGGC AGGCGCCGCG
CACGCGCTCG ATCTCGCGGA CACGATCGGC ACGCTCGCGC CGCACGCGGA AGCCGACTTC
GTCGTGCTCG ATCCTGCCGC GACGCCGCTG CTCGCGCGCC GCACCGCGCG CGCGGAATCG
CTCGAGGAGC TGCTGTTCGC GCTCGCGCTG CTCGGCGACG ATCGCGCGGT CTATCGCACG
TATGCCGCCG GCCGCTGCGT GCACCGGCGC GACATCGCCG ACGCGGGCTG A
 
Protein sequence
MTQTAFRARL LRFDGDPAQS DDALAYDEDG LLIVENGRVV AAGAHAALAA RLAPGATLVE 
MRDKLIAPGF IDTHVHYPQT EMIASPAPGL LPWLDRYTFP TERRFADPAH ARDVAEFFLD
TLLACGTTTA LVYCTVHKQS ADALFGASEA RGLRMIAGKV LMDRHCPEFL RDTAQSGYDD
SAELIARWHG HGRQSYALTP RFAPTSTHAQ LEACGALARL HPDVFIQSHV AENLDELRWA
AELFPERRSY LDVYDHYGLL RRRAVYGHCI HLDDDDRRRF AETGAIAAHC PTSNLFLGSG
LFDFERANAR HMAVTLATDV GGGTSFSMLQ TMNEAHKIAR MTGHHLSATR MFWLATAGAA
HALDLADTIG TLAPHAEADF VVLDPAATPL LARRTARAES LEELLFALAL LGDDRAVYRT
YAAGRCVHRR DIADAG