Gene BMASAVP1_A2609 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBMASAVP1_A2609 
SymbolguaD 
ID4681900 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia mallei SAVP1 
KingdomBacteria 
Replicon accessionNC_008785 
Strand
Start bp2592260 
End bp2593570 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content71% 
IMG OID639846868 
Productguanine deaminase 
Protein accessionYP_993909 
Protein GI121599661 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.694405 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCAAA CCGCTTTCCG CGCCCGCCTG CTGAGGTTCG ACGGCGACCC CGCGCAATCG 
GACGATGCGC TCGCGTACGA CGAGGACGGC CTGCTGATCG TCGAGAACGG GCGCGTCGTC
GCGGCGGGCG CCCATGCGGC GCTCGCCGCG CGCCTCGCGC CCGGCGCGAC GCTCGTCGAG
ATGCGCGACA AGCTGATCGC GCCCGGCTTC ATCGACACGC ACGTGCACTA TCCGCAGACC
GAAATGATCG CCTCGCCGGC GCCGGGCCTG TTGCCGTGGC TCGACCGCTA CACGTTCCCG
ACCGAGCGGC GCTTCGCCGA TCCCGCGCAT GCGCGCGACG TCGCCGAGTT CTTCCTCGAT
ACGCTGCTCG CGTGCGGCAC GACGACGGCG CTCGTCTACT GCACGGTGCA CAAGCAATCG
GCCGACGCGC TGTTCGGCGC GAGCGAGGCG CGCGGCTTGC GGATGATCGC GGGCAAGGTG
CTGATGGACC GCCACTGCCC CGAGTTCCTG CGCGACACCG CGCAATCGGG CTACGACGAC
AGCGCCGAGC TGATCGCCCG CTGGCACGGC CACGGCCGGC AGTCGTACGC GCTCACGCCG
CGCTTCGCGC CGACATCGAC GCACGCGCAG CTCGAAGCGT GCGGCGCGCT CGCCCGGCTT
CATCCGGACG TGTTCATCCA GAGCCACGTC GCGGAGAATC TCGACGAGCT CCGCTGGGCG
GCCGAGCTGT TTCCCGAGCG GCGCAGCTAT CTCGATGTCT ACGATCACTA CGGGCTGCTG
CGCCGTCGCG CCGTGTACGG CCACTGCATC CATCTCGACG ACGACGACCG CCGGCGCTTC
GCCGAAACGG GCGCGATCGC CGCGCACTGC CCGACGTCGA ACCTGTTCCT CGGCAGCGGC
CTGTTCGATT TCGAGCGCGC GAACGCGCGG CACATGGCCG TCACGCTCGC GACCGACGTC
GGCGGCGGCA CATCGTTCTC GATGCTCCAA ACGATGAACG AAGCGCACAA GATCGCGCGG
ATGACGGGCC ATCACCTGAG CGCGACGCGC ATGTTCTGGC TCGCGACGGC AGGCGCCGCG
CACGCGCTCG ATCTCGCGGA CACGATCGGC ACGCTCGCGC CGCACGCGGA AGCCGACTTC
GTCGTGCTCG ATCCTGCCGC GACGCCGCTG CTCGCGCGCC GCACCGCGCG CGCGGAATCG
CTCGAGGAGC TGCTGTTCGC GCTCGCGCTG CTCGGCGACG ATCGCGCGGT CTATCGCACG
TATGCCGCCG GCCGCTGCGT GCACCGGCGC GACATCGCCG ACGCGGGCTG A
 
Protein sequence
MTQTAFRARL LRFDGDPAQS DDALAYDEDG LLIVENGRVV AAGAHAALAA RLAPGATLVE 
MRDKLIAPGF IDTHVHYPQT EMIASPAPGL LPWLDRYTFP TERRFADPAH ARDVAEFFLD
TLLACGTTTA LVYCTVHKQS ADALFGASEA RGLRMIAGKV LMDRHCPEFL RDTAQSGYDD
SAELIARWHG HGRQSYALTP RFAPTSTHAQ LEACGALARL HPDVFIQSHV AENLDELRWA
AELFPERRSY LDVYDHYGLL RRRAVYGHCI HLDDDDRRRF AETGAIAAHC PTSNLFLGSG
LFDFERANAR HMAVTLATDV GGGTSFSMLQ TMNEAHKIAR MTGHHLSATR MFWLATAGAA
HALDLADTIG TLAPHAEADF VVLDPAATPL LARRTARAES LEELLFALAL LGDDRAVYRT
YAAGRCVHRR DIADAG