Gene RSc2099 details

Gene Information

Locus tag: RSc2099
Symbol: guaD
ID: 1220940
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ralstonia solanacearum GMI1000
Kingdom: Bacteria
Replicon accession: NC_003295
Strand:
Start bp: 2273388
End bp: 2274725
Gene Length: 1338 bp
Protein Length: 445 aa
Translation table: 11
GC content: 70%
IMG OID: 637238494
Product: guanine deaminase
Protein accession: NP_520220
Protein GI: 17546818
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID: [TIGR02967] guanine deaminase
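
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a protein length that excludes the terminal stop codon; both are assumptions about how the database reports these fields, and the variable names are illustrative only.

```python
# Values copied from the gene record.
start_bp = 2273388
end_bp = 2274725
gene_length_bp = 1338
protein_length_aa = 445

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not counted
# in the reported protein length.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks pass: the span is 1338 bp, which is 446 codons, giving 445 amino acids plus one stop codon.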


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.716358
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACCT TGCCGCCCGC GCCCGACGCC ACCCTCCACG CCTACCGCGG CCGCGTGCTG 
CATTTCCTGC ACGACCCGCA GTATCGCGAG CACGATGCCT ATCAGTACTG GGAAGACGGG
CTGCTGGTCA TCGGCGCCGG CAAGGTCGTG CGGGCCGGCG ACCACGCCGC GCTCAAGGGC
ACCCTGCCCG CCGGCGCGCA GGTGCACGAC TACAGCGGCA AGCTGATCGT CCCCGGCTTC
ATCGACACGC ACATCCACTT CCCGCAGACG GACATCATCG CCTCGCCGTC GCCGGGCCTG
CTGCACTGGC TGGAGACCTA CACCTTCCCC GAGGAGCGCC GCTTCGAAAG CCCGCAGTAC
GCGGCAGGCG TAGCATCGTT CTTCCTCGAT GAGCTGCTGC GCAACGGCAC CACCAGCGCA
ATGGTCTGGA GCACCGTGCA TCGCGGCTCG GCCGAGACGC TGTTCGAGCA GGCCCGGGCG
CGCGGCATGC GCCTGATCAC CGGCAAGGTG ATGATGGACC GCAACTGCCC CGAATACCTG
CGCGACACGG CCGAGCGCGG CGCGCGCGAT GCGGCCGACC TGATCGCGCG CTGGCACGGC
AAGGACCGGC TCGCCTACGC CATCACGCCG CGCTTCGCGC CGACCTCGAG CGAAGCACAG
CTGGCGGCCT GCGGTGAGCT GGCCCGGCAG CATCCGGACG TCTTCATCCA GACCCACGTC
GCCGAGAACC CCGACGAAGT GAAATGGGTG GCCGAGCTGT TCCCCAATGC CCGCAGCTAC
CTGGACGTCT ACGACCGCTA CGGCCTGCTG CGCCGCGGCG CCCTCTACGG CCATGCCATC
TGGCTGGACG ACGGCGACCG CCGGCGCATG GCCGAGTCCG GCGCGGCCGT CGCGCACTGC
CCCACCTCCA ACCTGTTCCT CGGCAGCGGT CTGTACAACT TCCACGCCAG CGACGCGCAC
CGCCTGGCAC TGACGCTCGC CACCGACGTG GGCGGCGGCA GCTCGTTCTC GATGCTGCGC
ACCATGGGCG CCGCGCACGA GGTCGCGCGC ATGGGCGGCT ACCACCTGAG CGCGCTGCGC
CTGTTCTACC TGGCCACGCG CGGCGCCGCC GAGGCGCTTG GCTGGCAGGA CCGCATCGGC
AGCTTCGTGC CGGGCGCGGA AGCCGACTTC ATCGTGCTCG ACCCGGCCGC CACGCCGCTG
CTGGCGCGCC GCAATTCCCG CGCCGAAACG CTGGAGGCAC AGCTGTTCTC GCTGGCGCTG
CTGGGCGAAG ACCGTGCCGT GGCCGCCACT TACATCCAGG GCGAACCCGC CAAATTCGCG
GTTGGCGTAT CGGCCTGA
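
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The following is a minimal sketch of how the GC content could be computed, assuming it is defined as the fraction of G and C bases over the total length; the helper names `clean_sequence` and `gc_content` are hypothetical, and only the first row of the sequence is used here, so the result is not expected to equal the 70% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in seq (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied as displayed in the record.
first_row = "ATGACCACCT TGCCGCCCGC GCCCGACGCC ACCCTCCACG CCTACCGCGG CCGCGTGCTG"

seq = clean_sequence(first_row)
print(len(seq), "bases in the first row")
print(f"GC content of the first row: {gc_content(seq):.1%}")
```

To check the reported value, paste all rows of the gene sequence into one string and pass it through the same two functions.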
 
Protein sequence
MTTLPPAPDA TLHAYRGRVL HFLHDPQYRE HDAYQYWEDG LLVIGAGKVV RAGDHAALKG 
TLPAGAQVHD YSGKLIVPGF IDTHIHFPQT DIIASPSPGL LHWLETYTFP EERRFESPQY
AAGVASFFLD ELLRNGTTSA MVWSTVHRGS AETLFEQARA RGMRLITGKV MMDRNCPEYL
RDTAERGARD AADLIARWHG KDRLAYAITP RFAPTSSEAQ LAACGELARQ HPDVFIQTHV
AENPDEVKWV AELFPNARSY LDVYDRYGLL RRGALYGHAI WLDDGDRRRM AESGAAVAHC
PTSNLFLGSG LYNFHASDAH RLALTLATDV GGGSSFSMLR TMGAAHEVAR MGGYHLSALR
LFYLATRGAA EALGWQDRIG SFVPGAEADF IVLDPAATPL LARRNSRAET LEAQLFSLAL
LGEDRAVAAT YIQGEPAKFA VGVSA
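
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own method: it builds the standard codon table, whose codon-to-amino-acid assignments translation table 11 shares (table 11 differs in which start codons it permits), and translates the first and last rows of the gene sequence. The function name `translate` and the choice to stop at the first stop codon are assumptions made for this example.

```python
from itertools import product

# Standard codon table in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence, as displayed above.
# The final row starts in frame because the 22 preceding rows hold 1320 bases.
first_bases = "ATGACCACCT TGCCGCCCGC GCCCGACGCC"
last_row = "GTTGGCGTAT CGGCCTGA"

print(translate(first_bases))  # MTTLPPAPDA
print(translate(last_row))     # VGVSA
```

The two outputs match the first ten and last five residues of the protein sequence shown above, and the final codon TGA is the stop.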