Gene Smed_4494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_4494 
Symbol 
ID5319186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009620 
Strand
Start bp977901 
End bp979208 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content61% 
IMG OID640776295 
Productguanine deaminase 
Protein accessionYP_001313227 
Protein GI150376631 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR02967] guanine deaminase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0638874 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0636141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCA TCCTCATTCG CGGCCGCACT TTAAGCTTCC ATCGCGAACC GCAGGGCATC 
GACGACAAAG CCTCCTATGC CTATGAGAGC GACGGCGCCC TTCTCGTCGA GAACGGCGTG
ATCACCGCAT CCGGCGCCTA CGCGATGGTC CAGGCGGGTG CACCTGAGAA CGTGGCGGAG
GTCGATCACC GGCCAGATCT TATCGTACCG GGCTTCATCG ACACACATCT GCATTTTCCG
CAGATGCAGG TTATGGCCTC CTATGCGGCG AACCTGCTTG AATGGCTGAA CAGCTACACT
TTTCCCGAGG AATGCCGGTT CGTCGAGACT GCGCATGCGG AGAGGATCGC CCGGCATTTC
TTCGACGAGA TGGTCCGCCA TGGCACCACG ACCGCCGCAG CCTATTGTTC CGTACACAAA
ACGTCCGCCG ACGCCTTCTT CGCCGAGAGC GTGAAGCGCG GCATGTGCAT GGTGGCCGGC
AAGGTGATGA TGGATCGCAA TGCGCCCCAG GGACTGCTCG ACACGCCCGA GACGAGCTAT
GATGAAACGC GGGCGGTCAT TGCGGACTGG CATGGCAAGG GGCGCAACCA TGTGGCCATC
ACGCCGCGCT TCGCGATCAC TTCGACCCCG GAGCAGATGG AGGCCGTGAG ATCGCTCGTC
GACGAGTTTC CCGACCTGCA TGTGCAGACG CACCTTTCGG AAAATCGCGA CGAGATCGCC
TACACGCTGG AACTCTATCC GGAAGCGGCG GACTATACTG ACGTCTACGC CCGGTATGGT
CTGCTCGGAC CGAAAAGCCT CTTCGGTCAT TGCATTCACC TGTCCGAGCG TGAAGCCGAT
GCCATGAGCG ACACCGGCTC GATCGCCGTC TTCTGCCCGA CCTCCAATCT TTTCCTTGGT
TCCGGCCTGT TCCCTTTGCG GGCACTGACG CGGCGACAGC GGCCGGTGCG CGTCTCGGTC
GCATCGGACA TCGGCGGCGG CACCAGCTAT TCGATGATGA AGACGCTCGA TGAAGCCTAC
AAGATTCTGC AGTTGCAAGG CGAAAGGCTG AACCCCTTCG ACAGCTTCTA CATGATGACT
CGCGGCAATG CCGAGGCGCT GTCGCTCGCC GGCCGCATCG GCACGCTGGA GCCCGGCACG
GATGCCGACC TCGCGGTCCT CGACATGGCG GCAACACCGG CAATGGCACT CAGGGCCGAA
GTTGTCAATT CTCTGGCCGA CGAGCTTTTC CTCCTGCAGA CGATGGGTGA CGACCGCGCT
GTCGTCGAGA CCTACGTCGC AGGAAAGCCA TGCAAATCAA TGCTTTAG
 
Protein sequence
MTAILIRGRT LSFHREPQGI DDKASYAYES DGALLVENGV ITASGAYAMV QAGAPENVAE 
VDHRPDLIVP GFIDTHLHFP QMQVMASYAA NLLEWLNSYT FPEECRFVET AHAERIARHF
FDEMVRHGTT TAAAYCSVHK TSADAFFAES VKRGMCMVAG KVMMDRNAPQ GLLDTPETSY
DETRAVIADW HGKGRNHVAI TPRFAITSTP EQMEAVRSLV DEFPDLHVQT HLSENRDEIA
YTLELYPEAA DYTDVYARYG LLGPKSLFGH CIHLSEREAD AMSDTGSIAV FCPTSNLFLG
SGLFPLRALT RRQRPVRVSV ASDIGGGTSY SMMKTLDEAY KILQLQGERL NPFDSFYMMT
RGNAEALSLA GRIGTLEPGT DADLAVLDMA ATPAMALRAE VVNSLADELF LLQTMGDDRA
VVETYVAGKP CKSML