Gene Smed_0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0134 
Symbol 
ID5320963 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp148315 
End bp149412 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content62% 
IMG OID640789067 
Product2'-deoxycytidine 5'-triphosphate deaminase 
Protein accessionYP_001325829 
Protein GI150395362 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0717] Deoxycytidine deaminase 
TIGRFAM ID[TIGR02274] deoxycytidine triphosphate deaminase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.338998 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCGCG AAACAGGGAT TTTGGCCGAC CGCGCAATTG CAGCGCTGTT CGGCTCGGGA 
CGTCTGAAGA GCGAGAAGGC GCTGGATGGG GACCAGATTC AGCCGGCCAG CCTCGATCTT
CGCCTTGGAT CCACGGCATT CCGTGTCCGG GCAAGCTTCA TGCCCGGCCC TTCTCATCTG
GTGGCCGACA AGCTCGATCG CCTGAAACTG CATGTCATTG ATCTCAGCGA CGGCGCGGTG
CTCGAAACCG GCTGCGTCTA TATCGTACCG CTGATGGAGA GCCTGTCGCT GCCGGCGAAC
ATGTCCGCCT CCGCCAATCC GAAAAGCTCA ACCGGCCGAC TCGACATCTT TACACGGGTG
ATCACCGACA GGGCGCAGGA GTTCGACAAG ATCCCGGCCG GCTACAGCGG TCCGCTCTAT
CTCGAAATCA GCCCGCGAAC CTTTCCGATC GTCGTCAGGC GCGGGTCGCG GCTTTCGCAG
ATACGCTTCC GCGTCGGACA CGCGGTTCTC TCCGAGCAGG AACTGCTGGC CCTTCATGAA
AGCGACGTGC TCGTTGCAAG CGATCGGCCA AACGTCTCCG GCGGCGGGAT CGCCCTTTCC
ATCGACCTCA AGGGCACCGG CCCCGACGGG CTGATCGGCT ATCGCGGCAA GCATCACACA
TCGGTCGTCG ACGTCGACAA GAAAGCCCAG CATGCGGTTT TCGATTTCTG GGAGCCCCTC
TACAGCCGCG GCCGCGATGA CCTCATCCTC GACCCGGACG AATTCTACAT CCTCGTTTCC
CGCGAGGCCG TTCATGTACC GCCACTCTAT GCCGCCGAGA TGACGCCGTT CGATCCGCTG
GTGGGCGAGT TCCGCGTTCA TTATGCCGGT TTCTTTGACC CCGGGTTCGG CCATGCGTCG
GCCGGCGGCA GCGGCAGCCG GGCGGTGCTC GAAGTCCGCA GTCACGAGGT GCCGTTCATT
CTGGAGCATG GCCAGATCGT CGGCCGTCTC ATCTACGAAC ACATGCTGGA CCGCCCGGAG
GGGCTTTACG GCCTCGACCT CGGCTCCAAT TATCAGGCGC AGGGGCTCAA GCTCTCTAAG
CATTTTCGCG CGGAATGA
 
Protein sequence
MGRETGILAD RAIAALFGSG RLKSEKALDG DQIQPASLDL RLGSTAFRVR ASFMPGPSHL 
VADKLDRLKL HVIDLSDGAV LETGCVYIVP LMESLSLPAN MSASANPKSS TGRLDIFTRV
ITDRAQEFDK IPAGYSGPLY LEISPRTFPI VVRRGSRLSQ IRFRVGHAVL SEQELLALHE
SDVLVASDRP NVSGGGIALS IDLKGTGPDG LIGYRGKHHT SVVDVDKKAQ HAVFDFWEPL
YSRGRDDLIL DPDEFYILVS REAVHVPPLY AAEMTPFDPL VGEFRVHYAG FFDPGFGHAS
AGGSGSRAVL EVRSHEVPFI LEHGQIVGRL IYEHMLDRPE GLYGLDLGSN YQAQGLKLSK
HFRAE