Gene Rleg_0191 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0191 
Symbol 
ID8011420 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp190525 
End bp191619 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content65% 
IMG OID644822783 
Product2'-deoxycytidine 5'-triphosphate deaminase 
Protein accessionYP_002974041 
Protein GI241202945 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0717] Deoxycytidine deaminase 
TIGRFAM ID[TIGR02274] deoxycytidine triphosphate deaminase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0836606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.759587 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCTC GCGAAACGGG AATTCTGGCG GATCGCGCGA TCTCCGCGCT GTTCGAAACG 
GGGCGTCTGA TCTCCGAGCG GGAGCTGGAC CGCGACCAGA TCCAGCCGGC AAGCCTCGAC
CTGCGCTTGG GCGGCAAGGC TTTTCGGGTG CGTGCCAGCT TCATGCCAGG CCCCTCGCAT
CTGGTGTCCG ACAAGCTTGA CCGGCTGAGC CTGCACGTGA TCGACCTCTC CGAAGGCGCG
GTGCTCGAAA CGGGCTGCGT CTATATCGTG CCGCTGATGG AGAGCCTGGC GCTGCCGGCC
GAGATGTCGG CCTCGGCCAA TCCGAAGAGC TCGACCGGGC GCCTCGATAT CTTCACCCGC
GTCATTACCG ACTACGCCCA GGAATTCGAC AAGATCCCAT CAGGCTATTC CGGCCCGCTC
TATCTCGAAA TCAGCCCGCG CACCTTCCCG ATCGTTGTGC GCCGCGGCTC GCGGCTGTCG
CAGATCCGGT TCCGCGTCGG CCAGGCGCTG CTCGGCGAGC CGGAACTGTT GAAGCTGCAT
GAGAGCGAGA CGCTGGTCGC CAGCAAGCTG CCGAACGTTT CCGGCGGCGG CATCGCCCTG
TCGATCGACC TCGCCGGAGA CAAGGACGGC CTGATCGGTT ATCGCGGCAA ACATCACACC
GCCGTCGTCG ACGTCGACAA GAAGGCCGAG CACGACCTCT TCGATTTCTG GGAGCCGCTC
CACAGCCGCG GCCGCAACGA GCTGATCCTC GATCCCGATG AATTCTATAT CCTCGTCTCG
CGCGAGGCGG TGCACGTGCC GCCGGATTAC GCAGCCGAAA TGACCCCCTT CGATCCGCTG
GTCGGCGAAT TCCGCGTCCA TTATGCCGGC TTCTTCGATC CGGGCTTCGG CCATGCGCCT
GCCGGCGGGC GCGGCAGCCG CGCGGTGCTC GAAGTGCGCA GCCACGAGGT GCCCTTCATC
CTCGAAGACG GCCAGATCGT CGGCCGCCTC GTCTACGAAC ACATGCAGGA AAAGCCCGCC
AGCCTCTACG GCTCCGGCCT AGGCTCCAAT TACCAGGCCC AGGGCCTGAA GCTCTCGAAG
CACTTCCGCA TCTGA
 
Protein sequence
MMARETGILA DRAISALFET GRLISERELD RDQIQPASLD LRLGGKAFRV RASFMPGPSH 
LVSDKLDRLS LHVIDLSEGA VLETGCVYIV PLMESLALPA EMSASANPKS STGRLDIFTR
VITDYAQEFD KIPSGYSGPL YLEISPRTFP IVVRRGSRLS QIRFRVGQAL LGEPELLKLH
ESETLVASKL PNVSGGGIAL SIDLAGDKDG LIGYRGKHHT AVVDVDKKAE HDLFDFWEPL
HSRGRNELIL DPDEFYILVS REAVHVPPDY AAEMTPFDPL VGEFRVHYAG FFDPGFGHAP
AGGRGSRAVL EVRSHEVPFI LEDGQIVGRL VYEHMQEKPA SLYGSGLGSN YQAQGLKLSK
HFRI