Gene Rleg2_1675 details

Gene Information

Locus tag: Rleg2_1675
Symbol:
ID: 6980412
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1704838
End bp: 1706151
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 65%
IMG OID: 643396399
Product: cytosine deaminase-like protein
Protein accession: YP_002281189
Protein GI: 209549272
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
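
The start, end, gene length, and protein length fields above agree with one another, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1704838, 1706151)
print(length)                     # 1314
print(protein_length_aa(length))  # 437
```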


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.00400409
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCATT CCTTCCTTTC CCCGCCCAAT GCCGGCCGCT TCGTGCTGAG CAATGCAACG 
CTGCCTGCCG TCGCTGTCGA GGGATTTGAC GCGCCGGCCA CCGAGGGGCT GGTCAAGGCG
GATATCGTCA TCGCCGATGG CGTGATCTCC ACGCTGCTGC CGGCGGGTGC CGCGCCGGTG
GAATTGCCAA GATCCGATCT GAAGGATGGC ATGGTCTGGC CCTGCTTCAC CGATATGCAC
ACCCATCTCG ACAAGGGGCA TATCTGGCCT CGCAACGCCA ATCCCGACGG CACCTTCATC
GGCGCCCTGG ACGCAGTGCG GGCCGACCGG GAGGCGTACT GGTCGGCGGC GGATGTCAGC
AAGCGCATGG AATTCTCGCT GCGCTCGGCC TATGCGCACG GGACCAGCCT GATCCGCACG
CATCTCGATT CGCTGGCGCC GCAGCACCGT ATTTCCTTCG AGGTTTTTGC CGAGATGCGC
GAGGCCTGGA AAGACCGGAT CGCGCTGCAG GCGGTGGCGC TCTTCCCGCT GGAAAATATG
GTGGACGCGA CCTATTTCGC CGACCTGGTC GCCGTCGTCC GCGAAAAAGG CGGGCTGATC
GGCGGCGTCA CCCGGATGAC CGCCGATCTC GACAGCCAGC TCGATGTACT TTTCAGGGCC
GCCGCCGATA ACGGCCTCGA CGTCGATCTT CATGTCGACG AGAGCGACGA TCCCGCCGCC
GAGACGCTGA AGGCGATCGC GCAAGCCGTG CTGCGCAACC GGTTCGACGG CAAGGTGACG
GCAGGCCATT GCTGCTCGCT CGCCAGGCAG GACGAGGAGA CGGCAAAACG CGCGGTCGAG
CTCGTCGCGA AAGCTGGCAT CTCCATCGTC TCGCTGCCGA TGTGCAACCT CTATCTACAG
GACCGCTATC CCGGCCGAAC GCCTCGCTGG CGCGGCGTCA CCCTGTTCAA GGAACTGGCG
GCGGCCGGCG TTGCGACGGC AGTTTCCTCC GACAACACGC GTGATCCCTT TTATGCCTAT
GGCGATCTCG ACCCGGTCGA GGTGTTGCGC GAGGCGGTGC GCATTCTGCA TCTCGATCAC
CCGCTGGATA CGGCCGCGCG CGTCGTTACC ACCTCGCCTG CGGATATTCT CGGCCGGCCC
GACAAGGGGC GCATCGCCGT CGGCGCCCTG GCCGACCTCG TGCTCTTCAG CGCCCGGCGC
TGGAGCGAAT TTCTTTCCCG TCCGCAGTCT GACCGCGTCG TGCTTCGCCG CGGCAAGGTG
ATCGACCGCA GCCTGCCGGA CTATCGTGAA CTCGACAGCG TCGTTGGAGC ATGA
 
Protein sequence
MTHSFLSPPN AGRFVLSNAT LPAVAVEGFD APATEGLVKA DIVIADGVIS TLLPAGAAPV 
ELPRSDLKDG MVWPCFTDMH THLDKGHIWP RNANPDGTFI GALDAVRADR EAYWSAADVS
KRMEFSLRSA YAHGTSLIRT HLDSLAPQHR ISFEVFAEMR EAWKDRIALQ AVALFPLENM
VDATYFADLV AVVREKGGLI GGVTRMTADL DSQLDVLFRA AADNGLDVDL HVDESDDPAA
ETLKAIAQAV LRNRFDGKVT AGHCCSLARQ DEETAKRAVE LVAKAGISIV SLPMCNLYLQ
DRYPGRTPRW RGVTLFKELA AAGVATAVSS DNTRDPFYAY GDLDPVEVLR EAVRILHLDH
PLDTAARVVT TSPADILGRP DKGRIAVGAL ADLVLFSARR WSEFLSRPQS DRVVLRRGKV
IDRSLPDYRE LDSVVGA
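
The sequences above are printed in space-separated groups of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up together with a GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sample input is only the first two groups of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Join a grouped, multi-line sequence into one uppercase string."""
    # str.split() with no arguments splits on any run of whitespace,
    # which covers both the spaces between groups and the line breaks.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two ten-base groups of the gene sequence, as printed in the record.
sample = clean_sequence("ATGACCCATT CCTTCCTTTC")
print(len(sample))  # 20
print(gc_fraction(sample))
```

Applying the same two functions to the complete gene sequence should give a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.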