Gene Rleg_6176 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6176 
Symbol 
ID8016189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012852 
Strand
Start bp220734 
End bp222023 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content62% 
IMG OID644827482 
Productcytosine deaminase 
Protein accessionYP_002978682 
Protein GI241258798 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0175267 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.144345 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATC TGATCGTGAG AAATGCAAAT GTCCCCGATG GCCGAAAGGG TATTGATATT 
GGCATCCAGG GCGGCAAGAT CGTTGCCATT GAGAGTAATC TCCAGGCGCA GGCGGGAGAA
GAAATCGACG CGACCGGCCG GCTGGTCAGT CCGCCTTTTG TCGATCCGCA TTTCCATATG
GACGCCACCC TGTCGCTCGG CCTGCCGCGC ATGAACGTGT CCGGCACCCT GCTCGAAGGC
ATTGCGCTCT GGGGGGAGCT GCGCCCGATC GTGACGAAGG AGGAACTGGT CGATCGCGCG
CTGCGCTATT GCGATCTGGC GGTCACGCAG GGCCTGCTCT TCATCCGCAG CCATGTCGAT
ACCAGTGATC CCAGGCTTGT GACCGTCGAG GCGATGATCG AGGTTCGCGA GAAGGTCGCG
CCCTATATCG ATCTGCAGCT GGTCGCCTTT CCCCAGGACG GTTACTACCG CTCGCCGGGC
GCGATCGACG CGCTCAACCG CGCCCTCGAC ATGGGCGTCG ATATCGTCGG CGGCATTCCC
CACTTCGAAC GGACGATGGG CGAAGGGACG GCGTCTGTCG AGGCGCTCTG CCGCATCGCC
GCCGATCGCG GCCTGCCGGT CGATATCCAC TGCGACGAAA CCGACGATCC GCTCTCGCGC
CATATCGAGA CGCTGTCTGC GGAAACCATC CGCTTCGGCT TGCAAGGGCG CGTCGCCGGC
TCGCATCTGA CCTCGATGCA CTCGATGGAC AATTATTACG TCTCCAAGCT CATTCCGCTG
ATGGCGGAGG CCGAGATCAA CGTCATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGC
CGGCACGACA CCTATCCGAA ACGCCGCGGC ATGACCCGCG TGCGGGAATT GATGGATGCC
GGGCTCAATG TCTCCTTCGG GCACGACTGC GTCATGGACC CCTGGTATTC GATGGGGTCG
GGCGACATGC TGGAGGTTGG CCATATGGCA ATCCATGTCG CGCAGATGGC CGGCATCGAC
GACAAGAAGA GGATCTTCGA CGCGCTGACC GTCAATTCGG CAAAGACGAT GGGGCTTGCA
GATTACGGCC TGGAAAAGGG ATGCAACGCC GACCTCGTCA TCCTCCAGGC GAGCGACACG
CTGGAAGCAC TGCGGCTGAA GCCGAACCGC CTGGCGGTCA TCCGCCGCGG CAAGGTCATC
GCCCGCTCGG CGCCGCGCAT CGGCGAGCTT TTCCTCGACG GACGCCCGGC ACGGATCGAC
GGCGGGTTGG ATTACGTGCC TCGTTATTGA
 
Protein sequence
MFDLIVRNAN VPDGRKGIDI GIQGGKIVAI ESNLQAQAGE EIDATGRLVS PPFVDPHFHM 
DATLSLGLPR MNVSGTLLEG IALWGELRPI VTKEELVDRA LRYCDLAVTQ GLLFIRSHVD
TSDPRLVTVE AMIEVREKVA PYIDLQLVAF PQDGYYRSPG AIDALNRALD MGVDIVGGIP
HFERTMGEGT ASVEALCRIA ADRGLPVDIH CDETDDPLSR HIETLSAETI RFGLQGRVAG
SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDTYPKRRG MTRVRELMDA
GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKKRIFDALT VNSAKTMGLA
DYGLEKGCNA DLVILQASDT LEALRLKPNR LAVIRRGKVI ARSAPRIGEL FLDGRPARID
GGLDYVPRY