Gene Rleg2_6355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6355 
Symbol 
ID6983657 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp26 
End bp1315 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content61% 
IMG OID643399355 
Productcytosine deaminase 
Protein accessionYP_002284111 
Protein GI209552196 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.616526 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.149808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATT TGATCGTCAG AAATGCAAAT CTCCCCGATG GCTCAGAGAG TATCGATATC 
GGCATCCAGG GCGGCAAGAT CATCGCTCTT GAGCACAATC TCCAGGCGCA GGCGGGTGAA
GAGATCGACG CGACCGGTCG GCTGGTCAGC CCGCCCTTCG TCGATCCGCA TTTCCATATG
GATGCCACCC TGTCCCTCGG CTTGCCGCGC ATGAATGTAT CCGGCACGCT GCTTGAAGGA
ATCGCGCTCT GGGGCGAGCT TCGCCCGATC GTGACAAAAG AAGAATTGGT CGACCGTGCG
TTGCGCTATT GCGACCTGGC TGTCACCCAG GGGCTGCTCT TCATTCGCAG CCATGTCGAC
ACCAGCGATC CCAGGCTGGT GACCGTCGAG GCGATGATCG AGGTTCGCGA AAAGGTCGCC
CCCTATATCG ACCTGCAATT GGTCGCTTTT CCTCAGGATG GCTATTACCG CTCGCCCGGC
GCGATCGACA CCCTCAACCG CGCCCTCGAC ATGGGTGTCG ATATCGTTGG CGGCATTCCC
CACTTCGAAC GGACGATGGG CGAAGGCACG GCGTCGGTCG AGGCGCTCTG CCGCATCGCT
GCCGACCGCG GCCTGCCGGT CGACATGCAT TGCGACGAAA CCGATGATCC GCACTCGCGC
CATATCGAGA CGCTGGCTGC GGAGACCATC CGCTTCGGGC TCAAGGGGCG CGTTGCCGGC
TCGCATCTGA CCTCAATGCA TTCGATGGAT AATTATTATG TCTCCAAGCT CATTCCGCTG
ATGGCGGAGG CCGAGATCAA CGTCATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGC
CGGCACGACA CCTATCCGAA ACGCCGCGGC ATGACCCGCG TGCGCGAATT GATGGATGCC
GGGCTTAATG TTTCCTTCGG GCACGATTGC GTCATGGACC CCTGGTACTC GATGGGATCG
GGCGACATGC TGGAAGTCGG CCATATGGCG ATCCATGTCG CGCAGATGGC CGGCATCGAC
GACAAGAAGA GGATATTCGA GGCGCTGACC GTCAATTCGG CGAGGACGAT GGGGCTTGCA
GGCTATGGCC TGGAAAAGGG ATGCAACGCC GACCTCGTCA TCCTCCAGGC GAGCGACACG
CTGGAAGCGC TGCGGCTGAA GCCGAGCCGG CTGGCGGTGA TCCGGCGCGG TAAGGTCGTC
GCCCGCTCGG CGCCGCGCAT CGACGAGCTT TTCCTGGACG GGCGCCCTGC CCGGATCGAT
GGCGGCCTGG ACTATATCCC CCGCTATTGA
 
Protein sequence
MFDLIVRNAN LPDGSESIDI GIQGGKIIAL EHNLQAQAGE EIDATGRLVS PPFVDPHFHM 
DATLSLGLPR MNVSGTLLEG IALWGELRPI VTKEELVDRA LRYCDLAVTQ GLLFIRSHVD
TSDPRLVTVE AMIEVREKVA PYIDLQLVAF PQDGYYRSPG AIDTLNRALD MGVDIVGGIP
HFERTMGEGT ASVEALCRIA ADRGLPVDMH CDETDDPHSR HIETLAAETI RFGLKGRVAG
SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDTYPKRRG MTRVRELMDA
GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKKRIFEALT VNSARTMGLA
GYGLEKGCNA DLVILQASDT LEALRLKPSR LAVIRRGKVV ARSAPRIDEL FLDGRPARID
GGLDYIPRY