Gene Rleg_4666 details


Gene Information

Locus tag: Rleg_4666
Symbol:
ID: 8007144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 29714
End bp: 31027
Gene Length: 1314 bp
Protein Length: 437 aa
Translation table: 11
GC content: 65%
IMG OID: 644821602
Product: cytosine deaminase-like protein
Protein accession: YP_002972862
Protein GI: 241113027
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
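
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends with a single untranslated stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 29714
end_bp = 31027
reported_gene_length = 1314      # bp
reported_protein_length = 437    # aa

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, gene_length == reported_gene_length)          # 1314 True
print(protein_length, protein_length == reported_protein_length)  # 437 True
```

Under these assumptions the reported gene length and protein length agree with the coordinates.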


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.69631
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTATT CCTTCATATC CCCGCCGAAT GCGGCCCGCT TCGTGTTGAG CAACGCGACA 
GTGCCCGCCG TCACCGTCGA GCATGTCGAC GTGCCGGTCA CCGAAGGGCT GGCCACGGTC
GATATCGTCA TCAGCGACGG CATGGTTGCC GCCATTCGGC CGGCCGGCGC CGCACCCGCC
GATTATGCCA GGATCGATCT CAAGGACGGC ATGGTCTGGC CGTGCTTTGC CGATATCCAT
ACTCATCTCG ACAAAGGCCA TATCTGGCCG CGCCAGGCCA ATCCCGACGG CAGCTTCATG
GGCGCGCTCG ATGCCGTCAG GGCCGACCGG GAAGCAAACT GGTCGGCCGC AGACGTCAAA
CGGCGGATGG AATTCTCGCT CCGCTCCGCC TATGCCCATG GCACCAGCCT GATCCGTACG
CACCTGGACT CGCTTGCGCC ACAGCATCGC ATCTCATTCG AGGTTTTTGC CGAAATCCGG
GATACCTGGA AGGACAGGAT TGCATTGCAG GCGGTCGCCC TCTTCCCGCT GGATGCCATG
GCATCCAGCG CCTTCTTTGC CGATCTCGTC ACAACAATCC GGCAGAACGG CGGCCTGCTT
GGCGGCGTCA CCAGGATGGG GCCGGAGCTT GTCTGGCAGC TTGACACGCT CTTCAGGACC
GCCTGGGAGC ATGGCCTGGA TATCGATCTG CATGTCGACG AGACGGATGA TCGCGGCGCC
GAGACGCTGA AGGCAATCGC CGAGGCCGTG CTGCGCAACG GGTTCGAGGG CAAGGTCACC
GCCGGCCATT GTTGCTCGCT CGCCCGCCAG GACGAAGACA CCGCCGCGCG CACCGTCGAA
TTGGTCGCCA AGGCAGGTAT CGCGGTCATC GCGCTACCGA TGTGCAACAT GTATCTGCAG
GATCGCTATC CCGGCCGCAC CCCGCGCTGG CGCGGCGTCA CGCTGTTCCA GGAGCTGGCC
GCCGCCGGTG TCGCGACCGC GGTCGCATCC GACAATACCC GTGACCCCTT CTATGCCTAT
GGTGATCTCG ATCCGGTGGA GGTTTTCCGT GAGGCCGTGC GGATTCTGCA TCTCGATCAC
CCGCTCGATA CCGCCGCCCG TGTCGTCACC ACCTCGCCCG CCGCCATCGT CGGGCGACCG
GACAAGGGCC GTATCGCCGC CGGCGATCCT GCCGATCTCG TGCTCTTCAG CGCGCGGCGC
TGGAGTGAAT TCCTGTCCCG TCCGCAGTCT GACCGCGTCG TGCTTCGCCG CGGCAAGGTG
ATCGACCGCA GCCTGCCGGA CTACCGTGAA CTCGATAACG TCGTTGGAGC CTGA
 
Protein sequence
MTYSFISPPN AARFVLSNAT VPAVTVEHVD VPVTEGLATV DIVISDGMVA AIRPAGAAPA 
DYARIDLKDG MVWPCFADIH THLDKGHIWP RQANPDGSFM GALDAVRADR EANWSAADVK
RRMEFSLRSA YAHGTSLIRT HLDSLAPQHR ISFEVFAEIR DTWKDRIALQ AVALFPLDAM
ASSAFFADLV TTIRQNGGLL GGVTRMGPEL VWQLDTLFRT AWEHGLDIDL HVDETDDRGA
ETLKAIAEAV LRNGFEGKVT AGHCCSLARQ DEDTAARTVE LVAKAGIAVI ALPMCNMYLQ
DRYPGRTPRW RGVTLFQELA AAGVATAVAS DNTRDPFYAY GDLDPVEVFR EAVRILHLDH
PLDTAARVVT TSPAAIVGRP DKGRIAAGDP ADLVLFSARR WSEFLSRPQS DRVVLRRGKV
IDRSLPDYRE LDNVVGA
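
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a minimal sketch, not the method the database used. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in its permitted start codons. Alternative start codons are not handled here, which does not matter for this gene because it starts with ATG.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G,
# first position varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()

def translate(seq):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

# First 30 bases of the gene sequence above.
print(translate("ATGACCTATT CCTTCATATC CCCGCCGAAT"))  # MTYSFISPPN
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the protein sequence and the GC content field listed in the record.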