Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1675 |
Symbol | |
ID | 6980412 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 1704838 |
End bp | 1706151 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643396399 |
Product | cytosine deaminase-like protein |
Protein accession | YP_002281189 |
Protein GI | 209549272 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00400409 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCCATT CCTTCCTTTC CCCGCCCAAT GCCGGCCGCT TCGTGCTGAG CAATGCAACG CTGCCTGCCG TCGCTGTCGA GGGATTTGAC GCGCCGGCCA CCGAGGGGCT GGTCAAGGCG GATATCGTCA TCGCCGATGG CGTGATCTCC ACGCTGCTGC CGGCGGGTGC CGCGCCGGTG GAATTGCCAA GATCCGATCT GAAGGATGGC ATGGTCTGGC CCTGCTTCAC CGATATGCAC ACCCATCTCG ACAAGGGGCA TATCTGGCCT CGCAACGCCA ATCCCGACGG CACCTTCATC GGCGCCCTGG ACGCAGTGCG GGCCGACCGG GAGGCGTACT GGTCGGCGGC GGATGTCAGC AAGCGCATGG AATTCTCGCT GCGCTCGGCC TATGCGCACG GGACCAGCCT GATCCGCACG CATCTCGATT CGCTGGCGCC GCAGCACCGT ATTTCCTTCG AGGTTTTTGC CGAGATGCGC GAGGCCTGGA AAGACCGGAT CGCGCTGCAG GCGGTGGCGC TCTTCCCGCT GGAAAATATG GTGGACGCGA CCTATTTCGC CGACCTGGTC GCCGTCGTCC GCGAAAAAGG CGGGCTGATC GGCGGCGTCA CCCGGATGAC CGCCGATCTC GACAGCCAGC TCGATGTACT TTTCAGGGCC GCCGCCGATA ACGGCCTCGA CGTCGATCTT CATGTCGACG AGAGCGACGA TCCCGCCGCC GAGACGCTGA AGGCGATCGC GCAAGCCGTG CTGCGCAACC GGTTCGACGG CAAGGTGACG GCAGGCCATT GCTGCTCGCT CGCCAGGCAG GACGAGGAGA CGGCAAAACG CGCGGTCGAG CTCGTCGCGA AAGCTGGCAT CTCCATCGTC TCGCTGCCGA TGTGCAACCT CTATCTACAG GACCGCTATC CCGGCCGAAC GCCTCGCTGG CGCGGCGTCA CCCTGTTCAA GGAACTGGCG GCGGCCGGCG TTGCGACGGC AGTTTCCTCC GACAACACGC GTGATCCCTT TTATGCCTAT GGCGATCTCG ACCCGGTCGA GGTGTTGCGC GAGGCGGTGC GCATTCTGCA TCTCGATCAC CCGCTGGATA CGGCCGCGCG CGTCGTTACC ACCTCGCCTG CGGATATTCT CGGCCGGCCC GACAAGGGGC GCATCGCCGT CGGCGCCCTG GCCGACCTCG TGCTCTTCAG CGCCCGGCGC TGGAGCGAAT TTCTTTCCCG TCCGCAGTCT GACCGCGTCG TGCTTCGCCG CGGCAAGGTG ATCGACCGCA GCCTGCCGGA CTATCGTGAA CTCGACAGCG TCGTTGGAGC ATGA
|
Protein sequence | MTHSFLSPPN AGRFVLSNAT LPAVAVEGFD APATEGLVKA DIVIADGVIS TLLPAGAAPV ELPRSDLKDG MVWPCFTDMH THLDKGHIWP RNANPDGTFI GALDAVRADR EAYWSAADVS KRMEFSLRSA YAHGTSLIRT HLDSLAPQHR ISFEVFAEMR EAWKDRIALQ AVALFPLENM VDATYFADLV AVVREKGGLI GGVTRMTADL DSQLDVLFRA AADNGLDVDL HVDESDDPAA ETLKAIAQAV LRNRFDGKVT AGHCCSLARQ DEETAKRAVE LVAKAGISIV SLPMCNLYLQ DRYPGRTPRW RGVTLFKELA AAGVATAVSS DNTRDPFYAY GDLDPVEVLR EAVRILHLDH PLDTAARVVT TSPADILGRP DKGRIAVGAL ADLVLFSARR WSEFLSRPQS DRVVLRRGKV IDRSLPDYRE LDSVVGA
|
| |