Gene Information |
Locus tag | Rleg_4666 |
Symbol | |
ID | 8007144 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012848 |
Strand | + |
Start bp | 29714 |
End bp | 31027 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 644821602 |
Product | cytosine deaminase-like protein |
Protein accession | YP_002972862 |
Protein GI | 241113027 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.69631 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCTATT CCTTCATATC CCCGCCGAAT GCGGCCCGCT TCGTGTTGAG CAACGCGACA GTGCCCGCCG TCACCGTCGA GCATGTCGAC GTGCCGGTCA CCGAAGGGCT GGCCACGGTC GATATCGTCA TCAGCGACGG CATGGTTGCC GCCATTCGGC CGGCCGGCGC CGCACCCGCC GATTATGCCA GGATCGATCT CAAGGACGGC ATGGTCTGGC CGTGCTTTGC CGATATCCAT ACTCATCTCG ACAAAGGCCA TATCTGGCCG CGCCAGGCCA ATCCCGACGG CAGCTTCATG GGCGCGCTCG ATGCCGTCAG GGCCGACCGG GAAGCAAACT GGTCGGCCGC AGACGTCAAA CGGCGGATGG AATTCTCGCT CCGCTCCGCC TATGCCCATG GCACCAGCCT GATCCGTACG CACCTGGACT CGCTTGCGCC ACAGCATCGC ATCTCATTCG AGGTTTTTGC CGAAATCCGG GATACCTGGA AGGACAGGAT TGCATTGCAG GCGGTCGCCC TCTTCCCGCT GGATGCCATG GCATCCAGCG CCTTCTTTGC CGATCTCGTC ACAACAATCC GGCAGAACGG CGGCCTGCTT GGCGGCGTCA CCAGGATGGG GCCGGAGCTT GTCTGGCAGC TTGACACGCT CTTCAGGACC GCCTGGGAGC ATGGCCTGGA TATCGATCTG CATGTCGACG AGACGGATGA TCGCGGCGCC GAGACGCTGA AGGCAATCGC CGAGGCCGTG CTGCGCAACG GGTTCGAGGG CAAGGTCACC GCCGGCCATT GTTGCTCGCT CGCCCGCCAG GACGAAGACA CCGCCGCGCG CACCGTCGAA TTGGTCGCCA AGGCAGGTAT CGCGGTCATC GCGCTACCGA TGTGCAACAT GTATCTGCAG GATCGCTATC CCGGCCGCAC CCCGCGCTGG CGCGGCGTCA CGCTGTTCCA GGAGCTGGCC GCCGCCGGTG TCGCGACCGC GGTCGCATCC GACAATACCC GTGACCCCTT CTATGCCTAT GGTGATCTCG ATCCGGTGGA GGTTTTCCGT GAGGCCGTGC GGATTCTGCA TCTCGATCAC CCGCTCGATA CCGCCGCCCG TGTCGTCACC ACCTCGCCCG CCGCCATCGT CGGGCGACCG GACAAGGGCC GTATCGCCGC CGGCGATCCT GCCGATCTCG TGCTCTTCAG CGCGCGGCGC TGGAGTGAAT TCCTGTCCCG TCCGCAGTCT GACCGCGTCG TGCTTCGCCG CGGCAAGGTG ATCGACCGCA GCCTGCCGGA CTACCGTGAA CTCGATAACG TCGTTGGAGC CTGA
|
Protein sequence | MTYSFISPPN AARFVLSNAT VPAVTVEHVD VPVTEGLATV DIVISDGMVA AIRPAGAAPA DYARIDLKDG MVWPCFADIH THLDKGHIWP RQANPDGSFM GALDAVRADR EANWSAADVK RRMEFSLRSA YAHGTSLIRT HLDSLAPQHR ISFEVFAEIR DTWKDRIALQ AVALFPLDAM ASSAFFADLV TTIRQNGGLL GGVTRMGPEL VWQLDTLFRT AWEHGLDIDL HVDETDDRGA ETLKAIAEAV LRNGFEGKVT AGHCCSLARQ DEDTAARTVE LVAKAGIAVI ALPMCNMYLQ DRYPGRTPRW RGVTLFQELA AAGVATAVAS DNTRDPFYAY GDLDPVEVFR EAVRILHLDH PLDTAARVVT TSPAAIVGRP DKGRIAAGDP ADLVLFSARR WSEFLSRPQS DRVVLRRGKV IDRSLPDYRE LDNVVGA
|
| |
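
The coordinate and length fields in this record are arithmetically related, and the relationship can be checked with a short script. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in Protein Length. The helper names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical. The GC helper accepts the space-separated 10-base blocks used in the Sequence field; it is shown only on the first two blocks, so it makes no claim about the reported 65% value.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_fraction(seq_blocks: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(seq_blocks.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# Values copied from the Gene Information table above.
length_bp = gene_length(29714, 31027)
print(length_bp)                   # 1314, matches "Gene Length"
print(protein_length(length_bp))   # 437, matches "Protein Length"

# First two 10-base blocks of the Gene sequence field only.
print(gc_fraction("ATGACCTATT CCTTCATATC"))  # 0.35
```

Under these assumptions the record is internally consistent: 31027 - 29714 + 1 = 1314 bp, and 1314 / 3 = 438 codons, of which the last (TGA) is the stop codon, leaving 437 residues.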