Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6176 |
Symbol | |
ID | 8016189 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012852 |
Strand | - |
Start bp | 220734 |
End bp | 222023 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644827482 |
Product | cytosine deaminase |
Protein accession | YP_002978682 |
Protein GI | 241258798 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0175267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.144345 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATC TGATCGTGAG AAATGCAAAT GTCCCCGATG GCCGAAAGGG TATTGATATT GGCATCCAGG GCGGCAAGAT CGTTGCCATT GAGAGTAATC TCCAGGCGCA GGCGGGAGAA GAAATCGACG CGACCGGCCG GCTGGTCAGT CCGCCTTTTG TCGATCCGCA TTTCCATATG GACGCCACCC TGTCGCTCGG CCTGCCGCGC ATGAACGTGT CCGGCACCCT GCTCGAAGGC ATTGCGCTCT GGGGGGAGCT GCGCCCGATC GTGACGAAGG AGGAACTGGT CGATCGCGCG CTGCGCTATT GCGATCTGGC GGTCACGCAG GGCCTGCTCT TCATCCGCAG CCATGTCGAT ACCAGTGATC CCAGGCTTGT GACCGTCGAG GCGATGATCG AGGTTCGCGA GAAGGTCGCG CCCTATATCG ATCTGCAGCT GGTCGCCTTT CCCCAGGACG GTTACTACCG CTCGCCGGGC GCGATCGACG CGCTCAACCG CGCCCTCGAC ATGGGCGTCG ATATCGTCGG CGGCATTCCC CACTTCGAAC GGACGATGGG CGAAGGGACG GCGTCTGTCG AGGCGCTCTG CCGCATCGCC GCCGATCGCG GCCTGCCGGT CGATATCCAC TGCGACGAAA CCGACGATCC GCTCTCGCGC CATATCGAGA CGCTGTCTGC GGAAACCATC CGCTTCGGCT TGCAAGGGCG CGTCGCCGGC TCGCATCTGA CCTCGATGCA CTCGATGGAC AATTATTACG TCTCCAAGCT CATTCCGCTG ATGGCGGAGG CCGAGATCAA CGTCATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGC CGGCACGACA CCTATCCGAA ACGCCGCGGC ATGACCCGCG TGCGGGAATT GATGGATGCC GGGCTCAATG TCTCCTTCGG GCACGACTGC GTCATGGACC CCTGGTATTC GATGGGGTCG GGCGACATGC TGGAGGTTGG CCATATGGCA ATCCATGTCG CGCAGATGGC CGGCATCGAC GACAAGAAGA GGATCTTCGA CGCGCTGACC GTCAATTCGG CAAAGACGAT GGGGCTTGCA GATTACGGCC TGGAAAAGGG ATGCAACGCC GACCTCGTCA TCCTCCAGGC GAGCGACACG CTGGAAGCAC TGCGGCTGAA GCCGAACCGC CTGGCGGTCA TCCGCCGCGG CAAGGTCATC GCCCGCTCGG CGCCGCGCAT CGGCGAGCTT TTCCTCGACG GACGCCCGGC ACGGATCGAC GGCGGGTTGG ATTACGTGCC TCGTTATTGA
|
Protein sequence | MFDLIVRNAN VPDGRKGIDI GIQGGKIVAI ESNLQAQAGE EIDATGRLVS PPFVDPHFHM DATLSLGLPR MNVSGTLLEG IALWGELRPI VTKEELVDRA LRYCDLAVTQ GLLFIRSHVD TSDPRLVTVE AMIEVREKVA PYIDLQLVAF PQDGYYRSPG AIDALNRALD MGVDIVGGIP HFERTMGEGT ASVEALCRIA ADRGLPVDIH CDETDDPLSR HIETLSAETI RFGLQGRVAG SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDTYPKRRG MTRVRELMDA GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKKRIFDALT VNSAKTMGLA DYGLEKGCNA DLVILQASDT LEALRLKPNR LAVIRRGKVI ARSAPRIGEL FLDGRPARID GGLDYVPRY
|
| |