Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6355 |
Symbol | |
ID | 6983657 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011371 |
Strand | - |
Start bp | 26 |
End bp | 1315 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643399355 |
Product | cytosine deaminase |
Protein accession | YP_002284111 |
Protein GI | 209552196 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.616526 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.149808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTCGATT TGATCGTCAG AAATGCAAAT CTCCCCGATG GCTCAGAGAG TATCGATATC GGCATCCAGG GCGGCAAGAT CATCGCTCTT GAGCACAATC TCCAGGCGCA GGCGGGTGAA GAGATCGACG CGACCGGTCG GCTGGTCAGC CCGCCCTTCG TCGATCCGCA TTTCCATATG GATGCCACCC TGTCCCTCGG CTTGCCGCGC ATGAATGTAT CCGGCACGCT GCTTGAAGGA ATCGCGCTCT GGGGCGAGCT TCGCCCGATC GTGACAAAAG AAGAATTGGT CGACCGTGCG TTGCGCTATT GCGACCTGGC TGTCACCCAG GGGCTGCTCT TCATTCGCAG CCATGTCGAC ACCAGCGATC CCAGGCTGGT GACCGTCGAG GCGATGATCG AGGTTCGCGA AAAGGTCGCC CCCTATATCG ACCTGCAATT GGTCGCTTTT CCTCAGGATG GCTATTACCG CTCGCCCGGC GCGATCGACA CCCTCAACCG CGCCCTCGAC ATGGGTGTCG ATATCGTTGG CGGCATTCCC CACTTCGAAC GGACGATGGG CGAAGGCACG GCGTCGGTCG AGGCGCTCTG CCGCATCGCT GCCGACCGCG GCCTGCCGGT CGACATGCAT TGCGACGAAA CCGATGATCC GCACTCGCGC CATATCGAGA CGCTGGCTGC GGAGACCATC CGCTTCGGGC TCAAGGGGCG CGTTGCCGGC TCGCATCTGA CCTCAATGCA TTCGATGGAT AATTATTATG TCTCCAAGCT CATTCCGCTG ATGGCGGAGG CCGAGATCAA CGTCATCCCC AATCCGCTGA TCAACATCAT GCTGCAGGGC CGGCACGACA CCTATCCGAA ACGCCGCGGC ATGACCCGCG TGCGCGAATT GATGGATGCC GGGCTTAATG TTTCCTTCGG GCACGATTGC GTCATGGACC CCTGGTACTC GATGGGATCG GGCGACATGC TGGAAGTCGG CCATATGGCG ATCCATGTCG CGCAGATGGC CGGCATCGAC GACAAGAAGA GGATATTCGA GGCGCTGACC GTCAATTCGG CGAGGACGAT GGGGCTTGCA GGCTATGGCC TGGAAAAGGG ATGCAACGCC GACCTCGTCA TCCTCCAGGC GAGCGACACG CTGGAAGCGC TGCGGCTGAA GCCGAGCCGG CTGGCGGTGA TCCGGCGCGG TAAGGTCGTC GCCCGCTCGG CGCCGCGCAT CGACGAGCTT TTCCTGGACG GGCGCCCTGC CCGGATCGAT GGCGGCCTGG ACTATATCCC CCGCTATTGA
|
Protein sequence | MFDLIVRNAN LPDGSESIDI GIQGGKIIAL EHNLQAQAGE EIDATGRLVS PPFVDPHFHM DATLSLGLPR MNVSGTLLEG IALWGELRPI VTKEELVDRA LRYCDLAVTQ GLLFIRSHVD TSDPRLVTVE AMIEVREKVA PYIDLQLVAF PQDGYYRSPG AIDTLNRALD MGVDIVGGIP HFERTMGEGT ASVEALCRIA ADRGLPVDMH CDETDDPHSR HIETLAAETI RFGLKGRVAG SHLTSMHSMD NYYVSKLIPL MAEAEINVIP NPLINIMLQG RHDTYPKRRG MTRVRELMDA GLNVSFGHDC VMDPWYSMGS GDMLEVGHMA IHVAQMAGID DKKRIFEALT VNSARTMGLA GYGLEKGCNA DLVILQASDT LEALRLKPSR LAVIRRGKVV ARSAPRIDEL FLDGRPARID GGLDYIPRY
|
| |