Gene Information |
Locus tag | Rleg_3988 |
Symbol | |
ID | 8014799 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4064562 |
End bp | 4065557 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 644826557 |
Product | agmatinase |
Protein accession | YP_002977768 |
Protein GI | 241206672 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family |
TIGRFAM ID | [TIGR01230] agmatinase |
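
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4064562
end_bp = 4065557

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which is not translated into an amino acid.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 996 331
```

Under these assumptions the computed values match the "Gene Length" (996 bp) and "Protein Length" (331 aa) fields.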
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0861741 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAGATC CGCACTTCCG CGCCGTGGCG GCGAGCGTAT TCAAGGACGG CGACAGCCGC AAATGGCCCT TCGCCGATCC TGCGACCTTT CTGGATGCCC GTTTCATCGA GAACGGCTTG CGGCCCGAGG TGCTTGAGGC GCTTGACGTG GCTCTGATCG GCGTGCCGAT GGACCTCGGC GTCACCAATC GCGCCGGCGC GCGGCTGGGG CCGCGGGCCG TCCGGGCGAT CGAGCGTATC GGTCCCTACG AGCATGTTCT GCGTGTCGCG CCGATGGGAG GGCTAAAGGT CGCCGATGTC GGCGACGTGC CGATGCGCAG CCGGTTCGGC CTCGCCGAGT GCCATGCCGA CATCGAGGCC TGCTACCGGA TGATCGCGGC AACCGGGGTT ATCCCGCTGT CGGTCGGCGG CGACCATTCG ATCTCCGGCG CCATCCTCAA GGGCCTGGCG GCCGGCCAGC CGGTCGGCAT GATCCACATC GACGCTCATT GCGACACCGC TGGTCCCTAT GAGGGCTCGA AGTTCCATCA CGGCGCGCCC TTCCGCGAGG CGGTTCTGGC GGGCGTGCTC GATCCGAAGC GTACGATCCA GATCGGCATC CGCGGCGGCG GCGAATATCT CTGGGAGTTC TCCTTTGTCT CCGGCATGAC CGTCATCCAC GCCGAAGAGG TGGCGGAGAT GGGCCTCAAG GCTGTGATCG CAAAGGCTCT AGAGGTTGTC GGCGCCGGTC CGACCTATCT CAGTTTCGAC GTCGACAGCC TCGATCCGGC CTTCGCTCCG GGAACCGGCA CGCCGGAAGT CGGCGGGCTT CAGCCGAGGG AGGCCCTGAC CCTGCTGCGC GGCTTCAAAG GCATCAACCT CATCGGCGGC GACGTCGTGG AAATCGCGCC GCAATACGAC AACACCACCA ACACCGCGCA GATCGCCGCG CAGGTCCTGT TCGAACTCCT GTGCCTCGCG ATGTTCAGTC CCGCGGTCAG GACAAAGCTG ACCTGA
|
Protein sequence | MEDPHFRAVA ASVFKDGDSR KWPFADPATF LDARFIENGL RPEVLEALDV ALIGVPMDLG VTNRAGARLG PRAVRAIERI GPYEHVLRVA PMGGLKVADV GDVPMRSRFG LAECHADIEA CYRMIAATGV IPLSVGGDHS ISGAILKGLA AGQPVGMIHI DAHCDTAGPY EGSKFHHGAP FREAVLAGVL DPKRTIQIGI RGGGEYLWEF SFVSGMTVIH AEEVAEMGLK AVIAKALEVV GAGPTYLSFD VDSLDPAFAP GTGTPEVGGL QPREALTLLR GFKGINLIGG DVVEIAPQYD NTTNTAQIAA QVLFELLCLA MFSPAVRTKL T
|
| |
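
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a simple GC percentage calculation; `clean_sequence` and `gc_content` are hypothetical helper names, and the way the database rounds its "GC content" field is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two and last two blocks of the gene sequence shown in the record.
head = "ATGGAAGATC CGCACTTCCG"
tail = "GACAAAGCTG ACCTGA"

print(clean_sequence(head)[:3])   # ATG (start codon)
print(clean_sequence(tail)[-3:])  # TGA (stop codon)
```

Passing the full "Gene sequence" value to `gc_content` should give a figure comparable to the GC content reported above, subject to whatever rounding the database applies.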