Gene Rleg_3988 details


Gene Information

Locus tag: Rleg_3988
Symbol:
ID: 8014799
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 4064562
End bp: 4065557
Gene Length: 996 bp
Protein Length: 331 aa
Translation table: 11
GC content: 66%
IMG OID: 644826557
Product: agmatinase
Protein accession: YP_002977768
Protein GI: 241206672
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0010] Arginase/agmatinase/formiminoglutamate hydrolase, arginase family
TIGRFAM ID: [TIGR01230] agmatinase
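
The length fields above can be cross-checked against the coordinates. The following is a minimal Python sketch, not part of the original record. It assumes that the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the record states neither convention explicitly.

```python
# Values copied from the record above.
start_bp = 4064562
end_bp = 4065557

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "996 331"
```

Under these assumptions the result agrees with the listed gene length (996 bp) and protein length (331 aa).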


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.0861741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAAGATC CGCACTTCCG CGCCGTGGCG GCGAGCGTAT TCAAGGACGG CGACAGCCGC 
AAATGGCCCT TCGCCGATCC TGCGACCTTT CTGGATGCCC GTTTCATCGA GAACGGCTTG
CGGCCCGAGG TGCTTGAGGC GCTTGACGTG GCTCTGATCG GCGTGCCGAT GGACCTCGGC
GTCACCAATC GCGCCGGCGC GCGGCTGGGG CCGCGGGCCG TCCGGGCGAT CGAGCGTATC
GGTCCCTACG AGCATGTTCT GCGTGTCGCG CCGATGGGAG GGCTAAAGGT CGCCGATGTC
GGCGACGTGC CGATGCGCAG CCGGTTCGGC CTCGCCGAGT GCCATGCCGA CATCGAGGCC
TGCTACCGGA TGATCGCGGC AACCGGGGTT ATCCCGCTGT CGGTCGGCGG CGACCATTCG
ATCTCCGGCG CCATCCTCAA GGGCCTGGCG GCCGGCCAGC CGGTCGGCAT GATCCACATC
GACGCTCATT GCGACACCGC TGGTCCCTAT GAGGGCTCGA AGTTCCATCA CGGCGCGCCC
TTCCGCGAGG CGGTTCTGGC GGGCGTGCTC GATCCGAAGC GTACGATCCA GATCGGCATC
CGCGGCGGCG GCGAATATCT CTGGGAGTTC TCCTTTGTCT CCGGCATGAC CGTCATCCAC
GCCGAAGAGG TGGCGGAGAT GGGCCTCAAG GCTGTGATCG CAAAGGCTCT AGAGGTTGTC
GGCGCCGGTC CGACCTATCT CAGTTTCGAC GTCGACAGCC TCGATCCGGC CTTCGCTCCG
GGAACCGGCA CGCCGGAAGT CGGCGGGCTT CAGCCGAGGG AGGCCCTGAC CCTGCTGCGC
GGCTTCAAAG GCATCAACCT CATCGGCGGC GACGTCGTGG AAATCGCGCC GCAATACGAC
AACACCACCA ACACCGCGCA GATCGCCGCG CAGGTCCTGT TCGAACTCCT GTGCCTCGCG
ATGTTCAGTC CCGCGGTCAG GACAAAGCTG ACCTGA
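
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The sketch below is illustrative only: the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses just the first line of the sequence, so its result is not expected to match the whole-gene GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAGATC CGCACTTCCG CGCCGTGGCG GCGAGCGTAT TCAAGGACGG CGACAGCCGC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence block to the same two helpers would give a value to compare with the GC content field.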
 
Protein sequence
MEDPHFRAVA ASVFKDGDSR KWPFADPATF LDARFIENGL RPEVLEALDV ALIGVPMDLG 
VTNRAGARLG PRAVRAIERI GPYEHVLRVA PMGGLKVADV GDVPMRSRFG LAECHADIEA
CYRMIAATGV IPLSVGGDHS ISGAILKGLA AGQPVGMIHI DAHCDTAGPY EGSKFHHGAP
FREAVLAGVL DPKRTIQIGI RGGGEYLWEF SFVSGMTVIH AEEVAEMGLK AVIAKALEVV
GAGPTYLSFD VDSLDPAFAP GTGTPEVGGL QPREALTLLR GFKGINLIGG DVVEIAPQYD
NTTNTAQIAA QVLFELLCLA MFSPAVRTKL T
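
The protein sequence can be related to the gene sequence by codon-wise translation. The sketch below is a minimal pure-Python illustration, not the method used to produce this record. The function name `translate` is hypothetical. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and it only translates the first line of the gene sequence.

```python
from itertools import product

# Codon-to-amino-acid assignments shared by the standard code and table 11,
# listed in TCAG order for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA codon by codon, stopping at the first stop codon.

    Whitespace is ignored; a trailing partial codon is dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGGAAGATC CGCACTTCCG CGCCGTGGCG GCGAGCGTAT TCAAGGACGG CGACAGCCGC"
print(translate(first_line))  # prints "MEDPHFRAVAASVFKDGDSR"
```

The output matches the first 20 residues of the protein sequence above (MEDPHFRAVA ASVFKDGDSR).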