Gene Rleg2_4643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4643 
Symbol 
ID6977737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp276892 
End bp277947 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content58% 
IMG OID643393817 
Productagmatinase 
Protein accessionYP_002278635 
Protein GI209546717 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTGGG ACAAAATGCG GCTTGAACAG CTGCGCGCGG AATTTGCGGA CACCAATGGC 
GGCGAGATAT TCGATGAGAC GTTTCGCAAA GTGGCGGAGA AGATCATCTC CAAAAACGGC
ACGCGACTGG CGCCATATTC TGGAGTGCCA ACTTTCCTCA GCGCACCCTA TATGCAGGTC
GCTGCCGAAG AGCCGGACTT CGGCAATCTC CAGGTCGCAA TCACCGGAAT CCCGATGGAT
CTCGGTGTCA CCAACCGTCC GGGTTCTCGT TTCGGACCGC GCGCTCTTCG CGCCATCGAA
AGGATCGGCC CGTACAATCA CGTTCTCGCC ACCGCCCCGG TCTTTGATCT TCGGGTCGCC
GACATCGGCG ACGTTTCGTT CCAGAGCCGT TACCGGCTGG AGCTCAGCCA CGACGACATC
GAAAGGCGCA TTGGCCAAAT CGTTGATGCC GGCATTGCCC CGCTTTCCGT CGGAGGCGAT
CATTCGATCA GCCATCCGAT TTTGAAGGCC ATCGGCCGGC AACAACCAGT CGGTCTTATC
CATATCGATG CCCATTGCGA CACAAGCGGC GCATTTGATC GGACAAAGTT TCATCATGGC
GGACCGTTCC GGAACGCTGT GCTGGACGGC GCACTCGACC CGACAAGAAC AATCCAGATC
GGCATCCGTG GTTCGGCCGA ATATTTGTGG GAATTCTCCT ATGCGTCGGG AATGACCGTG
ATCCATGCCG AGGAGATCAG CGGAATGGGG ATTGCGGCGA TCGTCGCCAA GGCAAAATCC
ATCGTCGGCG ACGGCCCAAC ATATATTTCC TTCGACGTCG ACAGCCTGGA TCCGAGCTTT
GCACCCGGCA CCGGCACACC CGAGGTCGGC GGATTGACCA CGCGTGAGGT TCTTGAATTG
ATCCGCGGCT TGAAAGGCAT AAACCTCGTG GGCGGCGACG TCGTCGAAGT TGCGCCGCAA
TATGACGCAA CGAGCAACAC TGCTCACGCC GCAGCGCAGG TACTCTTCGA GATCTTGAGC
CTGATGGTGT TCAGCCCATC GATCGGCGGA CGGTGA
 
Protein sequence
MVWDKMRLEQ LRAEFADTNG GEIFDETFRK VAEKIISKNG TRLAPYSGVP TFLSAPYMQV 
AAEEPDFGNL QVAITGIPMD LGVTNRPGSR FGPRALRAIE RIGPYNHVLA TAPVFDLRVA
DIGDVSFQSR YRLELSHDDI ERRIGQIVDA GIAPLSVGGD HSISHPILKA IGRQQPVGLI
HIDAHCDTSG AFDRTKFHHG GPFRNAVLDG ALDPTRTIQI GIRGSAEYLW EFSYASGMTV
IHAEEISGMG IAAIVAKAKS IVGDGPTYIS FDVDSLDPSF APGTGTPEVG GLTTREVLEL
IRGLKGINLV GGDVVEVAPQ YDATSNTAHA AAQVLFEILS LMVFSPSIGG R