Gene Rleg2_4872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_4872 
Symbol 
ID6977966 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011368 
Strand
Start bp510397 
End bp511452 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content56% 
IMG OID643394030 
Productagmatinase 
Protein accessionYP_002278848 
Protein GI209546930 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.739258 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.617113 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATC CGGAAAAACT GGCACGCCTT CGGGAAAGAT ACGCGAACGC GTCAGGCGGC 
GACATCTTCG ACACGGAATT TGCCGTCGTC GCACGATCAC AGTTTACGAC TGGTGACAAG
CGAAAGTGGC CATTCGCTGG GATCCCGACC TTGCTGGACG CTCCATGTCG TCCGGAGTTC
CAGGATCTTC CCGATTTTGG CGGTCTGGAT ATTGCGCTCT TGGGCGTTCC GATGGATCTC
GGAGTGACGA ACCGCAATGG CAGTCGATTT GGTCCTCGCG CCGTTCGAAC AGTCGAACGC
ATCGGGCCCT ATGACCACGT CCTCAAATGC GCTCCCTTTG GAATGAGAAA AATCGCCGAC
ATAGGCGATG TTCCAATGCA AAGCAGGTAC GATCTTGCCC AATGCCATCA TGATATCGAA
CAGTTTTACA AGAAGCTAAT TGCGGCTGGC GTCAGTCCGC TTTCTGTGGG GGGCGACCAT
TCCATCACGT CGTCAATACT CAGGGCCCTT GGCGAAAAAC AGCCGGTTGG AATGATCCAC
ATCGATGCCC ATTGCGATAC CGCGGGTCCT TACGAGGGGG CGAAGTTTCA GCATGGCGGT
CCGTTTCGGC TTGCCGTTCT CGATGGCGTC CTTGATCCTG ATCGTACAAT CCAGATTGGG
ATTCGTGGCG GTGCGGAGTA TCTCTGGGAG TTCTCCTACG AAAGCGGGAT GACCGTCATT
CATGCCGAGG AAATCAAAGG TATCGGCATG GAAGCACTCA TCGCTCGCGC TCGCCAGATC
GTTGGTACTG GCCCAACCTA TATTTCCTTC GACATCGACA GCATCGATCC GGGATTCGCA
CCGGGCACCG GTACGCCGGA GGTTGGGGGA TTGATGCCGC GCGAGGTTCT CGAGCTTTTG
CGTGGCCTCA AGGGGCTTAA CGTGGTGGGC GCTGACGTCG TCGAGGTGGC TCCCCAATAC
GATGCAACGA CAAACACTGC CCAGATCGCT GCGCAGATGC TGTTCACCAT CTTATGTCTG
ATGGTGCATG CGAAGAGCGA ACCGGCAGGA GGTTGA
 
Protein sequence
MNNPEKLARL RERYANASGG DIFDTEFAVV ARSQFTTGDK RKWPFAGIPT LLDAPCRPEF 
QDLPDFGGLD IALLGVPMDL GVTNRNGSRF GPRAVRTVER IGPYDHVLKC APFGMRKIAD
IGDVPMQSRY DLAQCHHDIE QFYKKLIAAG VSPLSVGGDH SITSSILRAL GEKQPVGMIH
IDAHCDTAGP YEGAKFQHGG PFRLAVLDGV LDPDRTIQIG IRGGAEYLWE FSYESGMTVI
HAEEIKGIGM EALIARARQI VGTGPTYISF DIDSIDPGFA PGTGTPEVGG LMPREVLELL
RGLKGLNVVG ADVVEVAPQY DATTNTAQIA AQMLFTILCL MVHAKSEPAG G