Gene Rleg_6439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6439 
Symbol 
ID8017052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp158595 
End bp159650 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content59% 
IMG OID644828234 
Productagmatinase 
Protein accessionYP_002979434 
Protein GI241554221 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0010] Arginase/agmatinase/formimionoglutamate hydrolase, arginase family 
TIGRFAM ID[TIGR01230] agmatinase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0137522 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTGGG ATAAAACGCG GCTCGAGCAA CTGCGCGCAG AATTTGCAGA CGCCAATGGC 
GGCGAGATAT TCGATGAGAA GTTTCGCAAA GTAGCGGAGA AGATCATCTC CAAGAGCGGC
ACGAGACTGG CGCCATACGC CGGAGTGCCG ACCTTCCTCA GCGCACCCTA CATGCAGGTC
GCGGCCGATG ATCCGGATTT CGGCAATCTC CAGGTTGCGA TCACCGGGAT CCCCATGGAT
CTTGGCGTCA CCAATCGTCC AGGCTCGCGT TTCGGACCGA GAGCACTTCG CGCCATCGAA
AGGATCGGCC CCTACAATCA TGTTCTCGCT ACGGCGCCGG TCTTCGATCT TCGGGTCGCC
GATATCGGCG ACATATCGTT CCAAAGCCGT TACCGGTTGG AACTCAGCCA CGACGACATC
GAAAAGCGGA TCGGCCAGAT CGTCGATGCC GGCGTGGCCC CGCTTTCCGT CGGAGGCGAT
CATTCCATCA GCCACCCGAT ATTGAAGGCC ATCGGCCGGC ACCAACCGGT CGGCCTCATC
CATATTGATG CCCATTGCGA TACAAGCGGC GCTTTCGATC AGACGAAGTT TCATCACGGT
GGGCCGTTCC GCAATGCGGT GCTTGACGGC GTGCTCGATC CGACACGGAC TATCCAGATC
GGCATCCGCG GTTCAGCGGA ATATTTGTGG GAATTCTCCT ACGCTTCGGG AATGACCGTG
ATCCACGCAG AGGACATCAG CGGAATGGGG ATTGCGGCCG TCATTGCCAA GGCAAAATCC
ATCGTCGGCG ACGGCCCCAC CTATCTTTCC TTCGACGTCG ACAGCCTCGA TCCGAGCTTT
GCGCCCGGCA CGGGCACGCC CGAGGTCGGT GGATTGACCA CGCGTGAAGT CCTTGAACTG
ATACGCGGAC TGAAGGGGAT AAATCTGGTG GGTGGTGACG TCGTCGAAGT CGCCCCGCAA
TATGACGCAA CGACCAACAC GGCGCACGCC GCAGCACAGG TGCTCTTTGA GGTCCTGAGC
CTCATGGTGT TTAGTCCATC GATCGGCAGG CGCTAA
 
Protein sequence
MVWDKTRLEQ LRAEFADANG GEIFDEKFRK VAEKIISKSG TRLAPYAGVP TFLSAPYMQV 
AADDPDFGNL QVAITGIPMD LGVTNRPGSR FGPRALRAIE RIGPYNHVLA TAPVFDLRVA
DIGDISFQSR YRLELSHDDI EKRIGQIVDA GVAPLSVGGD HSISHPILKA IGRHQPVGLI
HIDAHCDTSG AFDQTKFHHG GPFRNAVLDG VLDPTRTIQI GIRGSAEYLW EFSYASGMTV
IHAEDISGMG IAAVIAKAKS IVGDGPTYLS FDVDSLDPSF APGTGTPEVG GLTTREVLEL
IRGLKGINLV GGDVVEVAPQ YDATTNTAHA AAQVLFEVLS LMVFSPSIGR R