Gene Rleg_2537 details


Gene Information

Locus tag: Rleg_2537
Symbol:
ID: 8013507
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2537022
End bp: 2538233
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 62%
IMG OID: 644825117
Product: argininosuccinate synthase
Protein accession: YP_002976347
Protein GI: 241205251
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0137] Argininosuccinate synthase
TIGRFAM ID: [TIGR00032] argininosuccinate synthase
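
The coordinates and lengths listed above are mutually consistent, which a short sketch can check. The function names are illustrative only and are not part of any database API; the sketch assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one untranslated terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Start bp and End bp taken from the record above.
length = gene_length_bp(2537022, 2538233)
print(length, protein_length_aa(length))  # prints: 1212 403
```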


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.994167
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGAAGA TAGAGAAAAT TGTGCTTTCT TATTCGGGCG GGCTGGATAC CTCGATCATC 
CTCAAATGGC TGCAGGAGAC CTACGGATGC GAGGTGGTGA CGTTCACCGC CGATCTCGGC
CAGGGCGAGG AGCTCGAGCC GGCGCGGGCC AATGCGGAGA TGGTGGGAGT CAAGGACATC
CGCATCGTCG ATCTGCGCGA GGAGTTCGTC AGCGACTTCG TCTTTCCGAT GCTGCGGGCG
AACGCGCTCT ATGAGGGGCA GTACCTGCTG GGCAGTTCCA TCGCACGGCC GCTGATTGCC
AAGCATCTGG TCGGCATAGC CCGGGAGGTT GGCGCCGACG CCGTTGCCCA CGGCGCGACC
GGCAAGGGCA ATGATCAGAT CCGGTTCGAG CTGGCGGTCA ATGCGCTCGA CCCGTCGATT
AAGGTCATTG CTCCCTGGCG CCAATGGAAC ATCCGCTCCC GCATGCAGCT TCTGGAATAT
GCCGAGAAGC ATCATATCCC GGTGCCGAGC GACAAGCGCG GCGAAGCGCC GTTCTCGATC
GACGCCAACC TGCTGCACAC CTCGACCGAA GGCAAGATTC TCGAAAATCC GGCGGAGGTC
GCGCCCGATC ATGTCTATCA GCGCACGGTC GATCCCGTCG ACGCGCCCGA CACACCCGAG
ATCGTCACCA TCGGCTTCGA TTGGGGCGAT CCGGTTTCCG TCAACGGCAA GTCCATGACA
CCGGCCGCAT TGCTGACCGA GCTCAATGGA CTGGGCGGCC GGCATGGCAT CGGGCGGCTC
GATCTGGTCG AAAACCGCTT CATCGGCATG AAGTCGAGGG GAATATACGA AACGCCGGGA
GGCACGATCC TGCTCACGGC ACATCGCGGC ATTGAATCGA TCACGCTCGA CCGCGCCGCC
GCCCATCTGA AGGACGAGAT CATGCCCCGC TACGCCGAGC TGATCTACAA TGGTTTCTGG
TTCGCTCCCG AGCGGGAAAT GCTGCAAGCC CTGATCGACC ACAGCCAGGC TTTCGTCAGC
GGCGAAGTGA CGCTCAGGCT CTACAAGGGA AGCGCCTCGG TCATCTCCCG CGCCTCCCCC
TGCTCCCTCT ACTCCGCCGA CCTCGTCACC TTCGAAGAAA GCACCATCGC CTTCGATCAT
CACGACGCGG AAGGTTTCAT CAGGCTCAAC GGATTGCGGC TCAGGAGCTG GGTCGCCCGC
AACGGCAGAT GA
 
Protein sequence
MTKIEKIVLS YSGGLDTSII LKWLQETYGC EVVTFTADLG QGEELEPARA NAEMVGVKDI 
RIVDLREEFV SDFVFPMLRA NALYEGQYLL GSSIARPLIA KHLVGIAREV GADAVAHGAT
GKGNDQIRFE LAVNALDPSI KVIAPWRQWN IRSRMQLLEY AEKHHIPVPS DKRGEAPFSI
DANLLHTSTE GKILENPAEV APDHVYQRTV DPVDAPDTPE IVTIGFDWGD PVSVNGKSMT
PAALLTELNG LGGRHGIGRL DLVENRFIGM KSRGIYETPG GTILLTAHRG IESITLDRAA
AHLKDEIMPR YAELIYNGFW FAPEREMLQA LIDHSQAFVS GEVTLRLYKG SASVISRASP
CSLYSADLVT FEESTIAFDH HDAEGFIRLN GLRLRSWVAR NGR
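
The relationship between the gene sequence and the protein sequence can be reproduced with a minimal sketch. It is not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the codon table is the standard one, which translation table 11 shares for codon-to-residue assignments (table 11 differs in the start codons it permits, which does not matter for a gene beginning with ATG). Whitespace is stripped because the sequences above are printed in space-separated blocks of ten.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGACGAAGA TAGAGAAAAT TGTGCTTTCT"
print(translate(first_blocks))  # prints: MTKIEKIVLS
```

Passing the full gene sequence to `translate` and `gc_fraction` gives values that can be compared against the protein sequence and the GC content reported in the record.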