Gene Rleg_3858 details


Gene Information

Locus tag: Rleg_3858
Symbol:
ID: 8014683
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3928941
End bp: 3930344
Gene Length: 1404 bp
Protein Length: 467 aa
Translation table: 11
GC content: 63%
IMG OID: 644826428
Product: argininosuccinate lyase
Protein accession: YP_002977640
Protein GI: 241206544
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
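
The Start bp, End bp, Gene Length and Protein Length fields above are linked by simple arithmetic. The sketch below checks that they agree, assuming 1-based inclusive coordinates and a CDS that ends in a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3928941, 3930344)
print(length_bp)                  # 1404
print(protein_length(length_bp))  # 467
```

Under these assumptions both values match the Gene Length and Protein Length fields of the record.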


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.129189
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACG ACACCACGGA CACCAAATCC TCCAACCAGA TGTGGGGCGG TCGCTTCGCC 
TCCGGCCCGG ACGCGATCAT GGAGGAGATA AATGCCTCGA TCGGTTTCGA CAAGAAGCTA
TTCGCGCAGG ACATCCGCGG TTCGATTGCC CATGCGACGA TGCTCGCCCA TCAGGGGATC
ATTTCCGCTG ACGATAAGGA CAAGATCGTT CACGGGCTAA ACACGATCCT GTCAGAAATC
GAAAGCGGCA ATTTCGAATT CTCGCGCCAG CTCGAAGACA TCCATATGAA TGTCGAAGCA
CGCCTGGCGA CGCTGATCGG ACCGGCCGCC GGCCGGTTGC ACACCGCCCG CTCGCGCAAT
GATCAGGTGG CGCTCGACTT CCGTCTCTGG GTGAAGGAAG AGCTGCAGAA GACCGAGCAG
ATGCTGACCG GCCTGATCGC GGCTTTCCTC GACCGCGCCG AAGAACATGC CGAAAGCGTC
ATGCCGGGCT TCACCCATCT GCAGGCAGCC CAGCCCGTTA CCTTCGGCCA TCACTGCATG
GCCTATGTCG AAATGTTCGG CCGCGATCGC TCGCGCGTGC GCCACGCCAT CGAACATCTG
GATGAAAGCC CGATCGGTGC CGCCGCACTT GCCGGCACCG GCTATCCGAT CGACCGCCAC
ATGACGGCCA ACGCGCTCGG TTTCCGCGAG CCGACCCGCA ACTCCATCGA TACGGTCTCC
GACCGCGATT TCGCCATCGA ATTCCTGTCG ATCGCGGCGA TTGCGGGCAT GCACCTGTCG
CGTCTGGCAG AAGAGATCGT CATCTGGTCG ACCCCGCAAT TCGGTTTTGT GCGTCTCTCC
GACGCCTTCT CGACCGGCTC GTCGATCATG CCGCAGAAGA AGAACCCGGA TGCCGCCGAA
CTGGTGCGCG CCAAGACCGG CCGCATCAAC GGCTCGCTGG TGGCGCTGCT GACGATCATG
AAGGGCCTGC CGCTCGCCTA TTCCAAGGAC ATGCAGGAAG ACAAGGAACA GGTCTTCGAC
GCCGCCGAGA GCCTGGAACT GGCAATTGCC GCCATGACCG GCATGGTGCG CGACATGACC
GTCAACACCG CGCGCATGAA GGCTGCGGCC GGCTCCGGCT ATTCGACGGC GACCGATCTT
GCCGACTGGC TGGTGCGCGA AGCGGGCCTC CCCTTCCGCG ACGCCCATCA CGTCACCGGC
CGCGCCGTAG CGCTCGCCGA AAGCAAGGGC TGCGACCTTG CCGAGCTGCC GCTCTCCGAT
CTGCAGGCGA TCCATCCCGA CATCACCGAC AAGGTCTACA ACGTGCTGAC CGTCGAGGCC
TCAGTCGCCA GCCGCAAGAG CTTCGGCGGC ACCGCGCCGT CCGAAGTGCG CAGGCAGATC
GCCTTCTGGC GCGCCCGCAA CTAA
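
The GC content field in the gene information can be recomputed from the gene sequence above. A minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence (the record does not define it); `gc_content` is a hypothetical helper, and whitespace is stripped because the sequence is laid out in blocks of 10 bases.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases, ignoring whitespace used for layout."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First line of the gene sequence above; pass the full sequence
# to compare against the GC content field of the record.
first_line = "ATGGCCGACG ACACCACGGA CACCAAATCC TCCAACCAGA TGTGGGGCGG TCGCTTCGCC"
print(f"{gc_content(first_line):.1%}")
```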
 
Protein sequence
MADDTTDTKS SNQMWGGRFA SGPDAIMEEI NASIGFDKKL FAQDIRGSIA HATMLAHQGI 
ISADDKDKIV HGLNTILSEI ESGNFEFSRQ LEDIHMNVEA RLATLIGPAA GRLHTARSRN
DQVALDFRLW VKEELQKTEQ MLTGLIAAFL DRAEEHAESV MPGFTHLQAA QPVTFGHHCM
AYVEMFGRDR SRVRHAIEHL DESPIGAAAL AGTGYPIDRH MTANALGFRE PTRNSIDTVS
DRDFAIEFLS IAAIAGMHLS RLAEEIVIWS TPQFGFVRLS DAFSTGSSIM PQKKNPDAAE
LVRAKTGRIN GSLVALLTIM KGLPLAYSKD MQEDKEQVFD AAESLELAIA AMTGMVRDMT
VNTARMKAAA GSGYSTATDL ADWLVREAGL PFRDAHHVTG RAVALAESKG CDLAELPLSD
LQAIHPDITD KVYNVLTVEA SVASRKSFGG TAPSEVRRQI AFWRARN
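
The protein sequence can be checked against the gene sequence by translating it with translation table 11, as listed in the gene information. The sketch below builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G); `translate` is a hypothetical helper, and it does not handle alternative start codons, which is acceptable here because this gene starts with ATG.

```python
from itertools import product

# NCBI translation table 11 assigns the same amino acids as the standard
# code; it differs only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence; each full line holds 60 bases,
# so every line starts in frame.
print(translate("ATGGCCGACG ACACCACGGA CACCAAATCC TCCAACCAGA TGTGGGGCGG TCGCTTCGCC"))
# MADDTTDTKSSNQMWGGRFA
print(translate("GCCTTCTGGC GCGCCCGCAA CTAA"))
# AFWRARN
```

Both outputs match the start and end of the protein sequence, and the final TAA codon accounts for the difference between 1404 / 3 = 468 codons and the 467 aa protein length.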