Gene Rleg2_3565 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3565 
Symbol 
ID6982326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3692691 
End bp3694091 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content64% 
IMG OID643398290 
Productargininosuccinate lyase 
Protein accessionYP_002283058 
Protein GI209551141 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.754251 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.282214 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGACA CCACGGATAC CAAATCTTCA AACCAGATGT GGGGCGGGCG TTTCGCTTCC 
GGCCCGGACG CGATCATGGA GGAGATAAAT GCCTCGATCG GTTTCGACAA GAAGCTTTTC
GCGCAGGATA TCCGCGGTTC GATTGCCCAC GCGACCATGC TCGCCCATCA GGGGATCATA
TCGGCCGAGG ATAAGGACAA GATCGTTCAC GGGCTAAACA CGATCCTGTC AGAAATCGAA
AGCGGCAATT TCGAATTTTC GCGCCGGCTC GAAGACATCC ATATGAATAT CGAAGCGCGC
CTGGCGACGC TGATTGGACC GGCCGCCGGC CGGCTGCACA CCGCCCGCTC GCGCAACGAC
CAGGTGGCGC TCGACTTCCG CCTCTGGGTG AAGGAAGAGC TCGAGAAGAC CGAGAAGATG
CTGACCGGCC TGATCGCCGC CTTCCTCGAC CGAGCCGACG AGCACGCCGA AAGCGTCATG
CCGGGCTTCA CCCATCTGCA GACCGCCCAG CCCGTCACCT TCGGCCATCA CTGCATGGCC
TATGTCGAAA TGTTCGGCCG CGACCGCTCA CGCGTGCGCC ACGCCATCGA GCATCTGGAC
GAAAGCCCGA TTGGTGCGGC CGCCCTTGCC GGCACCGGCT ATCCGATCGA TCGCCATATG
ACCGCCAAGG CGCTCGGTTT CCGCGAGCCG ACCCGCAACT CCATCGACAC CGTCTCCGAC
CGCGATTTCG CCATCGAGTT CCTGTCGATC GCGGCAATAA CAGGCATGCA CCTGTCGCGC
CTGGCCGAAG AGATCGTCAT CTGGTCGACC CCGCAATTCG GTTTCGTCCG CCTTTCCGAC
GCCTTCTCCA CCGGCTCCTC GATCATGCCG CAGAAGAAGA ACCCGGATGC CGCCGAACTG
GTGCGTGCCA AGACCGGCCG CATCAACGGC TCGCTGGTGG CGCTGCTGAC GATCATGAAG
GGCCTGCCGC TCGCTTATTC CAAGGACATG CAGGAAGACA AGGAACAGGT CTTCGACGCG
GCCGAGAGCC TGGAACTGGC AATCGCCGCC ATGACCGGCA TGGTGCGCGA CATGACCGTC
AACACCGCGC GCATGAAGGC GGCGGCCGGC TCCGGCTTCT CGACGGCGAC CGACCTTGCC
GACTGGCTGG TGCGCGAAGC GGGCCTGCCG TTCCGCGATG CCCATCACGT CACCGGTCGG
GCCGTGGCGC TCGCCGAAAG CAAGGGCTGC GATCTGGCCG AACTGCCGCT CGCCGACCTG
CAGGCGATCC ATGCCTCGAT CACCGACAAG GTCTACGACG TGCTGACCGT CGAAGCCTCG
GTCGCCAGCC GCAAGAGCTT CGGCGGCACC GCACCCTCCG AAGTGCGCAG GCAGATCGCT
TTCTGGCGCG CCCGCAACTG A
 
Protein sequence
MADTTDTKSS NQMWGGRFAS GPDAIMEEIN ASIGFDKKLF AQDIRGSIAH ATMLAHQGII 
SAEDKDKIVH GLNTILSEIE SGNFEFSRRL EDIHMNIEAR LATLIGPAAG RLHTARSRND
QVALDFRLWV KEELEKTEKM LTGLIAAFLD RADEHAESVM PGFTHLQTAQ PVTFGHHCMA
YVEMFGRDRS RVRHAIEHLD ESPIGAAALA GTGYPIDRHM TAKALGFREP TRNSIDTVSD
RDFAIEFLSI AAITGMHLSR LAEEIVIWST PQFGFVRLSD AFSTGSSIMP QKKNPDAAEL
VRAKTGRING SLVALLTIMK GLPLAYSKDM QEDKEQVFDA AESLELAIAA MTGMVRDMTV
NTARMKAAAG SGFSTATDLA DWLVREAGLP FRDAHHVTGR AVALAESKGC DLAELPLADL
QAIHASITDK VYDVLTVEAS VASRKSFGGT APSEVRRQIA FWRARN