Gene Rleg2_3089 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3089 
Symbol 
ID6981834 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3151415 
End bp3152713 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content64% 
IMG OID643397799 
Productadenylosuccinate synthetase 
Protein accessionYP_002282582 
Protein GI209550665 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0104] Adenylosuccinate synthase 
TIGRFAM ID[TIGR00184] adenylosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.126084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.070043 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGAACG TAGTCGTGGT CGGTTCGCAA TGGGGTGACG AAGGCAAGGG CAAGATTGTC 
GACTGGCTTT CGGAGCGCGC GGATATCGTT GTGCGCTATC AGGGCGGACA TAATGCCGGC
CATACGCTCG TCATCGACGG CACCAGTTAC AAGCTGTCGC TGCTGCCCTC CGGCGTCGTG
CGCCCGGGCA AAATGGCGGT GATCGGCAAC GGCGTCGTCG TCGATCCGCA TGCGCTGATC
GCCGAGATCG GTCGGCTGGA AGCCCAGGGC GTGACAGTGA CGCCCGACAA TCTGCGTATC
GCCGACAATG CGACGCTCAT TCTGTCGCTG CACCGCGAAC TCGACGCGAT GCGCGAGGAT
GCGGCGTCGA ACAGCGGCAC CAAGATCGGC ACGACACGCC GCGGCATCGG CCCTGCATAT
GAAGACAAGG TCGGCCGCCG CGCCATCCGG GTGATGGATC TTGCCGATCT CGACAGCCTG
GCCGGCAAGG TCGACCGTAT TCTGACGCAT CACAATGCGC TTCGCCGCGG CCTCGGCGTC
GCCGAAGTCA GCCACCAGAC GATCATGGAC GAACTGACCT CGATCGCCGA TCGGGTGCTG
CCGTTCCGTG ACACCGTCTG GCTTTTCCTC GACAAGGAGC GCCGCAAGGG CTCCCGCATC
CTCTTCGAAG GCGCGCAGGG CAGCCTGCTC GACATCGACC ACGGCACCTA TCCTTTCGTG
ACCTCGTCGA ACACCGTGGC CGGCCAGGCC GCGGCCGGTT CCGGCATGGG GCCGGGCTCG
CTCGGCTATA TCCTCGGCAT CACCAAGGCC TATACGACGC GTGTCGGCGA AGGCCCGTTC
CCGACGGAGC TGAAGGATGC GATCGGTGAG TTCCTTGGCG AAAAAGGCCA TGAGTTCGGC
GTGGTGACCG GGCGCAAGCG GCGTTGCGGC TGGTTCGATG CCGCCCTCGT GCGCCAGTCG
ATCGCCACCA ACGGCATCAC GGGCATCGCG CTCACCAAGC TCGACGTGCT CGACGGCCTT
GAGGAGTTGA AGATCTGCGT CGGTTACATG CTCGACGGCG AACAGATTGA TCATCTTCCC
GCAAGCCAGG GAGCGCAAGC TAGGGTCGAA CCGGTCTATA TCACGTTGGA AGGGTGGAAG
GAATCGACCG TCGGCGCCCG CAGTTGGGCG GACCTGCCGG CACAGGCGAT CAAATATGTT
CGCCAGGTCG AAGAGCTGAT CGGCGCGCCT GTCGCGCTGC TTTCCACCAG CCCGGAGCGG
GATGACACGA TACTTGTGAC CGATCCGTTT GAGGATTAA
 
Protein sequence
MTNVVVVGSQ WGDEGKGKIV DWLSERADIV VRYQGGHNAG HTLVIDGTSY KLSLLPSGVV 
RPGKMAVIGN GVVVDPHALI AEIGRLEAQG VTVTPDNLRI ADNATLILSL HRELDAMRED
AASNSGTKIG TTRRGIGPAY EDKVGRRAIR VMDLADLDSL AGKVDRILTH HNALRRGLGV
AEVSHQTIMD ELTSIADRVL PFRDTVWLFL DKERRKGSRI LFEGAQGSLL DIDHGTYPFV
TSSNTVAGQA AAGSGMGPGS LGYILGITKA YTTRVGEGPF PTELKDAIGE FLGEKGHEFG
VVTGRKRRCG WFDAALVRQS IATNGITGIA LTKLDVLDGL EELKICVGYM LDGEQIDHLP
ASQGAQARVE PVYITLEGWK ESTVGARSWA DLPAQAIKYV RQVEELIGAP VALLSTSPER
DDTILVTDPF ED