Gene Smed_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_0471 
Symbol 
ID5321305 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp507600 
End bp508871 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content64% 
IMG OID640789406 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_001326163 
Protein GI150395696 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTC TTCTGATCGG ATCGGGCGGC CGCGAACATG CGCTTGCATG GAAGATCGCG 
CAATCGCCGA GGCTGACCAC TCTTTATGCC GCGCCCGGCA ATCCCGGCAT CGCCGAAGAG
GCCGCAATCG TTGCGCTCGA CGTGGGAAAC CACGTGGCCG TGGTCGATTT CTGTCGCAGC
CATGCAATCG ACCTCGTGGT CGTCGGTCCC GAGGCGCCGC TGGTCGCCGG TCTTGCCGAC
GTCCTTATCG GGGCGGGCAT TCCGGTTTTC GGCCCTTCCG CCGCAGCGGC TCAGCTAGAA
GGTTCCAAGG GTTTTACCAA AGACCTCTGC GCGCGATATG GAATCCCGAC CGGCGCCTAT
AAGCGTTTCT CCGACGCCGA GTCCGCCCGG GCCTATGTTC GCGAACAGGG CGCGCCGATC
GTCATCAAGG CGGACGGGCT TGCCGCCGGC AAGGGTGTGA CCGTCGCGAT GAACCTCGAA
GAGGCCCTTG CCGCAGTCGA CGAATGCTTC GCCGGCGCCT TTGGCGCGGC CGGCGCCGAG
GTGGTGGTCG AGGCCTATCT CGATGGCGAG GAAGCGAGCT TCTTCTGCCT TTGCGACGGC
AAGAACGTGC TGCCGCTCGG TTCTGCGCAG GACCACAAGC GTGTCGGAGA CGGCGACACC
GGACCGAACA CCGGCGGCAT GGGGGCCTAT TCGCCTGCGC CGGTCATGAG CCCCGAGGTG
GTCGAAAGGA CGATGAGAGA GATAATCCAG CCTACCGTTC GCGGCATGTC CGAAAACGGT
TATCCCTTCA CCGGCGTGTT CTTTGCCGGC CTGATGATTA CGGAGAAGGG GCCGGAACTC
ATCGAATACA ATGTGCGCTT CGGCGATCCC GAATGCCAGG TGCTTATGAT GCGCCTCCAA
AGCGACCTCC TTCCACTGCT CTATGCGGCG GCCACCGGGA CGCTCGAAGG CATGAAGGCC
GAATGGCGGG ACGACGTTGC TCTGACCGTG GTGATGGCCG CGCGCGGCTA TCCAGGCTCC
TACGAAAAAG ATACGCCGAT CGACGCCCTC CCCGAGGCAT CCGCCACAAC GAAAGTGTTC
CATGCCGGCA CCACAATGAA GGACGGCAGA CTGGTTGCTA CCGGCGGCCG CGTGCTGAAC
GTCACCGCCA CGGGAAAGTC GGTCTCGGTG GCGAAGGACG CGGCTTATGC GGCCGTTCGC
AACGTTACCT GGGAAAACGG CTTCCATCGC AACGACATCG GCTGGCGGGC GGTCGCGCGC
GAGACGGCCT GA
 
Protein sequence
MKVLLIGSGG REHALAWKIA QSPRLTTLYA APGNPGIAEE AAIVALDVGN HVAVVDFCRS 
HAIDLVVVGP EAPLVAGLAD VLIGAGIPVF GPSAAAAQLE GSKGFTKDLC ARYGIPTGAY
KRFSDAESAR AYVREQGAPI VIKADGLAAG KGVTVAMNLE EALAAVDECF AGAFGAAGAE
VVVEAYLDGE EASFFCLCDG KNVLPLGSAQ DHKRVGDGDT GPNTGGMGAY SPAPVMSPEV
VERTMREIIQ PTVRGMSENG YPFTGVFFAG LMITEKGPEL IEYNVRFGDP ECQVLMMRLQ
SDLLPLLYAA ATGTLEGMKA EWRDDVALTV VMAARGYPGS YEKDTPIDAL PEASATTKVF
HAGTTMKDGR LVATGGRVLN VTATGKSVSV AKDAAYAAVR NVTWENGFHR NDIGWRAVAR
ETA