Gene Smed_1158 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1158 
SymbolargS 
ID5322004 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1232913 
End bp1234670 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content59% 
IMG OID640790099 
Productarginyl-tRNA synthetase 
Protein accessionYP_001326844 
Protein GI150396377 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.381889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.900625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCTTT TCACCGACTT CGAAGCGAGA ATTAATAGAA TTCTGGAATC GATTGATGTT 
ATTCGTGAAA AGCGATCCGA GTTGGACTTT CGGCGCATCA ATGTGGAGCC GCCGCGCGAT
GCGAGTCACG GTGATGTGGC GACCAATGCG GCAATGGTGC TTGCCAAGCC GCTCGGCATG
AACCCACGCG CGCTGGCGGA CCTTATTGGC GACAAGCTGG CACAGGATCC GGAAGTCGCC
GAGGTTTCCG TTGCCGGTCC CGGGTTCATC AATGTCCGCC TGTCGGTTTC CTATTGGCAG
AAGCTTCTGG CGGTCATAAC GCGCGCCGGC GTCGATTACG GTCGCAGCGC ATTAGGCGCC
GGCCGCAAGA TCAACGTGGA ATATGTCTCG GCCAATCCGA CCGGCCCTAT GCATGTCGGC
CACTGCCGCG GTGCGGTCGT CGGAGACGCG CTTGCCAATC TTTTGGCCTT TTCGGGTTTC
GACGTAACCA AGGAATACTA CATCAACGAT GCGGGTTCGC AGATCGAAGT GCTCGCCCGG
TCGGCATTCC TGCGTTATCG TCAGGCGCTT GGCGAGGACA TAGGCGAGAT TCCGGCGGGC
CTCTACCCCG GCGACTATCT CGTACCGGCG GGAGAGGCGC TCGCGGACGA GTACGGCACG
AGCCTCAGGA TCATGCCGGA GGACAAATGG ATGCCGCTCG TCAAGGAGCG CGTCATCGAC
GCAATGATGG CGATGATCCG GGAGGATCTT GCGGCACTCA ATGTCAATCA CGACGTGTTC
TTTTCCGAGC GGGCCTTGCA CGACAACGGT GCAGCGCGAA TCCGCACGGC GATCAACGAT
CTGACTTTTA AGGGGCATGT CTACAAAGGC ATGCTGCCGC CGCCGAAAGG CCAGTTGCCG
GAAGATTGGG AAGACAGGGA GCAGACGCTC TTCCGCTCGA CCGAGGTCGG TGACGACATC
GATCGCCCGC TTATCAAATC CGACGGCAGC TATACCTACT TCGCAGCCGA CGTCGCCTAC
TTCAAGGACA AGTTCGACCG CGGTTTCGAC GAAATGATCT ACGTGCTCGG CGCCGACCAC
GGCGGCTATG TAAAGCGGCT GGAGGCGCTT GCGCGTGCGA TCTCTGGCGG ATCGGCAAAG
TTGACCGTGC TGCTGTGCCA GCTCGTGAAG CTCTACCGCA ATGGCGAGCC GGTGAAGATG
TCCAAGCGCT CGGGCGATTT CGTGACGCTG CGCGAAGTCG TGGACGAAGT CGGGCGCGAT
CCCGTCCGGT TCATGATGCT GTACCGCAAG AGCTCCGAGC CTCTCGATTT CGACTTTGCC
AAGGTGACGG AACAGTCGAA GGACAACCCT GTCTTTTACG TGCAATACGC CCATGCGCGC
TGCCGTTCCG TTTTCCGCCA GGCTGCTGAA GCATTTCCTG ACCTGGATCT GTCGTCCATC
GATCTCGCCG GCGCGGCGGG CGCTATTGCC GATCCCACGG AAATGCAGCT CGTCGCCAAG
CTCGCAGAAT ATCCCCGTGT AGTGGAGGCC GCAGCATTCT CGCATGAGCC GCACCGAATC
GCTTTTTACC TGTATGATCT TGCGGCGGTC TTCCATGGCC ACTGGAACAA AGGTAAGGAA
AACCCGGCAT TACGTTTTGT TAACGATAAG AATAGAGAAT TAAGCATTGC CAGACTCGGG
CTGGTGCATG CTGTCGCCTC GGTATTGAAG TCGGGCCTGT CGATTACAGG GACTTCGGCA
CCGGATGAGA TGCGGTAA
 
Protein sequence
MNLFTDFEAR INRILESIDV IREKRSELDF RRINVEPPRD ASHGDVATNA AMVLAKPLGM 
NPRALADLIG DKLAQDPEVA EVSVAGPGFI NVRLSVSYWQ KLLAVITRAG VDYGRSALGA
GRKINVEYVS ANPTGPMHVG HCRGAVVGDA LANLLAFSGF DVTKEYYIND AGSQIEVLAR
SAFLRYRQAL GEDIGEIPAG LYPGDYLVPA GEALADEYGT SLRIMPEDKW MPLVKERVID
AMMAMIREDL AALNVNHDVF FSERALHDNG AARIRTAIND LTFKGHVYKG MLPPPKGQLP
EDWEDREQTL FRSTEVGDDI DRPLIKSDGS YTYFAADVAY FKDKFDRGFD EMIYVLGADH
GGYVKRLEAL ARAISGGSAK LTVLLCQLVK LYRNGEPVKM SKRSGDFVTL REVVDEVGRD
PVRFMMLYRK SSEPLDFDFA KVTEQSKDNP VFYVQYAHAR CRSVFRQAAE AFPDLDLSSI
DLAGAAGAIA DPTEMQLVAK LAEYPRVVEA AAFSHEPHRI AFYLYDLAAV FHGHWNKGKE
NPALRFVNDK NRELSIARLG LVHAVASVLK SGLSITGTSA PDEMR