Gene Sare_4035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4035 
SymbolargS 
ID5705015 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4591996 
End bp4593660 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content69% 
IMG OID641273460 
Productarginyl-tRNA synthetase 
Protein accessionYP_001538816 
Protein GI159039563 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.205832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00704959 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACTCCCG CAGAACTCGC CGAGGTCGTC CTCACCGCAG CCCACACCGT CCTCGCCCAG 
CGGGGCCTGG ACCGCGCCAT GCTTCCGGAG CAGACCGCGG TGGAGCGACC CCGTAACGCC
GAGCACGGGG ACTACGCCTC GACGCTGGCC CTCCAACTCA GCAAGAAGGT GGGTGTTCCG
CCGCGGGAAC TCGCCGCCGC ACTGGCCGAC CAGCTCGGCC AGACCGTGGG GATCAAGTCG
GTGGAGATCG CCGGCCCGGG CTTCCTGAAC ATCCGCCTCG ACGCAGCCGC CGCGGGTCAG
CTGGCCCAGG TGATCGTCGA GGCCGGCGAG GAGTACGGCC GCAGTGACCG GCTTGTCGGT
CAATCGATCA ACCTCGAGTT CGTGTCGGCC AACCCGACCG GTCCGGTACA CATCGGCGGC
GTCCGTTGGG CCGCGGTCGG TGACGCGCTG AGTCGGTTGC TGCGTGCCAC CGGGGCCGAG
GTCGGCACCG AGTACTACTT CAATGACGCC GGATCACAGA TCGACCGGTT CGCCCGCTCG
TTGCTGGCCG CCGCGAAGGG GGAGCCGCCG CCGGAGGACG GGTACGCCGG CGCGTACCTG
ACGGAGATCG CGCAGCAGGT CCAGTCCCGG CGGCCGGACG TGTTGGCGCT CGACGACGCC
GCCGCCCAGG AGGTGTTCCG GGTCGAGGGC GTCGAGCTGA TGTTCGCTGA GATCAAGTCG
TCGCTGCGTG ACTTCGGCGT GGAGTTCGAC ACCTACTTCA ACGAGAAGGA CCTGCACGAC
CGGGGCGAGT TGGAACTCGC CCTCGAGCGG CTACGGCAGC AGGGCCACAT CTCCGAGGCC
GATGGCGCCA CCTGGCTGCG CACCACCCAC TTCGGTGACG ACAAGGACCG GGTACTGCGT
AAGTCCAACG GCGAGTGGAC CTACTTCGCC GCCGACTGCG CCTACTACCT GGACAAGCGG
GAGCGCGGCT ACGAGCGCGT CGTACTGATG CTCGGCGCGG ACCACCACGG CTACCTTGGC
CGGATGAAGG CCATGGTCGC CTGCTTCGGC GATGACCCGG CCCACAATCT GGAGATCCTC
ATCGGGCAGA TGGTCAACCT GGTCCGCGAC GGAGCACCCG TGCGGATGAG CAAGCGGGCC
GGCACCGTGG TGCGTTTGGA GGACCTGGTC GACGCGATCG GTGTGGACGC CGCCCGGTAC
GCGTTGGCGC GGTACTCCAT CGACTCACCG ATCGACATCG ACATCGAGCT GTGGACCCGA
GCAACCCGCG ACAATCCCGT CTACTACGTG CAGTACGTGG CGGCGCGCAC GGCCAGTGTC
GGGCGCAACG CCGCGGAGGT CGGGCTGACC CGGGGTCAGC CGACGGACTT CCACCCCGAG
CTGCTCGACC ACGACAAGGA AAACGAGCTG CTGAAGGCGC TCGCCGAGTT TCCCGCTGTG
GTGGCCACCG CCGCCGAGCT GCGCGAGCCG CACCGGATCG CCCGGTACCT GGAGGACCTG
GCCGGGGCGT ACCACCGGTT CTACGACAAC TGCCGGGTGC TGCCCCGGGG TGACGAGGAG
ATCACCGACC TGCACCGCGC CCGGCTCTGG CTGAACGACG CCACCCGGGT GGTCATCGCC
AACGGCCTGC GGCTGCTCGG CGTGTCGGCC CCGGAGCGGA TGTGA
 
Protein sequence
MTPAELAEVV LTAAHTVLAQ RGLDRAMLPE QTAVERPRNA EHGDYASTLA LQLSKKVGVP 
PRELAAALAD QLGQTVGIKS VEIAGPGFLN IRLDAAAAGQ LAQVIVEAGE EYGRSDRLVG
QSINLEFVSA NPTGPVHIGG VRWAAVGDAL SRLLRATGAE VGTEYYFNDA GSQIDRFARS
LLAAAKGEPP PEDGYAGAYL TEIAQQVQSR RPDVLALDDA AAQEVFRVEG VELMFAEIKS
SLRDFGVEFD TYFNEKDLHD RGELELALER LRQQGHISEA DGATWLRTTH FGDDKDRVLR
KSNGEWTYFA ADCAYYLDKR ERGYERVVLM LGADHHGYLG RMKAMVACFG DDPAHNLEIL
IGQMVNLVRD GAPVRMSKRA GTVVRLEDLV DAIGVDAARY ALARYSIDSP IDIDIELWTR
ATRDNPVYYV QYVAARTASV GRNAAEVGLT RGQPTDFHPE LLDHDKENEL LKALAEFPAV
VATAAELREP HRIARYLEDL AGAYHRFYDN CRVLPRGDEE ITDLHRARLW LNDATRVVIA
NGLRLLGVSA PERM