Gene Information |
Locus tag | Caul_4389 |
Symbol | argS |
ID | 5901850 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 4763514 |
End bp | 4765355 |
Gene Length | 1842 bp |
Protein Length | 613 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641564907 |
Product | arginyl-tRNA synthetase |
Protein accession | YP_001686007 |
Protein GI | 167648344 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0018] Arginyl-tRNA synthetase |
TIGRFAM ID | [TIGR00456] arginyl-tRNA synthetase |
|
|
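The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon while the protein length does not. Both assumptions agree with the values listed in this record, but the record itself does not state them. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the stop codon is
    included in the gene length but not counted as a residue."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_4389 (argS).
length_bp = gene_length(4763514, 4765355)
print(length_bp, protein_length(length_bp))
```

For this record the two results correspond to the Gene Length and Protein Length fields.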
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.872152 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGATT TGAAGCGGTC CCTGAGCGAG GCGGCCGCGG CCGCCTTCCA GGCCGCCGGA CTGTCCCCCG ACTTCGGTCG CGTCACCGCG TCCGACCGGC CCGACCTGGC CGATTTCCAG TGCAACGGGG CCCTGGCGGC AGCCAAGAGC GCCAAGCGCA ATCCGCGCGA GATCGCCGTC CAGGTGGTCG ACGTCCTGAA GGCCGACCCG CGCCTGGCCT CGGTCGAGAT CGCCGGCGTC GGCTTCATCA ACATGCGCGT CAGCACCGAC GCCCTGTCGA TCCGCGCCAA CGAGATCGCC GCCGACCCGC GCGCCGGGGC CGAGCCCCTG GCCCATCCGC GCCGCGTGCT GGTCGACTAT GCCGGCCCCA ACGTCGCCAA GCCGATGCAC GTGGGCCACC TGCGCGCCTC GATCATCGGC GAGTCGGTCA AGCGCCTGTA CCGCTTCCGC GGCGACGACG TTGTGGGCGA CGCCCATTTC GGCGACTGGG GCTTCCAGAT GGGCCTGCTG ATCAGCGCCA TCATGGAGGA GGACGCCTTC ATCCGCGCCC TGCTCGAGCG CCTGGTCGAG GCCCCGCGCG AGTTCTCCAA GGCCGACGAG GACAAGGTCA TGTCCGAGTT TGCTCAGCGC GTCACCCTGG ACGACCTCGA CCGCCTCTAT CCGGCAGCCT CGGCGCGCCA GAAGGAAGAC CCCGAGTTCA AGGAAAAGGC CCGCAAGGCC ACCGCCGAAC TGCAGAACGG CCGCTTCGGC TACCGCCTGC TGTGGCGTCA CTTCGTCAAC ATCAGCCGCG TGGCCCTGGA GCGCGAGTTC CATGCCCTGG GCGTCGATTT CGACCTCTGG AAGGGCGAGA GCGACGTGCA GGACCTGATC GCCCCGATGG TCCGCCAGCT CGAGGTCAAG GGCCTGCTGG TCGACGACCA GGGCGCCCGC ATCGTCCGCG TGGCCCGTCC CGGCGAGACG AAGAAGAAGA AGCTGCCCGA CGGCTCGGTG GTCGAGGTCG AAAGTCCCGA CCCGCTGCTG GTGGTGTCGT CGGAAGGTTC GGCCATGTAC GGCACGACCG ACCTGGCGAC GATCCTGGAT CGCCGCAAGT CGTTCGACCC GCACCTGATC CTCTATTGCG TGGACCAGCG CCAGGCCGAC CACTTCGAGC AGGTGTTCCG CGCCGCCTAT CTGGCCGGCT ACGCCGAGCC CGGCAGCCTG GAGCACATCG GCTTCGGCAC CATGAACGGC AGCGACGGCA AGCCGTTCAA GACCCGGGCC GGCGGCGTCC TGAAGCTGCA CGACCTGATC GAGATGGCCC GCGAGAAGGC CCGCGAGCGC CTGCGCGAAG CTGGCCTCGG GGCCGAGCTT TCGCAAGAGG CCTTCGAGGA GACCGCCCAC AAGGTGGGGA TCGCGGCCCT GAAGTTCGCG GACCTGCAGA ACTTCCGTGG CACCTCCTAC GTCTTCGACC TCGACCGTTT CACAAGCTTC GAGGGCAAGA CCGGGCCGTA CCTGCTGTAC CAGTCGGTGC GCATCAAGAG CATCCTGCGC AAGGCGGCCG AGCAGAAGGT CGTGTCGGGC GCGATCATCG TCGGCGAGCC GGCCGAGCGC GACCTGACGC TGCTGTTGGA CGCGTTCGAA GGCGCCTTGA GCGAGGCCTA CGACAAGAAG GCCCCCAACT TCATCGCCGA GCACGCCTAC AAGCTGGCGC AGACCTTCTC GAAGTTCTAC GCCGCTTGCC CGATCCTCAG CGCCGACAAT GACGCCACCC GGGCCTCGCG CCTCGCCCTG GCCGAGACGA CGTTGAAGCA GCTGGAGCTG 
GCTTTGGATC TGCTGGGCAT CGAAGCGCCG GAACGGATGT AG
|
Protein sequence | MSDLKRSLSE AAAAAFQAAG LSPDFGRVTA SDRPDLADFQ CNGALAAAKS AKRNPREIAV QVVDVLKADP RLASVEIAGV GFINMRVSTD ALSIRANEIA ADPRAGAEPL AHPRRVLVDY AGPNVAKPMH VGHLRASIIG ESVKRLYRFR GDDVVGDAHF GDWGFQMGLL ISAIMEEDAF IRALLERLVE APREFSKADE DKVMSEFAQR VTLDDLDRLY PAASARQKED PEFKEKARKA TAELQNGRFG YRLLWRHFVN ISRVALEREF HALGVDFDLW KGESDVQDLI APMVRQLEVK GLLVDDQGAR IVRVARPGET KKKKLPDGSV VEVESPDPLL VVSSEGSAMY GTTDLATILD RRKSFDPHLI LYCVDQRQAD HFEQVFRAAY LAGYAEPGSL EHIGFGTMNG SDGKPFKTRA GGVLKLHDLI EMAREKARER LREAGLGAEL SQEAFEETAH KVGIAALKFA DLQNFRGTSY VFDLDRFTSF EGKTGPYLLY QSVRIKSILR KAAEQKVVSG AIIVGEPAER DLTLLLDAFE GALSEAYDKK APNFIAEHAY KLAQTFSKFY AACPILSADN DATRASRLAL AETTLKQLEL ALDLLGIEAP ERM
|
| |
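As an illustration of how the gene sequence relates to the protein sequence and the GC content field, the sketch below translates a coding sequence and computes its GC fraction. It assumes the gene sequence is given 5' to 3' on the coding strand (so no reverse complement is needed even though the gene lies on the minus strand), and it uses the codon assignments of NCBI translation table 11, which are the same as those of the standard code. The helper names are hypothetical, and only short fragments copied from the record are used here.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G; then second; then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence; a stop codon is rendered as '*'."""
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks and the final bases of the gene sequence above.
print(translate("ATGAGTGATT TGAAGCGGTC CCTGAGCGAG"))
print(translate("GAACGGATGT AG"))
```

Passing the full gene sequence to `translate` and dropping the trailing `*` should reproduce the protein sequence, and `gc_fraction` on the full sequence can be compared with the GC content field.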