Gene Information |
Locus tag | Pars_1756 |
Symbol | argS |
ID | 5055232 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1575382 |
End bp | 1577274 |
Gene Length | 1893 bp |
Protein Length | 630 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469299 |
Product | arginyl-tRNA synthetase |
Protein accession | YP_001153959 |
Protein GI | 145591957 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0018] Arginyl-tRNA synthetase |
TIGRFAM ID | [TIGR00456] arginyl-tRNA synthetase |
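
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the coordinates are 1-based and inclusive and that the stated gene length includes the stop codon; neither convention is stated in the record, although both agree with the listed values.

```python
# Values copied from the Gene Information fields above.
start_bp = 1575382
end_bp = 1577274
gene_length_bp = 1893
protein_length_aa = 630

# Assumption: coordinates are 1-based and inclusive on both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length counts the stop codon, which is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp)              # 1893
print(gene_length_bp % 3)   # 0, so the CDS is a whole number of codons
print(expected_protein_aa)  # 630
```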
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0324309 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00414873 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGATCCTC TGAAGTTGCC TAAGCAAGAG TTCGCCGACG CATTAGGCAA AATATCTAGC CGTCTGGGCT TGGCGGAGGT GCCCGAAATT GAGAAGACGC GTCGTTACGG CTACTTCTCG GCAAGGTTTC ACAAATACAA GATCGACCCA ACGAGACTAA GGGATGCTGT GGAAGAGCTG AGCAACGCCG GTTTTCAGTA CATCTCTGGT CTGTCCGCAG AGGGGCTTTA CGTCAATGCT GACTTAAACG CAAAAAGGCT GGGGGAGCTC GTCTTCGAGG CTGTGGCTAA GATGGGGAAG AAGTACGGAT TTACGGAGGA GTGTCAGCTG GGGTCTTTTC TGGTGGAGCA CACCTCTGCC AATCCCATAC ACCCGTTGCA CATAGGCCAT GGCAGAAACG CCATACTGGG CGACTCGCTT GCAAGACTGC TGAGGTTCTG CGACAACCGT GTGGAGGTCC ATTTCTACGT CGACGACTGC GGCGTGCAGG TGATGTACGC AACAATTGGT TACAACGCTG TTAGGGATGA GGCCAGAGAG TGGATTGAAA GAGCGAAGCC TGATCTTGTT GTTGGGCATA TATACTCGGC AACAAACGCC GTGGCCGAGA TCGGCCGTCT TAAAAAAGAG GCGGAGAGAG CGCAAGACGA TGAGCACAAG CGTAGTCTGA TAGGGGAAAT AGACGAGTGG GTGGCTGTGT TGAAGAGGCT TATGGAGAGT GAGGGAGATC TAGTTGCCAA GGTTGTCGAG AGGCTTGGCC AGAGAGACGT GGCCGGGGAG GCAGTGGAGC TGAATAGGCG CTACGAGGCC GGCGACCCCG AGGCAAAGAG GGTCGTACGA GAGGTGGTAG ACCTCGTGCT GAGGGGGCAA AGAGAAACTC TTGCCAGGCT CGGCATCGAG ATAGACAGGT GGGATTATGA AAGCGAGCTG GCGGTGTGGT CTGGCGAAGC TTCTCGCATA GTTGAGGAAC TTCAGAGAAG GTGGCCCCAG TACGTTGAGT ATAAGGGCGG GGCGGTGGTG TTCCGTGCCG ACAAATTCGT GGATGATTTC AAGCTCTGGG ATGTCTTAGA CTTGCCTAAG TTCATTCCAC CTGTCACCTT GACGAGATCT GATGGGACCA CTCTCTATGT TACGAGAGAC GTGGCCTACG CGCTGTGGCA GGCCCGGCAG GGATTCGACA AAGTTGTACG CGTAATCTCG ACTGAGCAAA CCCACGAGCA GGCTCACGTC CGTATTATCC TCTACGCGCT TGGTTTTGAA GACGTAGCTA AGAAGATTGT CCACTACGCC TACGAGATGG TTAATCTGCC GGGGATGAAA ATGTCGGCGC GTCGCGGGCG ATATATCTCG CTTGATGAAA TACTTGACGA GGCAGCCGAG CGCTCTGCTA GTTTAGTCAA AGAGAAGAGC CCGGAGATAG CTGGGGTGAT AGCTGAGAAG GTGGGAGTGG GGTCGGTGAG ATATGCGTTC CTCTCCACCA GCCCGCGTAA GCCTATAGAG TTTAGGTGGG AAGTAGTCCT AAACCTTAGG CAAAACTCAG GTACGTTCTT GCAGTACACC TATGTGAGGG CCTACTCTAT TCTTGAGAAG GCGCCAGATG TGGAGAGGGC CTCCGTCCCC GAGCAGATGC TAGAGGAAGA GAAGGAGCTT CTTGTAAAAA TTGCCGAGTG GCCTAGTGTT GTGAGAGAGG CCGTGAGGGC GCTTAGGCCG GACTACGTGG CGGAATACCT AGACGGTTTG GCGTTGCTTT TCAACAGCTA TTACGAAAAG GCGCCGGTGC TCAAGGCTGT AGAAGGCGTC 
AGGAAGTTCA GAATAGCGTT GGTAAACGCC GTGAAGACGG TGCTGGAGGC TGGGTTCTAC ATCCTGGGCA TACCAACGCT GACAAAGATG TGA
|
Protein sequence | MDPLKLPKQE FADALGKISS RLGLAEVPEI EKTRRYGYFS ARFHKYKIDP TRLRDAVEEL SNAGFQYISG LSAEGLYVNA DLNAKRLGEL VFEAVAKMGK KYGFTEECQL GSFLVEHTSA NPIHPLHIGH GRNAILGDSL ARLLRFCDNR VEVHFYVDDC GVQVMYATIG YNAVRDEARE WIERAKPDLV VGHIYSATNA VAEIGRLKKE AERAQDDEHK RSLIGEIDEW VAVLKRLMES EGDLVAKVVE RLGQRDVAGE AVELNRRYEA GDPEAKRVVR EVVDLVLRGQ RETLARLGIE IDRWDYESEL AVWSGEASRI VEELQRRWPQ YVEYKGGAVV FRADKFVDDF KLWDVLDLPK FIPPVTLTRS DGTTLYVTRD VAYALWQARQ GFDKVVRVIS TEQTHEQAHV RIILYALGFE DVAKKIVHYA YEMVNLPGMK MSARRGRYIS LDEILDEAAE RSASLVKEKS PEIAGVIAEK VGVGSVRYAF LSTSPRKPIE FRWEVVLNLR QNSGTFLQYT YVRAYSILEK APDVERASVP EQMLEEEKEL LVKIAEWPSV VREAVRALRP DYVAEYLDGL ALLFNSYYEK APVLKAVEGV RKFRIALVNA VKTVLEAGFY ILGIPTLTKM
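
The gene sequence begins with GTG while the protein begins with M. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the start position, even though it encodes valine internally. The sketch below illustrates this on the first 30 nucleotides of the gene. The `translate` helper and the `START_CODONS` set are hypothetical names introduced for illustration, and the set lists only a subset of the table 11 start codons.

```python
from itertools import product

# Amino acid assignments in TCAG codon order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Illustrative subset of table 11 initiation codons (not the complete list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(dna: str) -> str:
    """Translate a coding sequence, reading an alternative start codon as M."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the gene sequence above.
print(translate("GTGGATCCTC TGAAGTTGCC TAAGCAAGAG"))  # MDPLKLPKQE
```

The same helper applied to the final codons of the gene (`ACA AAG ATG TGA`) yields `TKM`, matching the end of the protein sequence and confirming that TGA is the stop codon.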