Gene Pars_1756 details


Gene Information

Locus tag: Pars_1756
Symbol: argS
ID: 5055232
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1575382
End bp: 1577274
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 54%
IMG OID: 640469299
Product: arginyl-tRNA synthetase
Protein accession: YP_001153959
Protein GI: 145591957
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
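
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminating stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1575382, 1577274)
print(length_bp)                  # 1893
print(protein_length(length_bp))  # 630
```

Under these assumptions the computed values agree with the reported gene length (1893 bp) and protein length (630 aa).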


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0324309
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00414873
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGATCCTC TGAAGTTGCC TAAGCAAGAG TTCGCCGACG CATTAGGCAA AATATCTAGC 
CGTCTGGGCT TGGCGGAGGT GCCCGAAATT GAGAAGACGC GTCGTTACGG CTACTTCTCG
GCAAGGTTTC ACAAATACAA GATCGACCCA ACGAGACTAA GGGATGCTGT GGAAGAGCTG
AGCAACGCCG GTTTTCAGTA CATCTCTGGT CTGTCCGCAG AGGGGCTTTA CGTCAATGCT
GACTTAAACG CAAAAAGGCT GGGGGAGCTC GTCTTCGAGG CTGTGGCTAA GATGGGGAAG
AAGTACGGAT TTACGGAGGA GTGTCAGCTG GGGTCTTTTC TGGTGGAGCA CACCTCTGCC
AATCCCATAC ACCCGTTGCA CATAGGCCAT GGCAGAAACG CCATACTGGG CGACTCGCTT
GCAAGACTGC TGAGGTTCTG CGACAACCGT GTGGAGGTCC ATTTCTACGT CGACGACTGC
GGCGTGCAGG TGATGTACGC AACAATTGGT TACAACGCTG TTAGGGATGA GGCCAGAGAG
TGGATTGAAA GAGCGAAGCC TGATCTTGTT GTTGGGCATA TATACTCGGC AACAAACGCC
GTGGCCGAGA TCGGCCGTCT TAAAAAAGAG GCGGAGAGAG CGCAAGACGA TGAGCACAAG
CGTAGTCTGA TAGGGGAAAT AGACGAGTGG GTGGCTGTGT TGAAGAGGCT TATGGAGAGT
GAGGGAGATC TAGTTGCCAA GGTTGTCGAG AGGCTTGGCC AGAGAGACGT GGCCGGGGAG
GCAGTGGAGC TGAATAGGCG CTACGAGGCC GGCGACCCCG AGGCAAAGAG GGTCGTACGA
GAGGTGGTAG ACCTCGTGCT GAGGGGGCAA AGAGAAACTC TTGCCAGGCT CGGCATCGAG
ATAGACAGGT GGGATTATGA AAGCGAGCTG GCGGTGTGGT CTGGCGAAGC TTCTCGCATA
GTTGAGGAAC TTCAGAGAAG GTGGCCCCAG TACGTTGAGT ATAAGGGCGG GGCGGTGGTG
TTCCGTGCCG ACAAATTCGT GGATGATTTC AAGCTCTGGG ATGTCTTAGA CTTGCCTAAG
TTCATTCCAC CTGTCACCTT GACGAGATCT GATGGGACCA CTCTCTATGT TACGAGAGAC
GTGGCCTACG CGCTGTGGCA GGCCCGGCAG GGATTCGACA AAGTTGTACG CGTAATCTCG
ACTGAGCAAA CCCACGAGCA GGCTCACGTC CGTATTATCC TCTACGCGCT TGGTTTTGAA
GACGTAGCTA AGAAGATTGT CCACTACGCC TACGAGATGG TTAATCTGCC GGGGATGAAA
ATGTCGGCGC GTCGCGGGCG ATATATCTCG CTTGATGAAA TACTTGACGA GGCAGCCGAG
CGCTCTGCTA GTTTAGTCAA AGAGAAGAGC CCGGAGATAG CTGGGGTGAT AGCTGAGAAG
GTGGGAGTGG GGTCGGTGAG ATATGCGTTC CTCTCCACCA GCCCGCGTAA GCCTATAGAG
TTTAGGTGGG AAGTAGTCCT AAACCTTAGG CAAAACTCAG GTACGTTCTT GCAGTACACC
TATGTGAGGG CCTACTCTAT TCTTGAGAAG GCGCCAGATG TGGAGAGGGC CTCCGTCCCC
GAGCAGATGC TAGAGGAAGA GAAGGAGCTT CTTGTAAAAA TTGCCGAGTG GCCTAGTGTT
GTGAGAGAGG CCGTGAGGGC GCTTAGGCCG GACTACGTGG CGGAATACCT AGACGGTTTG
GCGTTGCTTT TCAACAGCTA TTACGAAAAG GCGCCGGTGC TCAAGGCTGT AGAAGGCGTC
AGGAAGTTCA GAATAGCGTT GGTAAACGCC GTGAAGACGG TGCTGGAGGC TGGGTTCTAC
ATCCTGGGCA TACCAACGCT GACAAAGATG TGA
 
Protein sequence
MDPLKLPKQE FADALGKISS RLGLAEVPEI EKTRRYGYFS ARFHKYKIDP TRLRDAVEEL 
SNAGFQYISG LSAEGLYVNA DLNAKRLGEL VFEAVAKMGK KYGFTEECQL GSFLVEHTSA
NPIHPLHIGH GRNAILGDSL ARLLRFCDNR VEVHFYVDDC GVQVMYATIG YNAVRDEARE
WIERAKPDLV VGHIYSATNA VAEIGRLKKE AERAQDDEHK RSLIGEIDEW VAVLKRLMES
EGDLVAKVVE RLGQRDVAGE AVELNRRYEA GDPEAKRVVR EVVDLVLRGQ RETLARLGIE
IDRWDYESEL AVWSGEASRI VEELQRRWPQ YVEYKGGAVV FRADKFVDDF KLWDVLDLPK
FIPPVTLTRS DGTTLYVTRD VAYALWQARQ GFDKVVRVIS TEQTHEQAHV RIILYALGFE
DVAKKIVHYA YEMVNLPGMK MSARRGRYIS LDEILDEAAE RSASLVKEKS PEIAGVIAEK
VGVGSVRYAF LSTSPRKPIE FRWEVVLNLR QNSGTFLQYT YVRAYSILEK APDVERASVP
EQMLEEEKEL LVKIAEWPSV VREAVRALRP DYVAEYLDGL ALLFNSYYEK APVLKAVEGV
RKFRIALVNA VKTVLEAGFY ILGIPTLTKM
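
The relationship between the gene sequence and the protein sequence can be illustrated with a short script. This is a minimal sketch, not the method used to generate this record. It assumes that translation table 11 uses the standard codon assignments, that the first codon (here GTG) acts as a start codon and is read as methionine, and that whitespace in the sequence blocks is only formatting. The helper names `clean`, `translate_cds`, and `gc_content` are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG codon order (shared by tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding sequence up to the first stop codon.

    Assumption: the first codon is a start codon and is read as 'M'.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if pos == 0:
            amino_acid = "M"  # initiator codon (e.g. GTG) is read as methionine
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
first_30 = "GTGGATCCTC TGAAGTTGCC TAAGCAAGAG"
print(translate_cds(first_30))  # MDPLKLPKQE
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate_cds` and `gc_content` would allow the reported protein sequence and GC content to be checked in the same way.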