Gene BAS2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS2021 
SymbolargS 
ID2848358 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp2024593 
End bp2026281 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content36% 
IMG OID637505271 
Productarginyl-tRNA synthetase 
Protein accessionYP_028284 
Protein GI49185032 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000995488 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATTATA AAACGCAGTT TGCGGAAAGT TTATCTAATA TTTTTACGAA TGAATTAACG 
CAACAGCAAA TTTTAGATTT AATTGAAACA CCGAAACAAG ATGAATTTGG AGATGCTGCA
TTTCCGTGTT TTTCACTTGC GAAGCAATAT AAAAAATCAC CAGCTATTAT CGCAAAGGAA
GTTGCAGAGA AATTAAGTGA TCCGTTTTTT ACGAAAGTAG AGGCTGTTGG TCCTTATGTA
AATGTATTTT TTAATCGTGA TACAGTAAGT GATGCAGTAT TAAAAACGAT TTTAGCGGAG
AAAGAAGAGT ACGGTAAAAA TTATTTTGGA TGTGAAAAAA CGGTCGTTAT CGATTATTCC
TCACCTAATA TCGCGAAACC TTTTTCAATG GGGCATTTAC GTTCTACAAT GATTGGAAAT
TCATTGAAGC ATATCGCTGA AAAATGTGGG TATGAAGTTG TAGGAATTAA TTATATTGGA
GACTGGGGAA CACAGTTTGG AAAGTTAATT ACGGCTTATA AAAAATGGGG AAATGAAGCA
GTAGTGAAAG AGGATCCAAT ACGTGAATTA TTTAAGTTAT ATGTTCAATT TCATGAAGAG
GTAAAAGACG ACGAAGAATT AGAAGAAGAA GGACGCGCTT GGTTTAAGAA ATTAGAAGAA
GGTGATGAAG AAGCTGTTGA ACTTTGGAAT TGGTTCCGCC ACGAATCCTT AAAAGAATTT
TCTCGTATTT ATGAACTTCT CGGTGTGGAA TTTACTAATT TTCAAGGAGA AGCTTTTTAT
AATAATTTAA TGGAAGACTT TATTGGGATT TTAGAGGAAC ATGATTTACT TGAAGAGTCA
GAAGGTGCAT TAGTCGTTAA TTTAGAAGAA GAGGGCATGC CACCTTGCTT AATTAGAAAA
TCAGATGGTG CGACGATTTA CGCAACGCGT GACTTAACGG CAGCTCTATA TCGTCAAAAC
ACATTTGGTT TTGATAAAGC GTTATACGTA GTTGGCCCAG AACAAAGTTT ACACTTCAAT
CAATTCTTCA CTGTATTAAA AAAGCTCGGC TACACTTGGG TTGATGGCAT GGAACATGTA
CCGTTTGGGT TCATTTTAAA AGACGGTAAG AAAATGTCCA CACGTAAAGG AAGAGTTATT
TTACTTGAAG AAGTACTTGA GGAAGCAATC GAACTTGCAA AACAAAATAT TGAAGAGAAA
AATCCAAACT TGAAACAGAA AGAAGAAGTA GCAAAGCAAG TCGGCGCTGG CGCAGTCATC
TTCCACGATT TAAAAAATGA GCGTATGCAC AATATTGAAT TCTCATTAGA AAATATGCTG
AAATTCGAAG GGGAAACAGG CCCGTACGTA CAATACACAC ATGCACGTGC TTGCTCTATT
TTAAGAAAAG AAAGTGTAGA ATTTGAAACG TGTACATTTG CATTAAAAGA TGATCATAGC
TGGAGTGTTG TAAAATTACT CAATAAATTC CCACAAGTAA TTGAAATAGC CTTCAACAAA
AATGAACCAT CGGTTATTTC GAAATACGTA TTAGATGTAG CGCAATCGTT TAATAAATAT
TACGGGAATG TGCGTATATT AGAAGAGAGT GAAGAGAAAG ACAGTAGACT GGCATTAGTG
TATGCTGTGA CGGTTGTATT AAAAGAGGGG TTACGTTTAC TTGGGGTGGA GGCACCTGAG
GAGATGTAA
 
Protein sequence
MDYKTQFAES LSNIFTNELT QQQILDLIET PKQDEFGDAA FPCFSLAKQY KKSPAIIAKE 
VAEKLSDPFF TKVEAVGPYV NVFFNRDTVS DAVLKTILAE KEEYGKNYFG CEKTVVIDYS
SPNIAKPFSM GHLRSTMIGN SLKHIAEKCG YEVVGINYIG DWGTQFGKLI TAYKKWGNEA
VVKEDPIREL FKLYVQFHEE VKDDEELEEE GRAWFKKLEE GDEEAVELWN WFRHESLKEF
SRIYELLGVE FTNFQGEAFY NNLMEDFIGI LEEHDLLEES EGALVVNLEE EGMPPCLIRK
SDGATIYATR DLTAALYRQN TFGFDKALYV VGPEQSLHFN QFFTVLKKLG YTWVDGMEHV
PFGFILKDGK KMSTRKGRVI LLEEVLEEAI ELAKQNIEEK NPNLKQKEEV AKQVGAGAVI
FHDLKNERMH NIEFSLENML KFEGETGPYV QYTHARACSI LRKESVEFET CTFALKDDHS
WSVVKLLNKF PQVIEIAFNK NEPSVISKYV LDVAQSFNKY YGNVRILEES EEKDSRLALV
YAVTVVLKEG LRLLGVEAPE EM