Gene GM21_1940 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1940 
SymbolargS 
ID8137274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2249944 
End bp2251608 
Gene Length1665 bp 
Protein Length554 aa 
Translation table11 
GC content64% 
IMG OID644869554 
Productarginyl-tRNA synthetase 
Protein accessionYP_003021751 
Protein GI253700562 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones112 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAGC AGCTGCGCGC CTGCATCCTG AAGGGAATCG AGGGGTGTTT CGCCGACGGC 
ACGCTCACCT CTGGTGAAGT TCCGGCCATC AACGTGGAGA AGCCGGCGCA TGCCGAGCAC
GGCGATTTCG CCACCAACGT CGCCATGCAG ATGGCCAAGC AGCAGAGAAA GGCGCCGCGC
GCCGTGGCCG AGATCCTGGT CGCGAAGCTC GCCGGCGCCT CGGACCTGAT CGAGAGCCTG
GAAATCGCCG GCCCCGGCTT CATCAACTTC TTCATAAAAG ACGGCGCCTG GAGAAGGACC
TTAAGCGAGA TCGACCGCGC CGGCGACGCC TGGGGCAAAA GCGGCATCGG CCGCGGCAAG
AAGGTGCAGG TCGAGTTCGT CAGCGCCAAC CCGACCGGGC CTTTGCACAT AGGGCACGGC
CGCGGCGCCG CGACGGGGGA CGCCGTCGCC TCGCTTTTAT CCGCTGCGGG CTTCGACGTA
CAGCGCGAGT ACTACATCAA CGACGCCGGG AACCAGATGA ACACCCTGGG GCTCTCCGGG
CTCTTGCGCT ACAAGGAGCT TCTGGGGGAG AAGATCGAGT TCCCCGAGAC CTGCTACCAG
GGCGACTACA TGAAGGACAT CGCCCGCGAC GCGGTCACCA AGTACGGGGA CCGCTTCCTG
AAGGTATCCC AGGAGGAGGG GGTGGCCTTC TTCTCCAAGA TGGGGGGAGA CCTGATCCTC
GCCGGGATCG ACCAGGACCT GCAGGACTTC GGCGTCCGTT TCGACCACTG GTTCTCCGAA
CAGTCGCTCT TCGACGAGGG GAAGGTCAAC TCCGCCATCG AGGAGATGCA GGCCAAGGGG
CTCATCTACG AGCAGGAGGG GGCGCTCTGG TTCCGCACCA CCGACTACGG CGACGACAAG
GACCGCGTCG TGGTGAGGAG CAACGGGGTC ACCACCTATT TCGCCTCCGA CATCGCCTAC
CACAGGGACA AGTTCGCCCG CGGCTTCGAC TGGGTCATCG ACGTCTGGGG TGCCGACCAT
CACGGTTACG TCCCGAGGCT TAAGAGCGTG GTGCAGGGGC TTGGGCGCGA CGCGTCCGAC
CTCGGCATCA TCCTGGTGCA GCTCGTTTCG CTTTTGCGCG ACGGCGTGCC TGTGGCCATG
TCCACCAGAA GCGGCGAGTT CGTGACCCTG AAGGAGGTCG TCGACGAGGT CGGGCGCGAC
GCGGCACGCT TCTTCTTCCT GATGCGCCGC TCGGACAGCC AGCTCGACTT CGACCTGGAG
CTCGCCAAGC GCCAGAGTAA CGACAACCCG GTCTACTACG TGCAGTACGC CCACGCGAGG
ATCAAGAGCA TCTTCGACAC TGCGCGGGAA AGGGGCGTCG AGCCGCTCTT TGACAGCGTC
AAGTTCGAAC TGCTGCAGAC CCCGGAAGAC CTGAGCCTGA TCAAGAAGCT CTCCGTCTAC
CCGGAGATTC TCGAAGGTGG CGCGGTGAAC TTCGAGCCGC ACCGGATCAC CTACTACCTG
CAGGAGCTTG CCGGCGAATT CCACAGCTTC TACAACAAAA GCCGCGTGAT CACCCCCGAA
GAGCCGGAGC TGACCCAGGC GAGGCTTTTC CTTTTGCACT GCGTCGCCAT CACCCTCAAA
AACGCGCTCA CCGTCCTCGG CATCTCGGCG CCGGAAAGGA TGTAG
 
Protein sequence
MKEQLRACIL KGIEGCFADG TLTSGEVPAI NVEKPAHAEH GDFATNVAMQ MAKQQRKAPR 
AVAEILVAKL AGASDLIESL EIAGPGFINF FIKDGAWRRT LSEIDRAGDA WGKSGIGRGK
KVQVEFVSAN PTGPLHIGHG RGAATGDAVA SLLSAAGFDV QREYYINDAG NQMNTLGLSG
LLRYKELLGE KIEFPETCYQ GDYMKDIARD AVTKYGDRFL KVSQEEGVAF FSKMGGDLIL
AGIDQDLQDF GVRFDHWFSE QSLFDEGKVN SAIEEMQAKG LIYEQEGALW FRTTDYGDDK
DRVVVRSNGV TTYFASDIAY HRDKFARGFD WVIDVWGADH HGYVPRLKSV VQGLGRDASD
LGIILVQLVS LLRDGVPVAM STRSGEFVTL KEVVDEVGRD AARFFFLMRR SDSQLDFDLE
LAKRQSNDNP VYYVQYAHAR IKSIFDTARE RGVEPLFDSV KFELLQTPED LSLIKKLSVY
PEILEGGAVN FEPHRITYYL QELAGEFHSF YNKSRVITPE EPELTQARLF LLHCVAITLK
NALTVLGISA PERM