Gene EcolC_1756 details


Gene Information

Locus tag: EcolC_1756
Symbol: argS
ID: 6066595
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1954119
End bp: 1955852
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 53%
IMG OID: 641601171
Product: arginyl-tRNA synthetase
Protein accession: YP_001724733
Protein GI: 170019779
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
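
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1954119
END_BP = 1955852
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one (assumed) terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values listed in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, the 1734 bp span corresponds to 578 codons, of which the last is the stop codon, giving the listed 577 aa.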


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATATTC AGGCTCTTCT CTCAGAAAAA GTCCGTCAGG CCATGATTGC GGCAGGCGCG 
CCTGCGGATT GCGAACCGCA GGTTCGTCAG TCAGCAAAAG TTCAGTTCGG CGACTATCAG
GCTAACGGCA TGATGGCAGT TGCTAAAAAA CTGGGTATGG CACCGCGACA ATTAGCAGAG
CAGGTGCTGA CTCATCTGGA TCTTAACGGT ATCGCCAGCA AAGTTGAGAT CGCCGGTCCA
GGCTTTATCA ACATTTTCCT TGATCCGGCA TTCCTGGCTG AACATGTTCA GCAGGCGCTG
GCGTCCGATC GTCTCGGTGT TGCTACGCCA GAAAAACAGA CCATTGTGGT TGACTACTCT
GCGCCAAACG TGGCGAAAGA GATGCACGTC GGTCACCTGC GCTCTACCAT TATTGGTGAC
GCAGCAGTGC GTACTCTGGA GTTCCTCGGT CACAAAGTGA TTCGCGCAAA CCACGTCGGC
GACTGGGGCA CTCAGTTCGG TATGCTGATT GCATGGCTGG AAAAGCAGCA GCAGGAAAAC
GCCGGTGAAA TGGAGCTGGC TGACCTTGAA GGTTTCTACC GCGATGCGAA AAAGCATTAC
GACGAAGATG AAGAGTTTGC CGAGCGTGCA CGTAACTACG TGGTAAAACT GCAAAGCGGT
GACGAATATT TCCGCGAGAT GTGGCGCAAA CTGGTCGACA TCACCATGAC GCAGAACCAG
ATCACTTACG ATCGTCTCAA CGTGACGCTG ACCCGTGATG ACGTGATGGG CGAAAGCCTC
TACAACCCAA TGCTGCCAGG AATTGTGGCG GATCTCAAAG CCAAAGGTCT GGCAGTAGAA
AGCGAAGGGG CGACCGTTGT ATTCCTTGAT GAGTTTAAAA ACAAGGAAGG CGAACCGATG
GGCGTGATCA TTCAGAAGAA AGATGGCGGC TATCTCTACA CCACCACTGA TATCGCCTGT
GCGAAATATC GTTATGAAAC ACTGCATGCC GATCGCGTGC TGTATTACAT CGACTCCCGT
CAGCATCAAC ACCTGATGCA GGCATGGGCG ATCGTCCGTA AAGCAGGCTA TGTACCGGAA
TCCGTACCGC TGGAACACCA CATGTTCGGC ATGATGCTGG GTAAAGACGG CAAACCGTTC
AAAACCCGCG CGGGTGGTAC AGTGAAACTG GCCGATCTGC TGGATGAAGC CCTGGAACGT
GCACGCCGTC TGGTGGCAGA AAAGAACCCG GATATGCCAG CCGACGAGCT GGAAAAACTG
GCTAACGCGG TTGGTATTGG TGCGGTGAAA TATGCGGATC TCTCCAAAAA CCGCACCACG
GACTACATCT TCGACTGGGA CAACATGCTG GCGTTTGAGG GTAATACCGC GCCATACATG
CAGTATGCAT ACACGCGTGT ATTGTCCGTG TTCCGTAAGG CAGAAATTAA CGAAGAGCAA
CTGGCTGCAG CTCCGGTAAT CATCCGTGAA GATCGTGAAG CGCAACTGGC AGCTCGCCTG
CTGCAGTTTG AAGAAACCCT CACCGTGGTT GCCCGTGAAG GCACGCCGCA TGTGATGTGT
GCTTACCTGT ACGATCTGGC TGGTCTGTTC TCTGGCTTCT ACGAGCACTG CCCGATCCTC
AGCGCAGAAA ACGAAGAAGT GCGTAACAGC CGTCTGAAAC TGGCACAACT GACAGCGAAG
ACGCTAAAGC TGGGTCTGGA TACGCTGGGT ATTGAGACAG TAGAGCGTAT GTAA
 
Protein sequence
MNIQALLSEK VRQAMIAAGA PADCEPQVRQ SAKVQFGDYQ ANGMMAVAKK LGMAPRQLAE 
QVLTHLDLNG IASKVEIAGP GFINIFLDPA FLAEHVQQAL ASDRLGVATP EKQTIVVDYS
APNVAKEMHV GHLRSTIIGD AAVRTLEFLG HKVIRANHVG DWGTQFGMLI AWLEKQQQEN
AGEMELADLE GFYRDAKKHY DEDEEFAERA RNYVVKLQSG DEYFREMWRK LVDITMTQNQ
ITYDRLNVTL TRDDVMGESL YNPMLPGIVA DLKAKGLAVE SEGATVVFLD EFKNKEGEPM
GVIIQKKDGG YLYTTTDIAC AKYRYETLHA DRVLYYIDSR QHQHLMQAWA IVRKAGYVPE
SVPLEHHMFG MMLGKDGKPF KTRAGGTVKL ADLLDEALER ARRLVAEKNP DMPADELEKL
ANAVGIGAVK YADLSKNRTT DYIFDWDNML AFEGNTAPYM QYAYTRVLSV FRKAEINEEQ
LAAAPVIIRE DREAQLAARL LQFEETLTVV AREGTPHVMC AYLYDLAGLF SGFYEHCPIL
SAENEEVRNS RLKLAQLTAK TLKLGLDTLG IETVERM