Gene EcSMS35_1310 details

Gene Information

Locus tag: EcSMS35_1310
Symbol: argS
ID: 6147131
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1298565
End bp: 1300298
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 53%
IMG OID: 641616188
Product: arginyl-tRNA synthetase
Protein accession: YP_001743368
Protein GI: 170683284
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
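
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1298565
end_bp = 1300298

# Assumption: coordinates are 1-based and inclusive, so the span
# contains (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the computed values agree with the listed gene length of 1734 bp and protein length of 577 aa.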


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 42
Fosmid unclonability p-value: 0.363287
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAATATTC AGGCTCTTCT CTCAGAAAAA GTCCGTCAGG CCATGATTGC GGCAGGCGCG 
CCTGCGGATT GCGAACCGCA GGTTCGTCAG TCAGCAAAAG TTCAGTTCGG CGACTATCAG
GCTAACGGCA TGATGGCAGT TGCTAAAAAA CTGGGTATGG CACCGCGACA ATTAGCAGAG
CAGGTGCTGA CTCATCTGGA TCTTAACGGT ATCGCCAGCA AAGTTGAGAT CGCGGGTCCT
GGCTTTATCA ACATTTTCCT TGATCCGGCA TTCCTGGCTG ATCATGTTCA GCAGGCGCTG
GCGTCCGATC GTCTCGGTGT TGCTACGCCA GAAAAACAGA CCATTGTGGT TGACTACTCT
GCGCCAAACG TGGCGAAAGA GATGCATGTC GGTCACCTGC GCTCTACCAT TATTGGTGAC
GCAGCAGTTC GTACTCTGGA GTTCCTCGGT CACAAAGTGA TTCGCGCAAA CCACGTCGGC
GACTGGGGCA CTCAGTTCGG TATGCTGATT GCGTGGCTGG AAAAGCAGCA GCAGGAAAAC
GCCGGTGAAA TGGAGCTGGC TGACCTTGAA GGTTTCTACC GCGATGCGAA AAAGCATTAC
GACGAAGATG AAGAGTTTGC CGAGCGTGCA CGTAACTACG TGGTAAAACT GCAAAGCGGT
GACGAATATT TCCGCGAGAT GTGGCGCAAA CTGGTCGACA TCACCATGAC GCAGAACCAG
ATCACCTACG ATCGTCTCAA CGTGACGCTG ACCCGTGATG ACGTGATGGG TGAAAGCCTC
TACAACCCAA TGCTGCCGGG TATCGTAGCG GACCTGAAAG CCAAAGGTCT GGCAGTAGAA
AGTGAAGGCG CGACCGTCGT ATTCCTTGAT GAGTTCAAAA ACAAGGAAGG CGAACCGATG
GGCGTGATCA TTCAGAAGAA AGATGGTGGC TATCTCTACA CCACCACTGA TATCGCCTGT
GCGAAATATC GTTATGAAAC ACTGCATGCC GATCGCGTGC TGTATTACAT CGACTCCCGT
CAGCATCAAC ACCTGATGCA GGCGTGGGCT ATCGTCCGCA AAGCAGGCTA TGTACCGGAA
TCCGTACCGC TGGAACACCA CATGTTCGGC ATGATGCTGG GTAAAGACGG CAAACCGTTC
AAAACCCGCG CGGGTGGTAC AGTGAAACTG GCAGACCTGC TGGATGAAGC CCTGGAACGC
GCACGCCGTC TGGTGGCAGA GAAGAACCCG GATATGCCAG CCGATGAGCT GGAAAAACTG
GCTAATGCGG TTGGTATTGG TGCGGTTAAA TACGCAGATC TCTCCAAAAA CCGTACCACT
GACTACATCT TCGACTGGGA CAACATGCTG GCGTTTGAGG GTAATACCGC GCCATACATG
CAGTATGCTT ACACGCGTGT ATTGTCCGTG TTCCGTAAAG CAGAAATTGA CGAAGAGCAA
CTGGCTGCAG CTCCGGTAAT CATCCGTGAA GATCGTGAAG CACAACTGGC AGCTCGCCTG
CTGCAGTTTG AAGAAACCCT CACTGTGGTT GCCCGTGAAG GCACGCCGCA TGTGATGTGT
GCTTACCTGT ACGATCTGGC CGGTCTGTTC TCTGGCTTCT ACGAGCACTG CCCGATCCTC
AGCGCAGAAA ACGAAGAAGT GCGTAACAGC CGTCTGAAAC TGGCACAACT GACGGCGAAG
ACGCTGAAAC TGGGTCTGGA TACGCTGGGT ATTGAGACAG TAGAGCGTAT GTAA
 
Protein sequence
MNIQALLSEK VRQAMIAAGA PADCEPQVRQ SAKVQFGDYQ ANGMMAVAKK LGMAPRQLAE 
QVLTHLDLNG IASKVEIAGP GFINIFLDPA FLADHVQQAL ASDRLGVATP EKQTIVVDYS
APNVAKEMHV GHLRSTIIGD AAVRTLEFLG HKVIRANHVG DWGTQFGMLI AWLEKQQQEN
AGEMELADLE GFYRDAKKHY DEDEEFAERA RNYVVKLQSG DEYFREMWRK LVDITMTQNQ
ITYDRLNVTL TRDDVMGESL YNPMLPGIVA DLKAKGLAVE SEGATVVFLD EFKNKEGEPM
GVIIQKKDGG YLYTTTDIAC AKYRYETLHA DRVLYYIDSR QHQHLMQAWA IVRKAGYVPE
SVPLEHHMFG MMLGKDGKPF KTRAGGTVKL ADLLDEALER ARRLVAEKNP DMPADELEKL
ANAVGIGAVK YADLSKNRTT DYIFDWDNML AFEGNTAPYM QYAYTRVLSV FRKAEIDEEQ
LAAAPVIIRE DREAQLAARL LQFEETLTVV AREGTPHVMC AYLYDLAGLF SGFYEHCPIL
SAENEEVRNS RLKLAQLTAK TLKLGLDTLG IETVERM
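
The gene sequence begins with GTG while the protein sequence begins with M, which is consistent with translation table 11 (bacterial), where GTG can act as a start codon. The sketch below shows one way to reproduce the protein sequence from the gene sequence. It assumes the standard codon assignments (which table 11 shares with the standard code) and the set of alternative start codons commonly listed for table 11. The function name `translate_cds` and the constants are hypothetical helpers written for illustration, not part of any database tooling.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 uses the same codon-to-amino-acid mapping
# and differs only in which codons may initiate translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Assumption: start codons commonly listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna):
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the record's layout.
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiating codon is read as methionine.
        if i == 0 and codon in START_CODONS:
            aa = "M"
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence from the record.
print(translate_cds("GTGAATATTC AGGCTCTTCT CTCAGAAAAA"))
```

Passing the full gene sequence in place of the 30-base excerpt should, under the same assumptions, yield the protein sequence shown in the record.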