Gene ECH_0537 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0537 
SymbolargS 
ID3927980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp537290 
End bp539020 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content30% 
IMG OID637901659 
Productarginyl-tRNA synthetase 
Protein accessionYP_507350 
Protein GI88657836 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATTTAA TTAATACTGT TAAAAATTGT ATAGTGGAAA AGTTACATAT ATTAAGTGAT 
AAGAAATTAA TAGTGCTGGA TGATGTTATA TTAAGTAAGT TAATAGTTGA TTATCCTAAT
AATCATAATC ATGGGGATTT GTATACTAAT GCAGCTTTAA TTTTAAGTAG TCATATAAAA
AAAAGTCCAC TAGAGATAGC AGAAATTTTA CTATCTGAAT TTTCTAATAT TAAGGAAATA
TCTAGCATTA ATGTTGTGAA GCCAGGTTTT ATTAATTTTA GTATTTCTCT TTATGTATGG
TATGAAGTAG TAGCTTCTAT TAATATGTTA AAAGAGGGTT TTGCTAATGT TAATATAGGA
AATGGACAGA AAGTTAATGT AGAATTTGTT TCTGCGAATC CTACTGGACC AATGCATATT
GGACATGCTC GTGGTGCTAT ATTTGGTGAT GTATTAGCGA ATTTATTGGA AAAAGTAGGA
TATCAAGTTG TTAGGGAATA TTATATTAAT GATGCTGGTA CACAAATAGA CGTGTTAGTT
GAATCTGTAT ATTTAAGATA TAAAGAAGCT ACAGGTCAAG ATATTGTAAT AGGCAGTGGG
TTGTACCCAG GATTATATTT ACGAGAGATA GGAAAGCTTC TATATGAAAA ATATGGGACA
GATTTATTAG AGATGAGTTT TGTCCGTAAG ATGAAGATTA TACGTGATGT ATCTCTTGAA
TATCTTATGA ATCTTATTAA AGAAGATCTT GCATTGTTGG GAATTGAGCA TGATGTTTTT
ACGTCAGAGG CTGAGTTATT AAAAAATAAT ATTGTAGAAA AGTGTGTGAA ACTTTTAGAA
GATAAGCAAT TAATATATTA TGGAGTGTTA GAGCAACCAA AAGGGACAGA AATGCAAAAT
TGGAAGCCTA GAACACAAAT GTTGTTTAAA TCTACTGATT TCGGAGATGA TGTAGATAGA
GCTTTACAGA AAGTTGATGG TAGTTGGACT TATTTTGCAA ACGATATAGC TTATCATTTT
GATAAGATAT CACGTGGTTT TCAACATATG ATTTTAGAGT TGGGTTCTGA CCATATTGGT
TATGTAAAAA GATTAAAAGC TGCAGTGAAG GCATTAAGTA ATAATAATGC TACTGTAGAT
ATAAAATTGC ACAATACTGT TAATTTTCTT GATGATGGAG TGCAAGTAAA GATGTCGAAA
AGATCTGGTG AATTTCTAAC TATAAGAGAT GTTATAGAAA AAGTTGGTAA AGATGTAGTC
AGATTTATGA TGTTAACTCG TAAGAGTGAT GTAGTTTTGG ATTTTGATTT TGCTAAAGTT
GTTGAACAAT CTAAAAATAA TCCAATATTT TATGTACAAT ATGCTCATGC TCGTGTTTGT
TCATTAATGC GTAATGCTCC AAATATTTTA GGAATAGAGG ATACGGATTT TTCTGTGTTA
TCTTCAAAGG AAGAAATACT GTTAATTAAG TTACTTGCAA AATGGCCAAA TGTAGTTGAG
ATGTCTGCAA AAACTGCAGA GCCACATCGC ATAACTTTTT ATTTAATAGA AGTTGCTGAA
GCTTTTCATG TGTTATGGGG ATATGGAAAT AAGAATGCAA ATAGACGTTT TATTATAGAT
AATGATGTTA ATCTTACATC TGCAAGGATA TATTTGGCTA AATCTGTTGC GTATATTATC
AGTAGTGGGT TAAAAATATT TTCCATAGTT CCTTTAGAGG AAATGCATTA A
 
Protein sequence
MNLINTVKNC IVEKLHILSD KKLIVLDDVI LSKLIVDYPN NHNHGDLYTN AALILSSHIK 
KSPLEIAEIL LSEFSNIKEI SSINVVKPGF INFSISLYVW YEVVASINML KEGFANVNIG
NGQKVNVEFV SANPTGPMHI GHARGAIFGD VLANLLEKVG YQVVREYYIN DAGTQIDVLV
ESVYLRYKEA TGQDIVIGSG LYPGLYLREI GKLLYEKYGT DLLEMSFVRK MKIIRDVSLE
YLMNLIKEDL ALLGIEHDVF TSEAELLKNN IVEKCVKLLE DKQLIYYGVL EQPKGTEMQN
WKPRTQMLFK STDFGDDVDR ALQKVDGSWT YFANDIAYHF DKISRGFQHM ILELGSDHIG
YVKRLKAAVK ALSNNNATVD IKLHNTVNFL DDGVQVKMSK RSGEFLTIRD VIEKVGKDVV
RFMMLTRKSD VVLDFDFAKV VEQSKNNPIF YVQYAHARVC SLMRNAPNIL GIEDTDFSVL
SSKEEILLIK LLAKWPNVVE MSAKTAEPHR ITFYLIEVAE AFHVLWGYGN KNANRRFIID
NDVNLTSARI YLAKSVAYII SSGLKIFSIV PLEEMH