Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0740 |
Symbol | proS |
ID | 3927018 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 749166 |
End bp | 750440 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 637901859 |
Product | prolyl-tRNA synthetase |
Protein accession | YP_507541 |
Protein GI | 88657672 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0442] Prolyl-tRNA synthetase |
TIGRFAM ID | [TIGR00409] prolyl-tRNA synthetase, family II |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.944832 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTTTAT CAGATTACTA TGTACCTACA TTAAAAGAAA CATCTGCTGA TATATCAGTA ATATCACATA AATATTCTAT ACGTGCTGGT CTTATCAAGC AAATTGCCTC CGGCATATAC ACTTGGCTTC CCTTAGGGTT AAAGGTACTA AAAAATATTG AAAACATAGT CAGAGAAGAA ATGAATAAAT CAGGGTCTTT AGAAATATTG ATGCCACTGA TACAACCAGC AAGCTTATGG AAAGAATCAG GAAGATACGA TGACTATGGA TCTGAAATGT TACGCATTAC AGATAGAAAT CAACGAGAAA TGCTTTTTGG TCCAACCCAT GAAGAAGTAA TCACTGATAT TCTAAGGACA ACACCAGTAA GTCATAAAGA TCTACCACTA ATTCTATACC AAATACAATG GAAATTTCGC GATGAATTAC GGCCAAGATA TGGTATAATG AGATGTAGAG AATTCTTAAT GAAAGATGCA TATAGTTTTG ATAAAGATTT CAGTGGTGCT ATTTCATCTT ATAACTTAAT GTTCAAAACT TACATCAAAA TTTTTCAAAA GTTAGGCTTA ACTCCAATAG CAGTTAAAGC AGATTCAGGA CCCATAGGAG GAAATTTAAG TCATGAATTT CATGTATTAG CAAATTCTGG AGAAAGCACC TTATACTATG ACCAAGACAT TATTGAATTA ATGAATAGTG AGAGTATTGA TGTTGAAAAA ATAAAAAATA CTTACACTGC AGCAGATGAC ATGCATGATC CTCAGGCTTG CCCTATTTCA TCAGATAAAG TAAAAATAAG TAAAGGAATA GAGATAGGTC ATATCTTTCA TCTAGGAGAT AAATATTCAA AACCTATGAA TGCTAATTTT TGCGATAGCA ATAATAATAA GCTCCTACAA ATGGGATGTT ATGGCATAGG AGTATCAAGG CTAGTAGCAG CAATAATTGA AGTATTTCAT GATAATAAAG GCATTATTTG GCCAGAAACA GTAGCCCCAT TTAAATTTTC CTTAGTAAAC TTATATACAT CAAATGATAA ATGTAAGAAA GTTGCAGAAA ATCTACACAT GCAGTTATAT GATGACGTTC TATATGATGA CACAGATGAT AGTCCTGGTA TTAAGTTAGC AAGAACAGAT CTGCTAGGTA TGCCATGGCA AGTTATAATT GGTAAATCAA CAGTAGAACA AGACCTTATT GAAGTAAGGA ATAGATTAAC AAAAGATAAA GTTTTAATTT CCACAGAACA ATTCTTAAAT AAATTAAAAA AATGA
|
Protein sequence | MRLSDYYVPT LKETSADISV ISHKYSIRAG LIKQIASGIY TWLPLGLKVL KNIENIVREE MNKSGSLEIL MPLIQPASLW KESGRYDDYG SEMLRITDRN QREMLFGPTH EEVITDILRT TPVSHKDLPL ILYQIQWKFR DELRPRYGIM RCREFLMKDA YSFDKDFSGA ISSYNLMFKT YIKIFQKLGL TPIAVKADSG PIGGNLSHEF HVLANSGEST LYYDQDIIEL MNSESIDVEK IKNTYTAADD MHDPQACPIS SDKVKISKGI EIGHIFHLGD KYSKPMNANF CDSNNNKLLQ MGCYGIGVSR LVAAIIEVFH DNKGIIWPET VAPFKFSLVN LYTSNDKCKK VAENLHMQLY DDVLYDDTDD SPGIKLARTD LLGMPWQVII GKSTVEQDLI EVRNRLTKDK VLISTEQFLN KLKK
|
| |