Gene ECH_0740 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0740 
SymbolproS 
ID3927018 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp749166 
End bp750440 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content32% 
IMG OID637901859 
Productprolyl-tRNA synthetase 
Protein accessionYP_507541 
Protein GI88657672 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.944832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTAT CAGATTACTA TGTACCTACA TTAAAAGAAA CATCTGCTGA TATATCAGTA 
ATATCACATA AATATTCTAT ACGTGCTGGT CTTATCAAGC AAATTGCCTC CGGCATATAC
ACTTGGCTTC CCTTAGGGTT AAAGGTACTA AAAAATATTG AAAACATAGT CAGAGAAGAA
ATGAATAAAT CAGGGTCTTT AGAAATATTG ATGCCACTGA TACAACCAGC AAGCTTATGG
AAAGAATCAG GAAGATACGA TGACTATGGA TCTGAAATGT TACGCATTAC AGATAGAAAT
CAACGAGAAA TGCTTTTTGG TCCAACCCAT GAAGAAGTAA TCACTGATAT TCTAAGGACA
ACACCAGTAA GTCATAAAGA TCTACCACTA ATTCTATACC AAATACAATG GAAATTTCGC
GATGAATTAC GGCCAAGATA TGGTATAATG AGATGTAGAG AATTCTTAAT GAAAGATGCA
TATAGTTTTG ATAAAGATTT CAGTGGTGCT ATTTCATCTT ATAACTTAAT GTTCAAAACT
TACATCAAAA TTTTTCAAAA GTTAGGCTTA ACTCCAATAG CAGTTAAAGC AGATTCAGGA
CCCATAGGAG GAAATTTAAG TCATGAATTT CATGTATTAG CAAATTCTGG AGAAAGCACC
TTATACTATG ACCAAGACAT TATTGAATTA ATGAATAGTG AGAGTATTGA TGTTGAAAAA
ATAAAAAATA CTTACACTGC AGCAGATGAC ATGCATGATC CTCAGGCTTG CCCTATTTCA
TCAGATAAAG TAAAAATAAG TAAAGGAATA GAGATAGGTC ATATCTTTCA TCTAGGAGAT
AAATATTCAA AACCTATGAA TGCTAATTTT TGCGATAGCA ATAATAATAA GCTCCTACAA
ATGGGATGTT ATGGCATAGG AGTATCAAGG CTAGTAGCAG CAATAATTGA AGTATTTCAT
GATAATAAAG GCATTATTTG GCCAGAAACA GTAGCCCCAT TTAAATTTTC CTTAGTAAAC
TTATATACAT CAAATGATAA ATGTAAGAAA GTTGCAGAAA ATCTACACAT GCAGTTATAT
GATGACGTTC TATATGATGA CACAGATGAT AGTCCTGGTA TTAAGTTAGC AAGAACAGAT
CTGCTAGGTA TGCCATGGCA AGTTATAATT GGTAAATCAA CAGTAGAACA AGACCTTATT
GAAGTAAGGA ATAGATTAAC AAAAGATAAA GTTTTAATTT CCACAGAACA ATTCTTAAAT
AAATTAAAAA AATGA
 
Protein sequence
MRLSDYYVPT LKETSADISV ISHKYSIRAG LIKQIASGIY TWLPLGLKVL KNIENIVREE 
MNKSGSLEIL MPLIQPASLW KESGRYDDYG SEMLRITDRN QREMLFGPTH EEVITDILRT
TPVSHKDLPL ILYQIQWKFR DELRPRYGIM RCREFLMKDA YSFDKDFSGA ISSYNLMFKT
YIKIFQKLGL TPIAVKADSG PIGGNLSHEF HVLANSGEST LYYDQDIIEL MNSESIDVEK
IKNTYTAADD MHDPQACPIS SDKVKISKGI EIGHIFHLGD KYSKPMNANF CDSNNNKLLQ
MGCYGIGVSR LVAAIIEVFH DNKGIIWPET VAPFKFSLVN LYTSNDKCKK VAENLHMQLY
DDVLYDDTDD SPGIKLARTD LLGMPWQVII GKSTVEQDLI EVRNRLTKDK VLISTEQFLN
KLKK