Gene ECH_0006 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0006 
SymbolthrS 
ID3927301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp3683 
End bp5584 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content32% 
IMG OID637901131 
Productthreonyl-tRNA synthetase 
Protein accessionYP_506841 
Protein GI88657649 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.999215 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAAACA TCCACTTTAG TAACAATTTA TGCAAGCAGT TTCATCGTGG AATTAAAGGA 
CATGATATAG TAAATGACCT ATTTCCAAAA TTAAAAAACG AAACAATAGC TGCAAAAGTA
AACGGAGAAT TATACGATTT ATCAAGAGAA ATCATAGAAA ATTGCACTTT TGAAGTCATA
ACGATAAACA GCGAAGAAGG TCTTGAAATT ATACGTCATG ATACTGCTCA CATCATGGCA
CAAGCCGTGA AAGAAATGTT TCCAGATGTT CAAATAACTA TTGGACCAAC TATTAAAGAC
GGTTTTTATT ATGATTTCGC AACAAATCAT AACTTTTCTA GTGATGACCT AGAAATAATA
GAAAAGAAAA TGATAGAAAT TATAAATAAA AATGAAAGCT TTATACGGGA AGTATGGTCT
AGAGAAGAAG CTATAAAATT CTTCTCAAGT ATAGGAGAAG ACTATAAGGT CAAAATAATA
TCTAATATTC CAAGTAATGA GAATATTACT GTTTATAAGC AGGGAAGTTT TACTGATTTA
TGTCGTGGAC CACATGCACC TTCAACAAAA ACATCAAGAG CATTTAAATT AACAAAAGTA
TCTGGATCTT ATTGGCAAGG CAATTCAAAT AATGAAAGAT TACAAAGAAT TTATGGTACA
GCATGGCGTA ATGAAGAAGA ATTAAAACTT TACTTAAATA ATTTAATAGA AGCAGAAAAA
AGAGATCACA GAAAAATTGG TAGGGAATTA GAGTTGTTTC ATATTCAAAA TGAAGCATGC
GGTCAAATAT TTTGGCACAC AAAAGGGTGG ACTATTTATC GTATTATAGA AAATTATATA
CGAAAAAAAC TAGAAAATAA TGGATATATA GAAGTAAAAA CCCCCATATT GTTAAACAAA
GAACTTTGGG AAAAATCAGG ACATTGGGAT AAGTTTCGTG AAAACATGTT TTTAAGTGAA
GCAGAAGATA AAACTTTAGC CATAAAACCA ATGAATTGCC CATGCCACAT ACAGATCTTT
AACTCAAAAA TTAGAAGTTA TCGAGATTTA CCAATACGAA TGGCAGAATT TGGAACATGT
CATAGGTATG AAGCATCAGG AGCATTACAT GGGTTAATGA GAGTTCGCGG TTTTACTCAA
GATGATGCTC ACATTTTCTG TACAGAAAGC CAAATTACAT CTGAAGCTCT AAAGTTTTGT
AATTTACTCA TAGAAATCTA CAAAGATTTT GGTTTCACCG ATATTCTAGT AAAATTTTCA
GATCGTCCAA AGAACAGAGC TGGTAGCGAT GAAATATGGG ATAAAGCTGA AGCAGCATTA
AAGAAATCAG TCGAAGTTGC TAACTTAAGT TATGTTTTAA ACCCTGGAGA TGGAGCATTC
TATGGACCAA AATTAGAATT TACATTAAAA GATGCTATTG GCAGAGAATG GCAATGTGGA
ACTCTACAAA TGGATTTCGT ATTACCTGAA AGGTTAGGTG CCTACTATAT AGGTAGTGAT
GGGAAAAAAC ATCATCCAGT AATGCTACAC CGTGCTATCT TAGGTACTTT TGAAAGGTTC
ATAGGAATAT TAATTGAACA TCATTCAGGC AAATTTCCAA TGTGGTTAGC TCCTATACAA
TTATCAATAT TAACTATCAG TGAAGATTCC ATTAACTATG CTAATTCCTT GAAAATAAAA
GCCGAAGAAC ATAATATAAG AGTTGAACTA GATACTACAA ACGAAAAAAT AAATTATAAG
ATACGGAACC ATATCCACAA GAAAGTACCT GTGTTCTGGA TAGTAGGTAA AAAAGAAGTT
GAAGAAAATT CTGTATCTAT ACGTTATTTA GAGTCCAATA AGCAGCACGT TATGCCTATT
GATAAAGCGT TGAAAACATT ATTAACTTGT GCTAGCATCT AG
 
Protein sequence
MINIHFSNNL CKQFHRGIKG HDIVNDLFPK LKNETIAAKV NGELYDLSRE IIENCTFEVI 
TINSEEGLEI IRHDTAHIMA QAVKEMFPDV QITIGPTIKD GFYYDFATNH NFSSDDLEII
EKKMIEIINK NESFIREVWS REEAIKFFSS IGEDYKVKII SNIPSNENIT VYKQGSFTDL
CRGPHAPSTK TSRAFKLTKV SGSYWQGNSN NERLQRIYGT AWRNEEELKL YLNNLIEAEK
RDHRKIGREL ELFHIQNEAC GQIFWHTKGW TIYRIIENYI RKKLENNGYI EVKTPILLNK
ELWEKSGHWD KFRENMFLSE AEDKTLAIKP MNCPCHIQIF NSKIRSYRDL PIRMAEFGTC
HRYEASGALH GLMRVRGFTQ DDAHIFCTES QITSEALKFC NLLIEIYKDF GFTDILVKFS
DRPKNRAGSD EIWDKAEAAL KKSVEVANLS YVLNPGDGAF YGPKLEFTLK DAIGREWQCG
TLQMDFVLPE RLGAYYIGSD GKKHHPVMLH RAILGTFERF IGILIEHHSG KFPMWLAPIQ
LSILTISEDS INYANSLKIK AEEHNIRVEL DTTNEKINYK IRNHIHKKVP VFWIVGKKEV
EENSVSIRYL ESNKQHVMPI DKALKTLLTC ASI