Gene ECH_1000 details

Gene Information

Locus tag: ECH_1000
Symbol: metG
ID: 3927157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1024001
End bp: 1025521
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 33%
IMG OID: 637902116
Product: methionyl-tRNA synthetase
Protein accession: YP_507787
Protein GI: 88657921
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0143] Methionyl-tRNA synthetase
TIGRFAM ID: [TIGR00398] methionyl-tRNA synthetase
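
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are both inclusive and that the CDS ends in a single stop codon; neither is stated in the record, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1024001, 1025521)
print(length)                     # 1521
print(protein_length_aa(length))  # 506
```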


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATAATA TATATATTAC TACTCCTATT TATTATGTTA ATGATGTGCC ACATATAGGT 
CATGTTTATA CAACTTTGAT ATCTGATATT ATCGCACGGT TTATGCGTCT TGATGGACAT
AGTGTAAAGT TTATAACTGG AACAGATGAA CATGGGCAAA AAATAGAAAA AGCAGCCCAA
GAACGTAACA TGTCATTATT AGATTTTACT GATAATACAA GTGCTGTTTT TAGACAGTTA
GCTGATGTTA TGAACTATAG CTATGATGAT TTTATTCGTA CGACAGAAAC AAGGCACAAA
AAGACTGTGG TAGCTTTATG GCAGCGTTTG TGTGATAATG GGGCAATATA TTTAGGTAGT
TATTCTGGTT GGTATTCGGT AAGAGATGAA ACGTTTTATC AGGAGAAAGA ATTGGTCGAT
GGTAAAGCTC CTACAGGAGC TGATGTAGAA TGGATAGAAG AGCCGAGTTA TTTTTTTCGA
TTGTCAAATT TTCAGGAAAG GTTGTTAGCT TTTTACGAGG AAAATCCTGA TTTTGTTATT
CCTAAATATC GGTATAATGA AGTTATATCA TTTGTTAAAT CAGGGTTGAA AGATCTTTCT
GTATCAAGAC AAAATGTTTT ATGGGGAATA AAAGTACCAA ATGATGATAA ACATGTAATT
TATGTTTGGG TAGATGCTTT AGCAAATTAC TTGACAGTAT TAGGGTTTCC TGATGAGAAC
CATAAAGATT ATCAAGCTTA TTGGGCAAGT GATAGTAGTT CTGTTTTACA GGTTGTGGGT
AAGGATATAT TGAGGTTTCA TGCTGTATAT TGGCCTGCTA TTTTAATGGC AGCTGAATTG
CCTTTACCTA AAAAAATCTT GGCACATGGT TGGTGGACTA ATGAGGGGCA AAAAATTTCT
AAGTCGTTAG GTAATGTTAT TAAACCTTTT GACCTAGTAG AAGAATTTGG AGTTGATCAA
TTAAGATATT TTTTAATTAA AGAAATGCCA ATAGGGAACG ATGGAGATTT TAAAAGAAAT
AGTCTTATTA ATTGTATAAA TTATGATTTA GCAAATAATA TAGGGAACCT TGTACAAAGA
ACTGTTTCAC TATTGTATAA AGAATGTGGA GGGATAGTAC CAACAGTAAG TGGCAATTTG
CTGCAGGGTG AGGAAGTATT ACCAGATTAT CAAGAGATTC TTGAAAAGGT TAGAGATTGT
GTAATGCGTT GTAATCTGAA TGAGATGATA CATATTATAG AGCAATTATC TTCTGCAGCT
AATGAATATA TTGCATCACG AGCGCCATGG AGGTTATCTA AGAGTGACCC TAAAATTATG
GAAGCAGTAT TGTATAAATT ACTTGAATAT ATTAAATGTA TAGGGTTGCT GTTGCAACCT
GTCATGCCTA AATTATCATC TAAAATATTA GATCAAATTG GTTTACCAGA ATGTAATCGT
GACTTTAGTC GATTTTCTAT ACCTATAAAT ATGAATACAG TTTTACCAAA ACCAGAGCCT
ATTTTTGCAA AAATCTTATG A
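
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is used. The following is a minimal sketch of how the length and GC content of such blocked text might be recomputed for comparison with the Gene Length and GC content fields; the helper names are hypothetical and are not part of the source database.

```python
def clean_sequence(blocked_text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(blocked_text.split()).upper()


def gc_content(blocked_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence given as blocked text."""
    seq = clean_sequence(blocked_text)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Usage with the first two blocks of the gene sequence only; paste the full
# sequence text in place of `sample` to check the whole gene.
sample = "ATGAATAATA TATATATTAC"
print(len(clean_sequence(sample)))  # number of bases after removing spaces
print(gc_content(sample))           # GC percentage of the sample
```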
 
Protein sequence
MNNIYITTPI YYVNDVPHIG HVYTTLISDI IARFMRLDGH SVKFITGTDE HGQKIEKAAQ 
ERNMSLLDFT DNTSAVFRQL ADVMNYSYDD FIRTTETRHK KTVVALWQRL CDNGAIYLGS
YSGWYSVRDE TFYQEKELVD GKAPTGADVE WIEEPSYFFR LSNFQERLLA FYEENPDFVI
PKYRYNEVIS FVKSGLKDLS VSRQNVLWGI KVPNDDKHVI YVWVDALANY LTVLGFPDEN
HKDYQAYWAS DSSSVLQVVG KDILRFHAVY WPAILMAAEL PLPKKILAHG WWTNEGQKIS
KSLGNVIKPF DLVEEFGVDQ LRYFLIKEMP IGNDGDFKRN SLINCINYDL ANNIGNLVQR
TVSLLYKECG GIVPTVSGNL LQGEEVLPDY QEILEKVRDC VMRCNLNEMI HIIEQLSSAA
NEYIASRAPW RLSKSDPKIM EAVLYKLLEY IKCIGLLLQP VMPKLSSKIL DQIGLPECNR
DFSRFSIPIN MNTVLPKPEP IFAKIL