Gene ECH_0688 details


Gene Information

Locus tag: ECH_0688
Symbol:
ID: 3927841
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 697666
End bp: 698859
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 30%
IMG OID: 637901809
Product: putative deoxyguanosinetriphosphate triphosphohydrolase
Protein accession: YP_507493
Protein GI: 88658433
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0232] dGTP triphosphohydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative
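
The start, end, gene length, and protein length values above are mutually consistent, and the arithmetic can be sketched in a few lines of Python. This is an illustrative check and not part of the original record: the function names are hypothetical, and it assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS whose final codon is a stop."""
    codons, remainder = divmod(gene_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode an amino acid


length = gene_length(697666, 698859)
print(length, protein_length(length))  # prints "1194 397"
```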


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.154263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAGATC TAGTAAAATA TGCATGTAAT CCTACTGAAA CAAGAGGTAG AATATTTCAT 
GAAGAGGAAG ACCAATATTG CGACTGTTAT CAAAGGGATC GTGACCGCAT TATATATTCG
GGTGCATTCC GTAAATTACA GTATAAAACA CAAGTATTTA TTAATTATGA AAATGATTAT
TATAGAACAA GACTAACTCA TAGTCTTGAA GTTGCTCAAA TAGCAAGGTC ATTAAGTAGA
AAGCTGAGAT TTAATGAAGA TTTAACAGAA GCTATATCTT TAGCACATGA TTTGGGTCAT
CCTCCATTTG GGCATGCAGG AGAAGATGCT TTAAATGAGA TGACTATGCA TCATTTAGGA
TTTGATCATA ATATTCAGGC TCTGAGGATA TTAACATTTT TAGAAAAAAG ATATATAAAA
TTTGATGGCA TGAACCTTAC TTGGGAAACA TTGGAAGGAG TAGCTAAACA TAATGGTCCT
ATTACAGGGG AAAATCGAAT TAATTCTAAT AAAAAAATAC ATAAGTTTAT GTTAGATTAT
GATTCATATT ATAAGTTAGA CCTTGACAAT TTTTCTAGTG CTGAAGCTCA AATTGCATCT
ATCTCTGATG ATATAGCTTA TAATATGCAT GATATTGATG ATGGAATAAG AGCAAAAATT
TTGGTTATAG AGGAATTATT AGAATTACCA TTGATTGGGG ACATCTTAAA GAAAGTAATA
GATGATAACT CTGGATTAAG TGTATCTGAT AATCGAATTG TGCATGAGTT TCTAAGAAGA
ACTGTTGATA TTATGTTAAT GGATATAATA TCTCAGGTTA CAAACAATAT TAAAGAGTAT
GATATACGTT CTCATGATGA TATTAGGAAG TTAGGTAAAG TATTTGTACA TTTTTCTGAA
GAAATGAATC AGTATAAAAT AGGCTTACAA AACTTTCTTA GAACTAAATT ATACAACTAC
TATAAAGTGA AAAGGGTAAA AAATAAAGTA AAGCGTATAA TAAAAGAATT ATTTCAAGTT
TTTTATGATG ATCCACAGAT TTTACCTTCT GATTGGGGTG TGAAAGCAAT GGATGCTAAT
TTAATAGATC GATCAATTGT AATTTGTGAT TTTATCTCAG GCATGACAGA TCGTTTTGCT
ATTCAAGAGC ATAGAAAAAT CTTTGATACA ACGTACGAGA TGTTGGTATT TTAG
 
Protein sequence
MLDLVKYACN PTETRGRIFH EEEDQYCDCY QRDRDRIIYS GAFRKLQYKT QVFINYENDY 
YRTRLTHSLE VAQIARSLSR KLRFNEDLTE AISLAHDLGH PPFGHAGEDA LNEMTMHHLG
FDHNIQALRI LTFLEKRYIK FDGMNLTWET LEGVAKHNGP ITGENRINSN KKIHKFMLDY
DSYYKLDLDN FSSAEAQIAS ISDDIAYNMH DIDDGIRAKI LVIEELLELP LIGDILKKVI
DDNSGLSVSD NRIVHEFLRR TVDIMLMDII SQVTNNIKEY DIRSHDDIRK LGKVFVHFSE
EMNQYKIGLQ NFLRTKLYNY YKVKRVKNKV KRIIKELFQV FYDDPQILPS DWGVKAMDAN
LIDRSIVICD FISGMTDRFA IQEHRKIFDT TYEMLVF
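
The record lists translation table 11 and a GC content value for this gene. The following is a minimal Python sketch, not part of the original record, of how the protein sequence and GC percentage could be recomputed from the gene sequence. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The sketch assumes that table 11 uses the same codon assignments as the standard genetic code for elongation, and it does not handle alternative start codons (this gene starts with ATG, so none is needed here).

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11
# for every codon after the start codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence shown above.
print(translate("ATGTTAGATC TAGTAAAATA TGCATGTAAT"))  # prints "MLDLVKYACN"
```

Passing the full gene sequence to `translate` and `gc_percent` would allow the result to be compared against the protein sequence and the GC content listed in this record.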