Gene ECH_0649 details


Gene Information

Locus tag: ECH_0649
Symbol:
ID: 3927526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 652059
End bp: 653075
Gene Length: 1017 bp
Protein Length: 338 aa
Translation table: 11
GC content: 33%
IMG OID: 637901771
Product: pyridine nucleotide-disulphide oxidoreductase family protein
Protein accession: YP_507458
Protein GI: 88657729
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0492] Thioredoxin reductase
TIGRFAM ID:
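
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable names are not part of the record, and it assumes the start and end positions are 1-based and inclusive.

```python
# Values copied from the gene record; variable names are illustrative.
start_bp = 652059
end_bp = 653075
protein_length_aa = 338

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# A CDS of N codons encodes N - 1 residues, because the final codon is a stop.
codon_count = gene_length_bp // 3
expected_protein_length = codon_count - 1

print(gene_length_bp)           # 1017
print(gene_length_bp % 3)       # 0
print(expected_protein_length)  # 338
```

Under that assumption the 1017 bp gene length corresponds to 339 codons, which agrees with the 338 aa protein length plus one stop codon.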


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGATT ATGTAACTGA TATTGCGGTA ATAGGAGCTG GTCCTGTTGG GATATTTACT 
GTATTTCAGG CTGGTATGTT GAAAATGCGA TGTTGTGTTA TTGATGCATT AAGTGAAATT
GGTGGGCAAT GTCTTGCATT GTATCCAGAG AAACCGATAT ATGATATTCC TGGATATCCA
GTAATTAATG GTAAGGAATT AATAGATAGT TTAAAAAAGC AGTCTGAGCC TTTTAATCCT
CAATATTTGT TAGGACAAGT TGCTGAAAAA ATAGAAGATT ACTCAGATTA TTTTTTGATA
AGAACTACAA CAGGAATTGT AGTACAAAGT AAAGTTATTA TCATTGCTGC TGGAGCTGGA
GCATTTGGTC CAAATCGTCT TCCTATAGAT AATATTCTTG ATTATGAGAA TAAATCAGTA
TTTTACCAAG TGAGAAAGGT TTCAGATTTT TGTGATAAAA ATATTATGAT AGCAGGAGGA
GGTGACTCAG CTGCTGATTG GGCAGTTGAG CTTTCTAAGG TTGCTAAACA GTTATATGTA
GTACATAGAA GGAAAAATTT TCGTTGTGCT CCTAATACTG CATTGCAGAT GGATAATTTA
TCACAGAGTG GAAAAATAAA GATTATTGTT CCATATCAAG TTAAAAAATT ATGTGGTGAA
AATGGTAAAC TGCATTCTGT AATTGTTAAG AATATTACGA ATCATGAAGA AATGGCGCTA
CAAGTTGATT ATTTATTTCC ATTTTTTGGT ACATCTGCAA ATCTTGGTCC TATATTGAAT
TGGGGAATGG AAGTAAAAAA CTATCAAATT CTTGTTAATG CTGAGACTTG TCTAACAAAT
CGCAATAGAA TATATGCAGT TGGTGATATA GCTACATATC CAGGAAAACT TAAGTTAATA
CTTACAGGAT TTTCAGAGGC TGCAATGGCA TGTCATCATA TATATCATGT AATATACCCT
AATTCTCCGT TAAATTTTCA ATATTCTACT TCAAAAGGTA TACCAGAAAA TTGTTAG
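
The gene sequence is printed in space-separated blocks of ten bases, so it needs to be joined before its length or GC content can be checked. The following is a minimal sketch; the function names `join_blocks` and `gc_fraction` are hypothetical, and only the first 60 bases of the gene are used as sample input.

```python
def join_blocks(blocked_sequence):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked_sequence.split()).upper()


def gc_fraction(sequence):
    """Return the fraction of bases that are G or C (0.0 for empty input)."""
    if not sequence:
        return 0.0
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


# First line of the gene sequence, as printed in the record.
first_line = "ATGACAGATT ATGTAACTGA TATTGCGGTA ATAGGAGCTG GTCCTGTTGG GATATTTACT"

sequence = join_blocks(first_line)
print(len(sequence))                       # 60
print(f"{gc_fraction(sequence):.0%} GC")   # GC content of this fragment only
```

Passing the full block-formatted gene sequence to the same functions would allow the reported gene length and GC content to be compared against the sequence itself.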
 
Protein sequence
MTDYVTDIAV IGAGPVGIFT VFQAGMLKMR CCVIDALSEI GGQCLALYPE KPIYDIPGYP 
VINGKELIDS LKKQSEPFNP QYLLGQVAEK IEDYSDYFLI RTTTGIVVQS KVIIIAAGAG
AFGPNRLPID NILDYENKSV FYQVRKVSDF CDKNIMIAGG GDSAADWAVE LSKVAKQLYV
VHRRKNFRCA PNTALQMDNL SQSGKIKIIV PYQVKKLCGE NGKLHSVIVK NITNHEEMAL
QVDYLFPFFG TSANLGPILN WGMEVKNYQI LVNAETCLTN RNRIYAVGDI ATYPGKLKLI
LTGFSEAAMA CHHIYHVIYP NSPLNFQYST SKGIPENC
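
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below builds the codon table from the NCBI amino-acid string, which translation table 11 shares with the standard code for codon-to-residue assignments (the two tables differ in their permitted start codons). The `translate` helper is hypothetical and stops at the first stop codon; the two inputs are the first 30 and last 12 bases of the gene sequence above.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G, first base slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


gene_start = "ATGACAGATT ATGTAACTGA TATTGCGGTA"  # first 30 bases of the gene
gene_end = "GAAAATTGTTAG"                        # last 12 bases of the gene

print(translate(gene_start))  # MTDYVTDIAV
print(translate(gene_end))    # ENC
```

Both outputs match the first ten and last three residues of the protein sequence listed above.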