Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0649 |
Symbol | |
ID | 3927526 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 652059 |
End bp | 653075 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637901771 |
Product | pyridine nucleotide-disulphide oxidoreductase family protein |
Protein accession | YP_507458 |
Protein GI | 88657729 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0492] Thioredoxin reductase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAGATT ATGTAACTGA TATTGCGGTA ATAGGAGCTG GTCCTGTTGG GATATTTACT GTATTTCAGG CTGGTATGTT GAAAATGCGA TGTTGTGTTA TTGATGCATT AAGTGAAATT GGTGGGCAAT GTCTTGCATT GTATCCAGAG AAACCGATAT ATGATATTCC TGGATATCCA GTAATTAATG GTAAGGAATT AATAGATAGT TTAAAAAAGC AGTCTGAGCC TTTTAATCCT CAATATTTGT TAGGACAAGT TGCTGAAAAA ATAGAAGATT ACTCAGATTA TTTTTTGATA AGAACTACAA CAGGAATTGT AGTACAAAGT AAAGTTATTA TCATTGCTGC TGGAGCTGGA GCATTTGGTC CAAATCGTCT TCCTATAGAT AATATTCTTG ATTATGAGAA TAAATCAGTA TTTTACCAAG TGAGAAAGGT TTCAGATTTT TGTGATAAAA ATATTATGAT AGCAGGAGGA GGTGACTCAG CTGCTGATTG GGCAGTTGAG CTTTCTAAGG TTGCTAAACA GTTATATGTA GTACATAGAA GGAAAAATTT TCGTTGTGCT CCTAATACTG CATTGCAGAT GGATAATTTA TCACAGAGTG GAAAAATAAA GATTATTGTT CCATATCAAG TTAAAAAATT ATGTGGTGAA AATGGTAAAC TGCATTCTGT AATTGTTAAG AATATTACGA ATCATGAAGA AATGGCGCTA CAAGTTGATT ATTTATTTCC ATTTTTTGGT ACATCTGCAA ATCTTGGTCC TATATTGAAT TGGGGAATGG AAGTAAAAAA CTATCAAATT CTTGTTAATG CTGAGACTTG TCTAACAAAT CGCAATAGAA TATATGCAGT TGGTGATATA GCTACATATC CAGGAAAACT TAAGTTAATA CTTACAGGAT TTTCAGAGGC TGCAATGGCA TGTCATCATA TATATCATGT AATATACCCT AATTCTCCGT TAAATTTTCA ATATTCTACT TCAAAAGGTA TACCAGAAAA TTGTTAG
|
Protein sequence | MTDYVTDIAV IGAGPVGIFT VFQAGMLKMR CCVIDALSEI GGQCLALYPE KPIYDIPGYP VINGKELIDS LKKQSEPFNP QYLLGQVAEK IEDYSDYFLI RTTTGIVVQS KVIIIAAGAG AFGPNRLPID NILDYENKSV FYQVRKVSDF CDKNIMIAGG GDSAADWAVE LSKVAKQLYV VHRRKNFRCA PNTALQMDNL SQSGKIKIIV PYQVKKLCGE NGKLHSVIVK NITNHEEMAL QVDYLFPFFG TSANLGPILN WGMEVKNYQI LVNAETCLTN RNRIYAVGDI ATYPGKLKLI LTGFSEAAMA CHHIYHVIYP NSPLNFQYST SKGIPENC
|
| |