Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0125 |
Symbol | gshA |
ID | 3927846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 113261 |
End bp | 114493 |
Gene Length | 1233 bp |
Protein Length | 410 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 637901249 |
Product | glutamate--cysteine ligase |
Protein accession | YP_506953 |
Protein GI | 88658400 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02049] glutamate--cysteine ligase, T. ferrooxidans family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGTAA TTATTGATAC ATTAAATGAT ATATTAACAA AATATAAGTT AGATATAGAG AATTGGTTTT TTAATAAATT TACTAAGTAT AACCCGGTTT TGAATATTTC GGTTGATTTA AGAGTATCTG ATTATAAGAT TGCTCCGGTA GATACTAATA TTTTTCCTGC AGGCTATAAT AATTTTACTG AGCAATCCCA AATTTATGCT TCTCGATTGT TAAGGGAATA TGTAGTTAAT TATGCGAACT GTAGCAAAAT TTTAATTATA GCAGAGAATC ATACAAGAAA TTTAAAATAT ATTGATAGTT TGATTGTTTT AAAAAATATA GTTAATAATG CAGGTTTTGT TGTTGAAGTA GGGATATGTA ATATAAATCA AAATATAGAA CTGATTTCAT CAACAGGACG TGTGATAAAT TGTTTATGTC TTACTAATGA TAATGGTGTA CTTCGTGCTG GATGTAGGTT TATTCCTGAT CTTATTTTAG TCAATAATGA TATGACTAGT GGGATTCCTG AAGTACTACA AGGTTTAAAA TATCAAAGTA TTATGCCATC TTTATTTTTG GGATGGTTTA ATAGAAGTAA ATCTAATCAT TTTTCTATTT ATAAAAAGTT ATCCAAAGAG TTTTGTGAGA GTTTTAATAT TGATCCTTGG TTAATTTCAG CATTTTTCTC TAGTTGTAGT AATATTTGTT TTTTCAACAG TCAAGGAATT GATGATATTG CTAATGAAGT GGATGTAGTT ATTAGCAAAA TACGTAATAA ATTCCAATTA TATAGTATTA AGGAACAGCC ATATGTGTTT GTGAAAGCTG ATAATGGAAC TTATGGTATG GGAATATTAG TAGCTTATTG TGGAGATGAT ATCTTAATGC TTAATAGGAA AAAGCGTAAT AAAATGAAAA AGATTAAAGA TGGTAATGTT GTCAGTAGTG TAATAATACA GGAAGGTATT ACTACTAGAG AGATCTTTAA TGGTTATGTA GCTGAGCCAT TAGTTTATTT TATAGGGCAT ACTCCTTCAT GTTACTTATA CAGGTATCAT TCTGTAAAGG ATAGATTTTC TAATTTAAAT TCTGTAGGCT GTGACTTTAT AGATATAAGT TATAAACAGC AAGACATATT GTATTGGAAT ATAATTGGAA AAATAGCTGT TTTAGCTGCT GCAATTGAGA TGCATGAGAT ATCAAATGTT AATGTAATGG AGCAAAATTG CTTACTAAGT TAA
|
Protein sequence | MTVIIDTLND ILTKYKLDIE NWFFNKFTKY NPVLNISVDL RVSDYKIAPV DTNIFPAGYN NFTEQSQIYA SRLLREYVVN YANCSKILII AENHTRNLKY IDSLIVLKNI VNNAGFVVEV GICNINQNIE LISSTGRVIN CLCLTNDNGV LRAGCRFIPD LILVNNDMTS GIPEVLQGLK YQSIMPSLFL GWFNRSKSNH FSIYKKLSKE FCESFNIDPW LISAFFSSCS NICFFNSQGI DDIANEVDVV ISKIRNKFQL YSIKEQPYVF VKADNGTYGM GILVAYCGDD ILMLNRKKRN KMKKIKDGNV VSSVIIQEGI TTREIFNGYV AEPLVYFIGH TPSCYLYRYH SVKDRFSNLN SVGCDFIDIS YKQQDILYWN IIGKIAVLAA AIEMHEISNV NVMEQNCLLS
|
| |