Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0937 |
Symbol | argH |
ID | 3927223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 957508 |
End bp | 958920 |
Gene Length | 1413 bp |
Protein Length | 470 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637902053 |
Product | argininosuccinate lyase |
Protein accession | YP_507726 |
Protein GI | 88657716 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.941991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAATC CTTTATGGGG AGGAAGGTTT ACTGTATCCC CCAGTGACAT TATGAAAAAG ATTAATGAAT CAATATCGTT TGACAAAATA CTATATGAAG AAGATATATC TGGGTCAATA GCACACTGTA AAATGTTAGT TAACCAAAAA ATCATTAGCA AATATGAAGG TCAACTTATT ATTCATGGAC TAGAAGTTAT ACAAAACCAA ATTTCATCTG GCACTTTTGA ATTCAGCACA GACCTAGAAG ACATACACAT GAACATAGAA CACCACTTAA AGAAAATGAT AGGTAACATT GCAGGAAAGT TGCATACTGC AAGATCTCGT AATGATCAAG TTGCAACAGA TTTTAAACTT TGGATACGGA AATCAATAGT AAAATTAGAA ACGCTATTAC ATGAATTACA ACAGACTATA CTTAATATAG CTGAAGCTAA TTACGATACT ATCATGCCAG GATTTACACA CTTACAAATT GCTCAACCTG TAACATTAGG TCATCATTTA ATGGCATATT TTGAAATGTT AAAAAGAGAC TGTTCACGCT GGCAAGATTT ACACAAACGC ATGAATCAAT GTCCTGCAGG ATCTGCAGCA TTAGCAGGAA CATCTTTTCC AATAGACAGA CATTTCATCG CACAAGAACT AAAATTTGAC AGCCCAACAG AAAATTCTAT AGATGCAGTA TCAGACAGAG ACTATGTTAT TGAATTTTTA TCAAATGCTT CAATATGCAT AATGCATTTA TCAAGGTTAG CAGAAGAAAT TATACTTTGG TGCAGCTACA ATTTTAAGTT TATAACACTT TCCGATAATA TCACAACCGG AAGTTCAATA ATGCCACAAA AGAAAAACCC AGATGCAGCA GAACTTATCA GAGGAAAAAC TGGAAGGATT TTTGCATCAT TAAACCAAAT ATTAGTCGTC ATGAAAGGAC TACCACTAGC ATATAGCAAA GATATGCAAG AAGACAAAGA ACCTGTCTTT GATGCAGCAA ACAACTTAAT GTTATGTATA GAAGCAATGA ACAGCATGTT AAACAATATT ACCATTAACA AAAGTAATAT GCTAAAAGCA GCAGAGCATG ACTATTCAAC AGCAACAGAT CTTGCAGACT GGCTAGTCAA AAATCTTAAT CTTTCATTTA GAGAATCTCA TGAAACTACT GGACAAATAG TCAAGTTAGC AGAGCAAAAC CACTGTAAAC TACATGAATT AACTCTAGAA CAAATGAAAA CGATCATCCC TTCTATAACT GAAGACGTCT TTTCAATATT ATCGGTAAAA AACTCAGTAG ACAGTAGAAC GAGCTATGGA GGAACTGCTC CTGCAAATGT AATCGAAGCA ATAAAAAGAG GAAAGTTATA TCTCAGCAAT ATTACTACTT TACATTCAGA AAACAATATG TAA
|
Protein sequence | MKNPLWGGRF TVSPSDIMKK INESISFDKI LYEEDISGSI AHCKMLVNQK IISKYEGQLI IHGLEVIQNQ ISSGTFEFST DLEDIHMNIE HHLKKMIGNI AGKLHTARSR NDQVATDFKL WIRKSIVKLE TLLHELQQTI LNIAEANYDT IMPGFTHLQI AQPVTLGHHL MAYFEMLKRD CSRWQDLHKR MNQCPAGSAA LAGTSFPIDR HFIAQELKFD SPTENSIDAV SDRDYVIEFL SNASICIMHL SRLAEEIILW CSYNFKFITL SDNITTGSSI MPQKKNPDAA ELIRGKTGRI FASLNQILVV MKGLPLAYSK DMQEDKEPVF DAANNLMLCI EAMNSMLNNI TINKSNMLKA AEHDYSTATD LADWLVKNLN LSFRESHETT GQIVKLAEQN HCKLHELTLE QMKTIIPSIT EDVFSILSVK NSVDSRTSYG GTAPANVIEA IKRGKLYLSN ITTLHSENNM
|
| |