Gene Information |
Locus tag | ECH_1017 |
Symbol | argC |
ID | 3927783 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 1042055 |
End bp | 1043095 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 637902132 |
Product | N-acetyl-gamma-glutamyl-phosphate reductase |
Protein accession | YP_507803 |
Protein GI | 88657913 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0002] Acetylglutamate semialdehyde dehydrogenase |
TIGRFAM ID | [TIGR01850] N-acetyl-gamma-glutamyl-phosphate reductase, common form |
|
|
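The coordinate and length fields above can be cross-checked against one another. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single untranslated stop codon; neither convention is stated in the record, so treat both as assumptions. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1042055
end_bp = 1043095
reported_gene_length_bp = 1041
reported_protein_length_aa = 346

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_length_aa = codon_count - 1

print("gene length (bp):", gene_length_bp)
print("expected protein length (aa):", expected_protein_length_aa)
print("matches record:",
      gene_length_bp == reported_gene_length_bp
      and expected_protein_length_aa == reported_protein_length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.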
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTTATC AAGTATCAGT TGCTGTAGTT GGGGCTACAG GTTATGTAGG GGTAGAGTTA GTACGTTTGT TATTATCTCA TCCTATGGTT AAGATAAAGT ATTTATGTGC AACTCAATCT AGTGGAAAGT TACTTTCTTC TAATTATTTC CACATTTCGC AGGATGATAT ATCCGTTAAT ATTTCGTCTT TTGATGATAT TGACTTATCT AAAGTAGATG TGGTTTTTTT GTGCTTACCT CATGGTACAT CAAGTGAAGT TGTAAGAAAA ATTCATGATG TAGTAAGAAT TATAGATTTA TCAGCTGATT TTAGAATTAA GGATGCTGAA GTATATAAAC AATGGTATGG CTCACATTGT TGTCCAGATC TTGTAAGAGA TTTTGTATAC GGGTTAACGG AGATATATTG GGAAGATATT CAAAGGTCAA GGTTTATAGC TTGTCCAGGA TGTTATCCTA CCTCTGTGCT AATACCATTA TTTCCATTGT TAAGACTTTG TTTAATAAAA AGTCAGGGTA TAATAGTTGA TGCTAAATCA GGTGTGAGTG GTGCTGGTAG GTCTGTAAAG CAGGATAAGT TGTTCTGTGA AGTTTATGAT GTTATTAAAT CGTATAAAAT TTCAGACCAT AGACATATTC CTGAAATAGA GCAAGAGCTT TGTTTTGCTG CCTGTAGAGA AGACATTAAT TTACAATTTG TGCCTAATTT AATTCCTGTC AAAAGAGGTA TGATGTCTAG TATATACCTT GAGTTAGAAG AAGGCGTATC GCTTACTGAT GTCCGTGAAG CATTGTTGCT TTTTTACAAA GATTCATCTT TTGTTTTTAT TGATGAAGAG AAAGCTATGA CAACTAGGTC TGTTGTGGGT ACGAATTATT GTTATTTAGG TGTTTTTCCT GGAAGGGTAC CTAACACGAT TATTATCATG TCTGTTATAG ATAATTTATT AAAAGGTGCA GCTGGTCAAG CAGTGCAAAA TTTTAATGTT ATGATGTCTT ATGATGAGAA AATTGCTTTG TCAAATATTC CTTATTTTTA A
|
Protein sequence | MSYQVSVAVV GATGYVGVEL VRLLLSHPMV KIKYLCATQS SGKLLSSNYF HISQDDISVN ISSFDDIDLS KVDVVFLCLP HGTSSEVVRK IHDVVRIIDL SADFRIKDAE VYKQWYGSHC CPDLVRDFVY GLTEIYWEDI QRSRFIACPG CYPTSVLIPL FPLLRLCLIK SQGIIVDAKS GVSGAGRSVK QDKLFCEVYD VIKSYKISDH RHIPEIEQEL CFAACREDIN LQFVPNLIPV KRGMMSSIYL ELEEGVSLTD VREALLLFYK DSSFVFIDEE KAMTTRSVVG TNYCYLGVFP GRVPNTIIIM SVIDNLLKGA AGQAVQNFNV MMSYDEKIAL SNIPYF
|
| |
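Although the gene lies on the minus strand, the gene sequence shown begins with ATG and ends with TAA, so it appears to be listed in coding orientation already. The following sketch translates the first and last few codons of that sequence and compares them with the ends of the protein sequence. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `translate` and `gc_fraction` are hypothetical, not part of any database tooling. Only short fragments are used here; pasting the full sequence with its spaces removed should work the same way.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# Assumption: table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, ignoring spaces; '*' marks a stop."""
    dna = dna.replace(" ", "").upper()
    return "".join(
        CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - len(dna) % 3, 3)
    )


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 21 bases, copied from the Gene sequence field.
head = "ATGAGTTATC AAGTATCAGT TGCTGTAGTT"
tail = "TCAAATATTC CTTATTTTTA A"

head_protein = translate(head)
tail_protein = translate(tail)

print(head_protein)  # compare with the start of the Protein sequence field
print(tail_protein)  # compare with the end of the Protein sequence field
print(round(gc_fraction(head), 3))  # GC fraction of the 30-base fragment only
```

The GC value printed here describes only the 30-base fragment, not the whole gene; the record's GC content field refers to the full sequence.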