Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_0342 |
Symbol | purM |
ID | 3927541 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | - |
Start bp | 333011 |
End bp | 334039 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 637901466 |
Product | phosphoribosylaminoimidazole synthetase |
Protein accession | YP_507162 |
Protein GI | 88657860 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0150] Phosphoribosylaminoimidazole (AIR) synthetase |
TIGRFAM ID | [TIGR00878] phosphoribosylaminoimidazole synthetase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAT ACTCAAGCGC TGGTGTCAAT ATTGAATCTG GAAACAGTCT AGTAGAAAAA ATAAAACCAA TTGCATCATC AACATCTATA CCAGGGGTAG TTGACAGTAT AGGGTCCTTT GGAGCACTTT TTGACATATC AAGAATAAAA GAATACAACA AACCCGTCCT AGTATCTTCA ACTGATGGAG TTGGCACAAA GTTATGTATT GCACAAGGAG TAGATAATCA TAAAACTATT GGTATAGATC TCGTAGCAAT GTGTGTTAAT GATGTTTTAG CACAGGGAGC AGAACCATTA TTTTTCCTAG ATTATTTTGC AACAGGTAAA GTAAACCATG ATACTGCTTT AGAAATCATT AATAGCATAG CAGTAGGATG TAAAAAAGCA AATGTAGCAT TAATAGGTGG AGAAACCGCT GAAATGCCTG GAATGTACAG TGACAATAAA TATGATTTAG CTGGTTTTGC AGTAGGCATT ATCGAAGCAG ACAACATTTT ACCAAAAAGT CATAATATTA AAGTTGGAGA TAAAATACTT GGCTTAGCTT CAAGTGGATT ACATTCCAAT GGTTTTTCAC TAATCAGAAA AATTATCAGT GACAATAAAA TTAACTATCA TGATGTATGT CCTTGGTCTA ATCAAACATG GGCAAACTAT TTGCTGACGC CTACACGCAT CTATGTAAAA TCTTTATTAT CTACTATTCC ATTAGTGAAT GGTTTAGCTC ATATTACTGG AGGAGGATTC ACATATAACA TACCACGTAT TATTCCTAGC CATTTGTCAG CAACTATTGA CTTAAGTTCT TGGAAAATGC CAGAAATATT TCATTGGCTT AATACTGAAG TACAAATACA GCAAACAGAA CTACTCAAAA CATTCAATTG TGGTATTGGT ATGATACTGG TCACTTCACA AGAAAATGAA GATCAAGTAT TGTCACTCCT CAAAGTTACA GATGAAGTTG TATACAAAAT TGGAGAAATC ACAGAAAGAA ATACTGAAGA TCAAGTAATA TTCAAATAA
|
Protein sequence | MKTYSSAGVN IESGNSLVEK IKPIASSTSI PGVVDSIGSF GALFDISRIK EYNKPVLVSS TDGVGTKLCI AQGVDNHKTI GIDLVAMCVN DVLAQGAEPL FFLDYFATGK VNHDTALEII NSIAVGCKKA NVALIGGETA EMPGMYSDNK YDLAGFAVGI IEADNILPKS HNIKVGDKIL GLASSGLHSN GFSLIRKIIS DNKINYHDVC PWSNQTWANY LLTPTRIYVK SLLSTIPLVN GLAHITGGGF TYNIPRIIPS HLSATIDLSS WKMPEIFHWL NTEVQIQQTE LLKTFNCGIG MILVTSQENE DQVLSLLKVT DEVVYKIGEI TERNTEDQVI FK
|
| |