Gene Information |
Locus tag | ECH_1034 |
Symbol | purK |
ID | 3927737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1054606 |
End bp | 1055691 |
Gene Length | 1086 bp |
Protein Length | 361 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902149 |
Product | phosphoribosylaminoimidazole carboxylase, ATPase subunit |
Protein accession | YP_507820 |
Protein GI | 88657693 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) |
TIGRFAM ID | [TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein |
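
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked against one another. The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the function names are hypothetical.

```python
# Coordinates copied from the record (Start bp / End bp).
start_bp = 1054606
end_bp = 1055691


def gene_length(start, end):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the last codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1086 361
```

Under these assumptions the computed values agree with the Gene Length (1086 bp) and Protein Length (361 aa) fields.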
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTACACG ATTCATACAT CGGCCCTGGA TCAACTATTG GGATTATAGG TGGAGGACAG TTAGGAAAAA TGATATCCAT TGCTGCAGCA AACTTAGGAT ATAAAACACA TTTATTAACT AATAACCCGG ATGATCCATC AGTCTACATT ACTAACAGTG CAACTATATC ACACAATTAT CAAAATACAG AATCATTGCT TGAGTTTGCA TCTAATGTCG ACATTGCAAC CTTAGAATTT GAGAACATTC CTACTACTAC TATAGATATA TTATCACAAA AAATCAAAGT TTATCCAGGA AAAGAAGCTT TATACATTTC CCAAAATAGA ATAAGAGAAA AACAGCACAT AAGAAATTTA GGAATCAAAA CTTCAGATTT TAGAGTAATT GATAACTACA ACAGTTTAGT CAAAAATACT ATAGAACTAG GATATCCCAC CTTACTTAAG ACCACAGAAT TAGGCTACGA TGGAAAAGGC CAGTATATAA TAAAACAACA AGATGATTTA AGTGCTTTAT CAACTCTCAA TTGGGACCAA TCATATATTT TAGAAAAATT TGTCAAAATT TATAAAGAAA TATCTGTTAT AATAACAAAA AGCATCAGTG GTTCTATAGA ATTTTTTCCA ACTGCGGAAA ACTGTCATAC TGATGGTATT TTAACCACAT CATCAGTACC AGCCCTAATC TCTCAAGAGA TAAATGTACA AGCACAAAAA ATTGCTTTAC AAATTGCAGA ATCTATTAAT TTAGTAGGTT TATTAGCAGT GGAATTTTTC ATAACAGATA CACAAGAACT TATAGTTAAT GAAATAGCTC CCCGCCCTCA CAACTCTGGA CATTGGAGCT TAGATGCTTG TAACATCAGC CAATTTGAAC AATTAATAAG AGCAATATGT GGATTACCTT TAAAGCCTGT AAAATTACTT TTTCCATGTA TTATGAATAA CATATTAGGA GATAATATAC ACAACTATTA TAAACATGAA ACCAAGGTTA ATGAAAACTT ATACATATAC GGCAAGAAAA AGGCCACTAA AAACAGAAAA ATGGGCCACA TTAACACATT AAAATTCAAC CAGTAA
|
Protein sequence | MLHDSYIGPG STIGIIGGGQ LGKMISIAAA NLGYKTHLLT NNPDDPSVYI TNSATISHNY QNTESLLEFA SNVDIATLEF ENIPTTTIDI LSQKIKVYPG KEALYISQNR IREKQHIRNL GIKTSDFRVI DNYNSLVKNT IELGYPTLLK TTELGYDGKG QYIIKQQDDL SALSTLNWDQ SYILEKFVKI YKEISVIITK SISGSIEFFP TAENCHTDGI LTTSSVPALI SQEINVQAQK IALQIAESIN LVGLLAVEFF ITDTQELIVN EIAPRPHNSG HWSLDACNIS QFEQLIRAIC GLPLKPVKLL FPCIMNNILG DNIHNYYKHE TKVNENLYIY GKKKATKNRK MGHINTLKFN Q
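
The gene and protein sequences above are related by the translation table given in the record (table 11). The following is a minimal sketch of how the space-separated gene sequence could be cleaned, translated, and checked for GC content. It assumes that the amino acid assignments of table 11 match the standard genetic code (the two differ in permitted start codons, which this sketch does not handle), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional TCAG ordering.
# Assumption: table 11 uses the same amino acid assignments as the
# standard code; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate complete codons; stop codons are shown as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence in the record.
fragment = "ATGCTACACG ATTCATACAT CGGCCCTGGA"
print(translate(fragment))  # prints: MLHDSYIGPG
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.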