Gene ECH_1034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1034 
SymbolpurK 
ID3927737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1054606 
End bp1055691 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content31% 
IMG OID637902149 
Productphosphoribosylaminoimidazole carboxylase, ATPase subunit 
Protein accessionYP_507820 
Protein GI88657693 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACACG ATTCATACAT CGGCCCTGGA TCAACTATTG GGATTATAGG TGGAGGACAG 
TTAGGAAAAA TGATATCCAT TGCTGCAGCA AACTTAGGAT ATAAAACACA TTTATTAACT
AATAACCCGG ATGATCCATC AGTCTACATT ACTAACAGTG CAACTATATC ACACAATTAT
CAAAATACAG AATCATTGCT TGAGTTTGCA TCTAATGTCG ACATTGCAAC CTTAGAATTT
GAGAACATTC CTACTACTAC TATAGATATA TTATCACAAA AAATCAAAGT TTATCCAGGA
AAAGAAGCTT TATACATTTC CCAAAATAGA ATAAGAGAAA AACAGCACAT AAGAAATTTA
GGAATCAAAA CTTCAGATTT TAGAGTAATT GATAACTACA ACAGTTTAGT CAAAAATACT
ATAGAACTAG GATATCCCAC CTTACTTAAG ACCACAGAAT TAGGCTACGA TGGAAAAGGC
CAGTATATAA TAAAACAACA AGATGATTTA AGTGCTTTAT CAACTCTCAA TTGGGACCAA
TCATATATTT TAGAAAAATT TGTCAAAATT TATAAAGAAA TATCTGTTAT AATAACAAAA
AGCATCAGTG GTTCTATAGA ATTTTTTCCA ACTGCGGAAA ACTGTCATAC TGATGGTATT
TTAACCACAT CATCAGTACC AGCCCTAATC TCTCAAGAGA TAAATGTACA AGCACAAAAA
ATTGCTTTAC AAATTGCAGA ATCTATTAAT TTAGTAGGTT TATTAGCAGT GGAATTTTTC
ATAACAGATA CACAAGAACT TATAGTTAAT GAAATAGCTC CCCGCCCTCA CAACTCTGGA
CATTGGAGCT TAGATGCTTG TAACATCAGC CAATTTGAAC AATTAATAAG AGCAATATGT
GGATTACCTT TAAAGCCTGT AAAATTACTT TTTCCATGTA TTATGAATAA CATATTAGGA
GATAATATAC ACAACTATTA TAAACATGAA ACCAAGGTTA ATGAAAACTT ATACATATAC
GGCAAGAAAA AGGCCACTAA AAACAGAAAA ATGGGCCACA TTAACACATT AAAATTCAAC
CAGTAA
 
Protein sequence
MLHDSYIGPG STIGIIGGGQ LGKMISIAAA NLGYKTHLLT NNPDDPSVYI TNSATISHNY 
QNTESLLEFA SNVDIATLEF ENIPTTTIDI LSQKIKVYPG KEALYISQNR IREKQHIRNL
GIKTSDFRVI DNYNSLVKNT IELGYPTLLK TTELGYDGKG QYIIKQQDDL SALSTLNWDQ
SYILEKFVKI YKEISVIITK SISGSIEFFP TAENCHTDGI LTTSSVPALI SQEINVQAQK
IALQIAESIN LVGLLAVEFF ITDTQELIVN EIAPRPHNSG HWSLDACNIS QFEQLIRAIC
GLPLKPVKLL FPCIMNNILG DNIHNYYKHE TKVNENLYIY GKKKATKNRK MGHINTLKFN
Q