Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH_1074 |
Symbol | purH |
ID | 3927717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1103262 |
End bp | 1104776 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 31% |
IMG OID | 637902188 |
Product | phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
Protein accession | YP_507859 |
Protein GI | 88657596 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful) |
TIGRFAM ID | [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAATAA AAAGAGCAAT TATATCTGTA TATAATAAAA CAAACATTAT AGAACTTGCA AAATTTTTAA TTGAACAAAA GGTAGAAATT ATTGCAACAA GTAGTACTTA TAAAGCTTTA TTAGATGCAG GATTACAAGT AACAGAAGTA TCTAGTTACA CAAAGTTCCC AGAAATAATG GATGGGAGAG TAAAAACTTT ACATCCAAAA ATTCATGGTG GAATATTAAG TAATCGCAAC ACTCATGCTG CAGAATGTCT AAAGCTAGAC ATACATGATA TAGATTTAGT GATTGTGAAT TTATATCCCT TCTCTCAAGT TGCAAATAAT AAAACTTCAA CCGAAAATGA GATAATAGAA CAAATAGATA TAGGGGGAGT TACTTTATTA AGAGCTGGAG CAAAAAACTT CCACAATGTA ACAGTAATAT CAGATATCAA TGACTATGAT ACATTAAAGG CAGAAATGAT TAATAATCAA AATTCTACAA ACTTAACATA TAGAAAACAA TTAGCAAAAA AAGCATTTGC TATAACTTCA TCTTATGATA GTAGCATATA TAATTGGTTA AATAAAGATA GTAAAGATAT ACTACCTGAA ACGCTTGTCA TACATGGAAA TAAGGTACAG GATCTCAGAT GCGGGGAAAA TCCACATCAA AAAGGTGCGT TTTACAGTAC TTTAGGGGAT AAATACCCTC TCAAACAATT ACATGGAAAA GAATTAAGCT ACAACAATAT AGTAGATATA GAATCTGCTA TAAACATAGT GACAGAATTT ACGCAACCAG CAGCAGTGAT TATAAAACAT AGTAATCCTT GTGGTGCAGC AACTGCTGAT GACATTACAA CTGCATACAA CAAAGCTTTT GCTTGTGATC CAAAAAGCAG TTTTGGTGGT ATTGTTGCAT TAAATAGAGA AATCAACGAA GATATAGCAG AGGAAATCAA TAAAATTTTC ATCGAAGTAA TAGTTGGTCC ATCTATTACA GACAAAGCAA TGGAAATTAT ACAAAAGAAG AAAAATGTGC GTATTATGCT GTCCACAGAG TATAATCCAC CAAAATATAT AATTAAAAAT GTCAGTAACG GATTTCTACT ACAAGAAAGT AATACAAACC AATTATCAGA GAAAGACTTA ATACAAGTTA CAAATTTCCC AGTATCAGAT GATACTATTT CTAATCTATT ATTTGCTTGG AAAATATGTA AACATGTAAA ATCCAATGCA ATTGTTATTG CAAAAGATCA CCGTGCAATT GGAGTTGGGG CTGGTCAAAT GAGCCGTGTT GATAGTCTTG AAATTGCAAT AAAAAAAGCC CAAGATTGCG CAGGAGCAGT TTTAGCGTCC GACGCATTTT TTCCCTTTAC AGACAGTATA TTACTAAGTG CATCTGTAAA TATAAGTGCA ATAATTCAAC CTGGAGGATC ACTAAGAGAT CAAGAAGTCA TAGAAGAAGC AAACAATAAA AAGATTGCTA TGTTTTTCAC CAACATTAGA AACTTTTATC ACTAA
|
Protein sequence | MKIKRAIISV YNKTNIIELA KFLIEQKVEI IATSSTYKAL LDAGLQVTEV SSYTKFPEIM DGRVKTLHPK IHGGILSNRN THAAECLKLD IHDIDLVIVN LYPFSQVANN KTSTENEIIE QIDIGGVTLL RAGAKNFHNV TVISDINDYD TLKAEMINNQ NSTNLTYRKQ LAKKAFAITS SYDSSIYNWL NKDSKDILPE TLVIHGNKVQ DLRCGENPHQ KGAFYSTLGD KYPLKQLHGK ELSYNNIVDI ESAINIVTEF TQPAAVIIKH SNPCGAATAD DITTAYNKAF ACDPKSSFGG IVALNREINE DIAEEINKIF IEVIVGPSIT DKAMEIIQKK KNVRIMLSTE YNPPKYIIKN VSNGFLLQES NTNQLSEKDL IQVTNFPVSD DTISNLLFAW KICKHVKSNA IVIAKDHRAI GVGAGQMSRV DSLEIAIKKA QDCAGAVLAS DAFFPFTDSI LLSASVNISA IIQPGGSLRD QEVIEEANNK KIAMFFTNIR NFYH
|
| |