Gene ECH_1074 details


Gene Information

Locus tag: ECH_1074
Symbol: purH
ID: 3927717
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1103262
End bp: 1104776
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 31%
IMG OID: 637902188
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_507859
Protein GI: 88657596
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
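
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; neither assumption is stated in the record, but both are consistent with the listed values. The function names are illustrative only.

```python
# Values copied from the Gene Information fields above.
START_BP = 1103262
END_BP = 1104776
GENE_LENGTH_BP = 1515
PROTEIN_LENGTH_AA = 504


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Both checks pass for this record under the stated assumptions.
assert span_length(START_BP, END_BP) == GENE_LENGTH_BP
assert coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```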


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAATAA AAAGAGCAAT TATATCTGTA TATAATAAAA CAAACATTAT AGAACTTGCA 
AAATTTTTAA TTGAACAAAA GGTAGAAATT ATTGCAACAA GTAGTACTTA TAAAGCTTTA
TTAGATGCAG GATTACAAGT AACAGAAGTA TCTAGTTACA CAAAGTTCCC AGAAATAATG
GATGGGAGAG TAAAAACTTT ACATCCAAAA ATTCATGGTG GAATATTAAG TAATCGCAAC
ACTCATGCTG CAGAATGTCT AAAGCTAGAC ATACATGATA TAGATTTAGT GATTGTGAAT
TTATATCCCT TCTCTCAAGT TGCAAATAAT AAAACTTCAA CCGAAAATGA GATAATAGAA
CAAATAGATA TAGGGGGAGT TACTTTATTA AGAGCTGGAG CAAAAAACTT CCACAATGTA
ACAGTAATAT CAGATATCAA TGACTATGAT ACATTAAAGG CAGAAATGAT TAATAATCAA
AATTCTACAA ACTTAACATA TAGAAAACAA TTAGCAAAAA AAGCATTTGC TATAACTTCA
TCTTATGATA GTAGCATATA TAATTGGTTA AATAAAGATA GTAAAGATAT ACTACCTGAA
ACGCTTGTCA TACATGGAAA TAAGGTACAG GATCTCAGAT GCGGGGAAAA TCCACATCAA
AAAGGTGCGT TTTACAGTAC TTTAGGGGAT AAATACCCTC TCAAACAATT ACATGGAAAA
GAATTAAGCT ACAACAATAT AGTAGATATA GAATCTGCTA TAAACATAGT GACAGAATTT
ACGCAACCAG CAGCAGTGAT TATAAAACAT AGTAATCCTT GTGGTGCAGC AACTGCTGAT
GACATTACAA CTGCATACAA CAAAGCTTTT GCTTGTGATC CAAAAAGCAG TTTTGGTGGT
ATTGTTGCAT TAAATAGAGA AATCAACGAA GATATAGCAG AGGAAATCAA TAAAATTTTC
ATCGAAGTAA TAGTTGGTCC ATCTATTACA GACAAAGCAA TGGAAATTAT ACAAAAGAAG
AAAAATGTGC GTATTATGCT GTCCACAGAG TATAATCCAC CAAAATATAT AATTAAAAAT
GTCAGTAACG GATTTCTACT ACAAGAAAGT AATACAAACC AATTATCAGA GAAAGACTTA
ATACAAGTTA CAAATTTCCC AGTATCAGAT GATACTATTT CTAATCTATT ATTTGCTTGG
AAAATATGTA AACATGTAAA ATCCAATGCA ATTGTTATTG CAAAAGATCA CCGTGCAATT
GGAGTTGGGG CTGGTCAAAT GAGCCGTGTT GATAGTCTTG AAATTGCAAT AAAAAAAGCC
CAAGATTGCG CAGGAGCAGT TTTAGCGTCC GACGCATTTT TTCCCTTTAC AGACAGTATA
TTACTAAGTG CATCTGTAAA TATAAGTGCA ATAATTCAAC CTGGAGGATC ACTAAGAGAT
CAAGAAGTCA TAGAAGAAGC AAACAATAAA AAGATTGCTA TGTTTTTCAC CAACATTAGA
AACTTTTATC ACTAA
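
The GC content field of the record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown above; `gc_percent` is a hypothetical helper name, and how the database rounds its reported percentage is not known.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Usage: paste the full gene sequence in place of this first line.
first_line = "ATGAAAATAA AAAGAGCAAT TATATCTGTA TATAATAAAA CAAACATTAT AGAACTTGCA"
print(f"{gc_percent(first_line):.1f}%")
```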
 
Protein sequence
MKIKRAIISV YNKTNIIELA KFLIEQKVEI IATSSTYKAL LDAGLQVTEV SSYTKFPEIM 
DGRVKTLHPK IHGGILSNRN THAAECLKLD IHDIDLVIVN LYPFSQVANN KTSTENEIIE
QIDIGGVTLL RAGAKNFHNV TVISDINDYD TLKAEMINNQ NSTNLTYRKQ LAKKAFAITS
SYDSSIYNWL NKDSKDILPE TLVIHGNKVQ DLRCGENPHQ KGAFYSTLGD KYPLKQLHGK
ELSYNNIVDI ESAINIVTEF TQPAAVIIKH SNPCGAATAD DITTAYNKAF ACDPKSSFGG
IVALNREINE DIAEEINKIF IEVIVGPSIT DKAMEIIQKK KNVRIMLSTE YNPPKYIIKN
VSNGFLLQES NTNQLSEKDL IQVTNFPVSD DTISNLLFAW KICKHVKSNA IVIAKDHRAI
GVGAGQMSRV DSLEIAIKKA QDCAGAVLAS DAFFPFTDSI LLSASVNISA IIQPGGSLRD
QEVIEEANNK KIAMFFTNIR NFYH
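
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is one way to do this, not necessarily how the database derived it. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; table 11 differs in its permitted start codons, which does not matter here because this gene begins with ATG. `translate` and `CODON_TABLE` are illustrative names.

```python
from itertools import product

# Codon table in T, C, A, G order; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    if len(bases) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(bases), 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# The first line of the gene sequence yields the first 20 residues of the protein.
first_line = "ATGAAAATAA AAAGAGCAAT TATATCTGTA TATAATAAAA CAAACATTAT AGAACTTGCA"
assert translate(first_line) == "MKIKRAIISVYNKTNIIELA"
```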