Gene ECH_0342 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_0342 
SymbolpurM 
ID3927541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp333011 
End bp334039 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content34% 
IMG OID637901466 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_507162 
Protein GI88657860 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACAT ACTCAAGCGC TGGTGTCAAT ATTGAATCTG GAAACAGTCT AGTAGAAAAA 
ATAAAACCAA TTGCATCATC AACATCTATA CCAGGGGTAG TTGACAGTAT AGGGTCCTTT
GGAGCACTTT TTGACATATC AAGAATAAAA GAATACAACA AACCCGTCCT AGTATCTTCA
ACTGATGGAG TTGGCACAAA GTTATGTATT GCACAAGGAG TAGATAATCA TAAAACTATT
GGTATAGATC TCGTAGCAAT GTGTGTTAAT GATGTTTTAG CACAGGGAGC AGAACCATTA
TTTTTCCTAG ATTATTTTGC AACAGGTAAA GTAAACCATG ATACTGCTTT AGAAATCATT
AATAGCATAG CAGTAGGATG TAAAAAAGCA AATGTAGCAT TAATAGGTGG AGAAACCGCT
GAAATGCCTG GAATGTACAG TGACAATAAA TATGATTTAG CTGGTTTTGC AGTAGGCATT
ATCGAAGCAG ACAACATTTT ACCAAAAAGT CATAATATTA AAGTTGGAGA TAAAATACTT
GGCTTAGCTT CAAGTGGATT ACATTCCAAT GGTTTTTCAC TAATCAGAAA AATTATCAGT
GACAATAAAA TTAACTATCA TGATGTATGT CCTTGGTCTA ATCAAACATG GGCAAACTAT
TTGCTGACGC CTACACGCAT CTATGTAAAA TCTTTATTAT CTACTATTCC ATTAGTGAAT
GGTTTAGCTC ATATTACTGG AGGAGGATTC ACATATAACA TACCACGTAT TATTCCTAGC
CATTTGTCAG CAACTATTGA CTTAAGTTCT TGGAAAATGC CAGAAATATT TCATTGGCTT
AATACTGAAG TACAAATACA GCAAACAGAA CTACTCAAAA CATTCAATTG TGGTATTGGT
ATGATACTGG TCACTTCACA AGAAAATGAA GATCAAGTAT TGTCACTCCT CAAAGTTACA
GATGAAGTTG TATACAAAAT TGGAGAAATC ACAGAAAGAA ATACTGAAGA TCAAGTAATA
TTCAAATAA
 
Protein sequence
MKTYSSAGVN IESGNSLVEK IKPIASSTSI PGVVDSIGSF GALFDISRIK EYNKPVLVSS 
TDGVGTKLCI AQGVDNHKTI GIDLVAMCVN DVLAQGAEPL FFLDYFATGK VNHDTALEII
NSIAVGCKKA NVALIGGETA EMPGMYSDNK YDLAGFAVGI IEADNILPKS HNIKVGDKIL
GLASSGLHSN GFSLIRKIIS DNKINYHDVC PWSNQTWANY LLTPTRIYVK SLLSTIPLVN
GLAHITGGGF TYNIPRIIPS HLSATIDLSS WKMPEIFHWL NTEVQIQQTE LLKTFNCGIG
MILVTSQENE DQVLSLLKVT DEVVYKIGEI TERNTEDQVI FK