Gene ECH_0369 details


Gene Information

Locus tag: ECH_0369
Symbol: pepA
ID: 3928040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 361011
End bp: 362513
Gene Length: 1503 bp
Protein Length: 500 aa
Translation table: 11
GC content: 35%
IMG OID: 637901493
Product: leucyl aminopeptidase
Protein accession: YP_507189
Protein GI: 88658513
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
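
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single untranslated stop codon; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 361011
end_bp = 362513
gene_length_bp = 1503
protein_length_aa = 500

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                  # True
print(gene_length_bp % 3 == 0)                    # True
print(expected_protein_aa == protein_length_aa)   # True
```

Under these assumptions the span (1503 bp), the reading frame (501 codons), and the protein length (500 aa) agree with one another.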


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATAGATA TATCGTTTTC AAGTTTAATG CCAGGGGTAT CTTTGTTTTT AAAGACAACA 
GCAATAGTTG TGGGTATCTT TGAAGGTAGT AATCACTTGG AAGACTATAG TGCCTTAGCT
GAGCGTAGTG AGCAAATTAT GAAGGTGGTA GAAGGTTACA AGTCTTTTGA TGGTAAGTTT
GCAGAAGTAT TGCCAATTAC TGGATTAGAT GCAGGGTATC CTATAGTAAT AGTAGTAGGG
TTGGGTAAAC CTGAAGAGTT TGATGAAAAC AAATCTTTAA GGATTGGTGG TGTTATATAT
TCTGAACTTA ATAGGATGAA AATATCAGAG GCCTCAATTA TAAGTAGTAA TGATAGTGAT
ATTATGGCTA ATGTTGCCTA TGGGGCATTT TTACGCAGTT TTAAGTTTGA TAAATATTTT
GTTCAGAAAA AAGATGAAAA TGCAACTTAT GTGCGTAAGT TAGAGTTCTT TTCAAAAACT
AATCCTCAAA AAACAGCTGT TTTGTTTGAT AATTTAAAAG CCGAAGGTGA GTCAGTATTT
TTAGCTCGTT CTTTTGTTTC AGAACCTCCT AATATTCTTT ATCCAGAAAT TTATGCTCAG
ATGATATATG AGGAATTAAG TAAAGTAGAT GTTAAAGTTG AAATATTTGA TGAAGATTAT
ATGAAAGCAA ATCAGATGAT GGCACTTCTA GGTGTAGGAC AAGGTAGTGC TAAGAAATCT
AGGCTTGTGG TTATGAGGTG GAATGGAGGA AAAGAAACAG ACAGTCCGAT AGCATTTGTT
GGAAAAGGAG TGACGTTTGA TACTGGTGGA ATATCTTTAA AACCTTCTAG GGGTATGTGG
GATATGAAAT ATGATATGGC TGGTTCTGCG TCTGTTGTAG GTATTATGCG GACTCTTGCA
GCAAGAAAAG CAAAAGTTAA TGCTGTAGGT GTAGTAGGGT TAGTTGAAAA TGCTGTAGGA
GGAAATGCGC AAAGACCAAG TGATGTTGTA ACTTCAATGT CTGGGCAAAC TATTGAAGTA
TTGAATACTG ATGCAGAAGG TAGACTAGTT TTGGCTGATG CTTTATGGTA TACTCAGAAA
ATGTTTTCTC CAAAATTAAT GATAGATTTA GCAACATTAA CTGGTGCAGT GGTTGTAGCA
CTAGGAAATA ATCAATATGG TGGGATTTTT TCAAACGATG ATGCAATTGC CAATCAATTA
ATTGTTGCTG GTAATGAGTC CGGTGAGAAA TTGTGGAGAT TACCTTTAGA TGATGCATAT
GATAAACTTA TAGATTCTTC GATTGCTGAT GTGCAAAATA TTTCAACAAA AGGGTATGGT
GCAGATAGCA TTACTGCTGC ACAATTTTTA CAGAGATTTG TAAATAAGAC TCCTTGGGTG
CATCTGGATA TTGCTGGAAT GGCGTGGGAT AATGAAGGTA ATGAAATATG CCCTAAAGGT
GCAACTGGGT TTGGTGTAAG GTTACTGAAT AGGTTGATTT TAAAATACTA TGAGGCTAAT
TAA
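
The gene sequence above is printed in blocks of ten bases, so it needs cleaning before any computation such as the GC content field. A minimal sketch follows, assuming the sequence contains only A, C, G, and T plus whitespace; the helper names `clean_sequence` and `gc_content` are hypothetical and not part of the source database.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used here as a short sample.
sample = clean_sequence(
    "ATGATAGATA TATCGTTTTC AAGTTTAATG CCAGGGGTAT CTTTGTTTTT AAAGACAACA"
)
print(len(sample))         # 60
print(gc_content("GGCC AATT".replace(" ", "")))  # 50.0
```

Passing the full pasted sequence to `clean_sequence` and then to `gc_content` gives a value that can be compared with the GC content field of the record.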
 
Protein sequence
MIDISFSSLM PGVSLFLKTT AIVVGIFEGS NHLEDYSALA ERSEQIMKVV EGYKSFDGKF 
AEVLPITGLD AGYPIVIVVG LGKPEEFDEN KSLRIGGVIY SELNRMKISE ASIISSNDSD
IMANVAYGAF LRSFKFDKYF VQKKDENATY VRKLEFFSKT NPQKTAVLFD NLKAEGESVF
LARSFVSEPP NILYPEIYAQ MIYEELSKVD VKVEIFDEDY MKANQMMALL GVGQGSAKKS
RLVVMRWNGG KETDSPIAFV GKGVTFDTGG ISLKPSRGMW DMKYDMAGSA SVVGIMRTLA
ARKAKVNAVG VVGLVENAVG GNAQRPSDVV TSMSGQTIEV LNTDAEGRLV LADALWYTQK
MFSPKLMIDL ATLTGAVVVA LGNNQYGGIF SNDDAIANQL IVAGNESGEK LWRLPLDDAY
DKLIDSSIAD VQNISTKGYG ADSITAAQFL QRFVNKTPWV HLDIAGMAWD NEGNEICPKG
ATGFGVRLLN RLILKYYEAN
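
The protein sequence above can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal pure-Python version, assuming unambiguous bases only; it uses the standard codon assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons, not in amino acid assignments). The function name `translate` is illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI "TCAG" order (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a nucleotide string, stopping at the first stop codon.

    Whitespace is ignored. Ambiguous bases such as N raise KeyError.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence in this record.
first_line = "ATGATAGATA TATCGTTTTC AAGTTTAATG CCAGGGGTAT CTTTGTTTTT AAAGACAACA"
print(translate(first_line))  # MIDISFSSLMPGVSLFLKTT
```

The output matches the first twenty residues of the protein sequence listed above (MIDISFSSLM PGVSLFLKTT).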