Gene ECH_0479 details

Gene Information

Locus tag: ECH_0479
Symbol:
ID: 3927126
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 458144
End bp: 459868
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 30%
IMG OID: 637901602
Product: M24 family metallopeptidase
Protein accession: YP_507295
Protein GI: 88657638
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0006] Xaa-Pro aminopeptidase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA GGTTAAATCA ATTAATTGGT TTAATGGAAG AGTATGAAAT TGATGTATTG 
TTATTGCAGA ATACGGATGA ATATCAGTGT GAGTATGTGC ATATTAACAA GCAAAGAATA
AGATGGTTGT GTGGTTTTTC TGGCTCAAAT GCTACTTTGA TAATATCAAG GGAAGGTAAA
CAGAATTTTT TTACTGATGG TAGATATACA TTACAAGCAA CAAGGGAGTT GGACTGTAGT
TATTATCAAA TACATAATGT GTGTGAGTTA ACTCCTTGGC AATGGTGTGT AGAAAATTGT
CTATCTCATA CTGTTGTAGC TTATGAATCT GCATTGTTTA CTTTGAGTCA AATAAGAAAG
TATGAAGATT GTGGTATTTT TTTAAAACCA ATAGATCAAA TTTTGATTGA TAAGTTGTGG
ATTCGTGATT TTGCTATAGA ACATGATATA GTACAGCATT CTTTAGAATA TTCTGGTGTT
GAGAGTTATA CAAAGTCTTG TGAAGTTGCA AAATATTTGT CTGACAAAGA TGCTGCATTA
ATCACAAATA CTGATGTTAT TTCATGGATG TTAAATATAC GTAATAAAAA GTTTCTTTAT
AATCCTTCAG TGTTGTCTAG AGCGATTTTA TATAAGGATG GAAGAGTTGA TTTATTTATT
GATGATGTTT ACTCAGTAAA TGTTAAATAT GAGCATCTTA ATATATGTTC TTTGAATAAT
CTATTTAATG TTTTGAAATC TGTAAAATCA GTAGTTGTAG ATGCATCTAC TATACCAATG
AGTATTTTTC TATCTTTACA ACAACAGGAT GTATTAGTTA ATGATGCAGA TTTTTGTCTT
TTAATGAAAG CCAGAAAGAA TGATGTTGAA ATACAAGGTG CAATTAATGC TCATGTTAGA
GATGGAATTT CTATAGTTAA TTTATTATAC TGGTTAAATA TGCAATTGGA TAATAATCAG
AAAATTACAG AGCTGGATGT TGAGTCAAAA TTATTAGATT TCAGAAAACA GCAAAGTTTG
TTTCAAGGTG AAAGTTTTTC TACAATTTCT GGTTTTCAAG AAAACGGAGC TGTAATACAT
TACAGAGCGA ATAATGATAC AAATAAGTTA ATATGTAAGA ATGGATTATA TTTATTAGAT
TCTGGTGGTC AATATCTTGA TGGTACAACA GATGTTACGC GTACTGTTGC TATAGGTGAA
CCTACATCTG AGCAAATTAC TAATTTTACG TTAGTTTTAA AAGGTCATAT TGCTTTAGCG
ATGGCAGTCT TTCCTTTAGG TACTACTGGT GGAATGTTAG ATATATTAGC TAGACAGTAT
TTGTGGAAAT CAGGACTTGA TTATCAACAT GGTACGGGTC ATGGGGTGGG AAGTTTTTTA
TCAGTTCATG AAGGTCCTTG TGCTATTTCG TACAAGAATG ATGTTGTATT GCAGCCAAAT
ATGGTGCTAT CAAATGAGCC AGGATATTAT AAAAATGGTG AATATGGAAT AAGGATTGAA
AATCTGATGT ACGTTGAAGA ATATATGAAT GGCTTTTTAA GATTTAAACA GTTAACATGT
GTGCCTATAG ATTTAAGATT AATAGATGTA GATATGTTAA ATCATGAGGA AATCAATTAC
ATAGACCAGT ATCATAATTT TGTATATAAC ACTATTGCTC CACATGTAAG TGAAGAAGTA
AAACATTGGT TATGTCATGC ATGTCAGAGT TTAAAAGGTA AGTAG
 
Protein sequence
MKNRLNQLIG LMEEYEIDVL LLQNTDEYQC EYVHINKQRI RWLCGFSGSN ATLIISREGK 
QNFFTDGRYT LQATRELDCS YYQIHNVCEL TPWQWCVENC LSHTVVAYES ALFTLSQIRK
YEDCGIFLKP IDQILIDKLW IRDFAIEHDI VQHSLEYSGV ESYTKSCEVA KYLSDKDAAL
ITNTDVISWM LNIRNKKFLY NPSVLSRAIL YKDGRVDLFI DDVYSVNVKY EHLNICSLNN
LFNVLKSVKS VVVDASTIPM SIFLSLQQQD VLVNDADFCL LMKARKNDVE IQGAINAHVR
DGISIVNLLY WLNMQLDNNQ KITELDVESK LLDFRKQQSL FQGESFSTIS GFQENGAVIH
YRANNDTNKL ICKNGLYLLD SGGQYLDGTT DVTRTVAIGE PTSEQITNFT LVLKGHIALA
MAVFPLGTTG GMLDILARQY LWKSGLDYQH GTGHGVGSFL SVHEGPCAIS YKNDVVLQPN
MVLSNEPGYY KNGEYGIRIE NLMYVEEYMN GFLRFKQLTC VPIDLRLIDV DMLNHEEINY
IDQYHNFVYN TIAPHVSEEV KHWLCHACQS LKGK
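
The record's length fields can be cross-checked against its coordinates and sequences. The following is a minimal Python sketch, not part of the original record. The helper names (`clean_sequence`, `gc_percent`, `expected_protein_length`) are hypothetical. It assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in 10-character blocks into one string."""
    # split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Coordinates copied from the Gene Information fields.
start_bp, end_bp = 458144, 459868

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1
print(gene_length)                           # 1725
print(expected_protein_length(gene_length))  # 574
```

To check the sequences themselves, paste the Gene sequence block into `clean_sequence` and compare `len(...)` with the Gene Length field; `gc_percent` on the same string can be compared with the GC content field, which appears to be rounded to a whole percent.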