Gene ECH_1058 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1058 
Symbol 
ID3927992 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1085558 
End bp1086913 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content29% 
IMG OID637902172 
ProductM16 family peptidase 
Protein accessionYP_507843 
Protein GI88658196 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAATA CATTATTCTA CATAATAACA TTGATTTTTT TTGCATATAA TGCATATGCA 
GATGATATTA ATATCAACAT AAAGGAAGCC ACTATTAATA ATAACATACG CTACTTATAT
GTTGAACATC ACGACTTACC AACAATTTCT TTAACACTTG CATTCAAAAA AGCAGGATAT
GCATATGATG CTTCTGACAA ACAAGGACTT GCACATTTCA CATCACAAAT ATTACAAGAA
GGATCAGAAA GTAATCATGC TCTAGAATTT GCAAAACAAT TAGAAGGTAA AGGTATAGAC
TTAAAATTTC ATGTAGACAT AGATAATTTC TATATATCTA TAAAAACACT ATCAGAAAAC
TTTGAAGAAG CTTTAACTTT ATTAAGTGAT TGCTTATTCA ATCCAGTAAC TGACCCAGAA
ATATTTCATA GGGTAATAGC AGAACAAAGT GCACATGTAA AATCTTTATA TGGATCTCCT
AAATTCATAG CAGCAACTGA AATTAATCAT GCCATATTTA AAGGACACCC ATACTCTAAT
AAAATCTATG GTACATTAAA TACTATTAAC AACATTACCC AAGAAGATGT ATCATCATAC
ATAAAAAACA GTTTTGATAA GGACCAAATC GTCATTAGTG CAGCAGGAGA TATAGATTCA
GCAAAACTAT CGAATTTATT AGATAAATAT ATCCTATCAA AATTACCGTC TGGTAATAAC
AAAAATACTA TACCAGATGC CACCGTTAAC AGAGAACAGA AACTCTTATA TGTAAGAAGA
AATGTACCAC AAAGTGTCAT AATGTTTGCT ACAGACACAG TATCATACAA TGACGAGGAT
TATTATGCAT CCAACTTATT CAACAATATG CTAGGAGGAT TAAGCCTTAA TTCAATATTG
ATGATAGAGT TACGAGACAA ATTAGGACTA ACTTACCATG CTAGTAGCAT GCTAGATAAT
ATGAATCATA GTAACGTGTT ACTCGGCATA ATAACTACTG ATAATACTAC AGTAACAAAA
TGCATATCTG TATTAAAAGA AATTATAGAA AATATTAAAA ATAATGGAAT TAATCAGGAA
ACTTTTTTAA CTGCAAAATC TAGTATTACT AATTCTTTCA TTTTATCAAT GTTAAACAAT
GATAATGTTG CAAATACATT ATTAAACCTA CAATTACGCG GTCTAGATCC AAGTTATATA
AACAAACATA ATTCCTACTA TAAAACCCTC ACAATAGAAG AAGTAACCAA AGTTGCTAGG
AAAATCTTAT CTAATGATTT AGTAATAATT GAAGTGGGAA AAAACAATAA TATAAACGGT
AAACAGATAG AAGCTAAAGA AAACATACTT GGCTAA
 
Protein sequence
MRNTLFYIIT LIFFAYNAYA DDININIKEA TINNNIRYLY VEHHDLPTIS LTLAFKKAGY 
AYDASDKQGL AHFTSQILQE GSESNHALEF AKQLEGKGID LKFHVDIDNF YISIKTLSEN
FEEALTLLSD CLFNPVTDPE IFHRVIAEQS AHVKSLYGSP KFIAATEINH AIFKGHPYSN
KIYGTLNTIN NITQEDVSSY IKNSFDKDQI VISAAGDIDS AKLSNLLDKY ILSKLPSGNN
KNTIPDATVN REQKLLYVRR NVPQSVIMFA TDTVSYNDED YYASNLFNNM LGGLSLNSIL
MIELRDKLGL TYHASSMLDN MNHSNVLLGI ITTDNTTVTK CISVLKEIIE NIKNNGINQE
TFLTAKSSIT NSFILSMLNN DNVANTLLNL QLRGLDPSYI NKHNSYYKTL TIEEVTKVAR
KILSNDLVII EVGKNNNING KQIEAKENIL G