Gene ECH_0347 details

Gene Information

Locus tag: ECH_0347
Symbol:
ID: 3926978
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 336854
End bp: 338161
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 30%
IMG OID: 637901471
Product: M48 family peptidase
Protein accession: YP_507167
Protein GI: 88658252
COG category: [R] General function prediction only
COG ID: [COG4783] Putative Zn-dependent protease, contains TPR repeats
TIGRFAM ID:
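
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 336854
end_bp = 338161

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases each.
codon_count = gene_length // 3

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = codon_count - 1

print(gene_length, "bp")     # 1308 bp
print(protein_length, "aa")  # 435 aa
```

Under these assumptions the result agrees with the listed Gene Length (1308 bp) and Protein Length (435 aa).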


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAATATTA AAAGCGTGAT ATTATTTCTA AGTATTTTGC TTTATAGCAA TGTATCATTT 
TGTCAAGATG GTTACATGAT ATTTAGAGAT AGTGAAGTCG AGGCTGTAAT AAAGAAAATA
GCATTTCCAA TATTTATTGC AGCTAAAATT AATCCTGAGA CTGTTAGAGT GTTTATTGTT
AATGATAAAA TGGTAAATGC TTATGTTGAT GGTAATAATA ATGATGTGTT TTTGAATTAT
GGGTTATTTG AGTTTTCAAA TGATCCTAGT GTACTTATTG GGGTTTTAGC CCATGAAGTT
GGTCATATAT CTCAAAAACA TGTGTTATTC CGTAGAAGTA AAGTACAAAA TTCTATGATT
TTGTCTGGGA TAGGATATGT TCTAGGTATT ATTACTGCAA TTACAGTAAA TCCTGATATG
GGACAGGCAA TAGCACTTGC TACTAATGAT ATTAGTAAAA AAATGTTTTT TCTTTATAGT
CGTTTACAGG AGGCGTCTGC AGATCAATGT GCATTAAGAT ATTTAGATGA AGCTGGGTAT
AGCAACGATG GATTAATTAA AATGTTTAAG CATTTTTATT CACTGGAAGC ACAATATCGA
GGAAATATTG ATCAATACTT ATTATCGCAT CCTCTTAGTT ATGATAGGCT GTTGCAAATA
CAAAATTATC GCAATCGTAA TGAGGTTCAT GGTTTTTCTG ATGAAGATGT ACAGAAATTT
AAGCGAGTAG TAGAAAAAAT TAATGCGTTT TTTAACCCAG TAGAACGTTT GGTTAATGAT
AAAAATGATA TAAATCAATT ATCTCCATAC ATACAATCTA TTATTTTTTA TAAGCAATCT
GATGTTTCAA AAGCCTTAGA AAAACTTGAT AATCTAATAC TACAATCCCC TGAAGATCCT
TATCTTTATG AGCTGAAAGC ACAAATTTTG TATAAGGCAG GTGACATTAA AAAGTCTGTA
GAAAATTATA AATTAGCGCT TAAGTTTTCT TTCGATGATG TTTTAATAAA ACTTGAAACA
TCACAAGCTT TGTTATTGTA TGATCAGAAG GAAGCAGTAA ATTATTTGGA ACAAGTGACA
TACCAAGAAC CAGATAATGT TTTTGCTTGG AAGCAATTGG CTGTAGCTTA TGGTAAAATA
GGGGATTTGG GAATGTCGTA TTTTTCACTG GCAAATAAAT CTTTTTTTGA AAATAATAGA
AGAGATTTTG ATAAATACTT TAGCTTAGCA AGAAAGTATT TACCAAAAGA TAGCGTACAC
TTAGAACGTA TGCGTGATCT AAGGATAAAT TTATTAAGTA ATACATAA
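
The gene sequence above is laid out in blocks of ten bases, so the spaces and line breaks have to be removed before any computation. The following is a minimal sketch of one way to do that and to compute a GC fraction; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any database tool.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, copied verbatim.
first_line = "ATGAATATTA AAAGCGTGAT ATTATTTCTA AGTATTTTGC TTTATAGCAA TGTATCATTT"
bases = clean_sequence(first_line)

# Report the length and GC percentage of this first line only.
print(len(bases), "bases,", round(gc_fraction(bases) * 100), "% GC")
```

Passing the full sequence text through the same two helpers would give a figure that can be compared with the GC content reported in the Gene Information section.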
 
Protein sequence
MNIKSVILFL SILLYSNVSF CQDGYMIFRD SEVEAVIKKI AFPIFIAAKI NPETVRVFIV 
NDKMVNAYVD GNNNDVFLNY GLFEFSNDPS VLIGVLAHEV GHISQKHVLF RRSKVQNSMI
LSGIGYVLGI ITAITVNPDM GQAIALATND ISKKMFFLYS RLQEASADQC ALRYLDEAGY
SNDGLIKMFK HFYSLEAQYR GNIDQYLLSH PLSYDRLLQI QNYRNRNEVH GFSDEDVQKF
KRVVEKINAF FNPVERLVND KNDINQLSPY IQSIIFYKQS DVSKALEKLD NLILQSPEDP
YLYELKAQIL YKAGDIKKSV ENYKLALKFS FDDVLIKLET SQALLLYDQK EAVNYLEQVT
YQEPDNVFAW KQLAVAYGKI GDLGMSYFSL ANKSFFENNR RDFDKYFSLA RKYLPKDSVH
LERMRDLRIN LLSNT
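
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (to the best of my knowledge the two tables differ only in which start codons are permitted). `translate` is a hypothetical helper written for illustration, and it raises `KeyError` on any character other than A, C, G, or T.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a DNA string in frame 1, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    protein = []
    # Walk over complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed in this record.
print(translate("ATGAATATTA AAAGCGTGAT ATTATTTCTA"))  # MNIKSVILFL
```

The ten residues produced from the first thirty bases match the first block of the protein sequence above.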