Gene ECH_1057 details


Gene Information

Locus tag: ECH_1057
Symbol:
ID: 3927287
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1084236
End bp: 1085555
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 32%
IMG OID: 637902171
Product: M16 family peptidase
Protein accession: YP_507842
Protein GI: 88658542
COG category: [R] General function prediction only
COG ID: [COG0612] Predicted Zn-dependent peptidases
TIGRFAM ID:
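
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `feature_length` and `protein_length` are hypothetical and are not part of the record or of any database API.

```python
# Values copied from the record above.
start_bp = 1084236
end_bp = 1085555


def feature_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the CDS ends with one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


gene_len = feature_length(start_bp, end_bp)
print(gene_len, protein_length(gene_len))  # 1320 439
```

Under these assumptions the result agrees with the Gene Length (1320 bp) and Protein Length (439 aa) fields.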


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTAAAT TTTTTACTTG TTTTTTTACA ATCTTCTTCA CAATAGCTAA TCATGCTTTA 
TCTTTTAACA TTAAAGTTAC ACATGAAAAG CTAGATAATG GCATGGAAGT ATATGTTATC
CCTAATCATC GCGCACCCGC AGTTATGCAT ATGGTATTAT ATAAAGTTGG TGGGACTGAT
GATCCAGTAG GTTATTCTGG ACTTGCACAT TTTTTTGAAC ATTTAATGTT CAGTGGTACA
GAAAAATTCC CTAATCTTAT AACTACTCTA AGTGATATAG GCGGAAACTT CAATGCAAGT
ACATCTGAAT TTTGCACTAT ATATTATGAA CTAATACCAA AACAACATTT ATCTCTTGCA
ATGGATATTG AATCAGACAG AATGCAAAAT TTTAAAATTA CTGATAAGGC ATTAATAAGA
GAGCAAAAGG TAGTATTAGA AGAAAGAAAA ATGAGAGTTG AAAGTCAGGC AAAAAACATA
CTGCAAGAAG AGATGGAAAA CACATTCTAT TATAATGGAT ATGGTAGACC AGTAGTAGGG
TGGGAACATG AAATCAGCAA TTACAATAGA GAAGTTGCTG AAGCATTTTA TAAACTTCAC
TACAGCCCTA ACAACGCTAT ATTAGTTGTA ACTGGAGATG TCGATCCACA GGAAACAATC
AACCTTGCAC AACAGTACTA TGGGAAGATA GAACCTAATC ACAAAAAATC CACACGTGTT
TTTAGAGCAG AACCTTCACA CAAAGCAAAC ATTACATTAA CATTAGAAGA TAGTTCAGTA
GAAATCCCAG AATTATTTTT AATGTATCAA ATACCAAGCG GTATCGCAAA TAAAAACTAT
ATACTCAATA TGATGGCAGC AGAAATACTT GGTAACGGTA AATTCAGTTT GCTTTACAAT
GATCTAGTAA TGAATAACTC AATAGTCACA TCAATAGGCA CCAATTATAA CTATTTAACT
GATAGTGATA ACTACCTCTT TATAGAAGCC GTACCTAAAG ATGGGATCTC TACAGAAACT
GTAGAAAAAG AAATCCACAA ATGTATAAAT AGTTATCTTG AAAATGGCAT TTCACCAGAA
TATTTAGAAA GTGCAAAACA AAAAGTAAAA GCACACTTAA CTTATTCTCT TGATGGATTA
AGCTTTATAT CATATTTCTA CGGCATGAAC TTAATTCTAG GAGTACCATT ATCAGAAATT
AACAATATTT ACGATACAAT AGATAAAATA AAGATTGAGG ATATTGATTC CACTATGGAA
AACATCTTCT TAAAGAACGT AAGATTAGCT GGACATTTAT TACCTAAATT GGGAGAATAG
 
Protein sequence
MVKFFTCFFT IFFTIANHAL SFNIKVTHEK LDNGMEVYVI PNHRAPAVMH MVLYKVGGTD 
DPVGYSGLAH FFEHLMFSGT EKFPNLITTL SDIGGNFNAS TSEFCTIYYE LIPKQHLSLA
MDIESDRMQN FKITDKALIR EQKVVLEERK MRVESQAKNI LQEEMENTFY YNGYGRPVVG
WEHEISNYNR EVAEAFYKLH YSPNNAILVV TGDVDPQETI NLAQQYYGKI EPNHKKSTRV
FRAEPSHKAN ITLTLEDSSV EIPELFLMYQ IPSGIANKNY ILNMMAAEIL GNGKFSLLYN
DLVMNNSIVT SIGTNYNYLT DSDNYLFIEA VPKDGISTET VEKEIHKCIN SYLENGISPE
YLESAKQKVK AHLTYSLDGL SFISYFYGMN LILGVPLSEI NNIYDTIDKI KIEDIDSTME
NIFLKNVRLA GHLLPKLGE
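
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below is one way to do that in plain Python, and also shows how GC content and a translation could be computed. The helper names (`join_blocks`, `gc_content`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs in its permitted start codons, which this sketch does not model). Only the first line of the gene sequence is used as input here.

```python
# Codon table built from the usual TCAG ordering. Amino-acid assignments
# are those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First line of the gene sequence, copied from the record above.
first_line = "ATGGTTAAAT TTTTTACTTG TTTTTTTACA ATCTTCTTCA CAATAGCTAA TCATGCTTTA"
dna = join_blocks(first_line)
print(translate(dna))  # MVKFFTCFFTIFFTIANHAL
```

The printed residues match the first two blocks of the protein sequence listed above. Passing the full gene sequence through `join_blocks` and `gc_content` would give a value to compare with the GC content field.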