Gene ECH_1070 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1070 
Symbol 
ID3927910 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1098742 
End bp1099884 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content34% 
IMG OID637902184 
Productputative membrane-associated zinc metalloprotease 
Protein accessionYP_507855 
Protein GI88658328 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0750] Predicted membrane-associated Zn-dependent proteases 1 
TIGRFAM ID[TIGR00054] RIP metalloprotease RseP 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAGTA TTATTGATAA CTTATATCAT GTACTCAATA ATGGTTCATT TTATTTGCTG 
TCGTTTCTCA TTATAATGTC TATTATAGTT TTCGTACATG AATACGGTCA TTATATTGTT
GCAAAATTAT GCAATGTGAA GGTTGAAGTT TTTTCTATAG GATTTGGCCC AGAGTTATTT
GGAATTAACG ATAAGTCTGG CACAAGGTGG AAGTTCAGCG TGATACCAAT AGGTGGGTAT
GTAAAGATGT TAGGGGATGA AGACCCAGCA AGTGTTGAAG CAAATCCTAA CCGTTTGTCA
GAAGAAGATA AGTTACTTGC ATTTTGTGAA AAACCTCTAT ACCAAAAATT TCTTATTGTA
TTTGCTGGAC CATTCGCAAA TTTAGTGTTT GCTATAGTAG TACTCATGAT GTTCTTCACT
ACTAAAGGAA TGATGAAGCA CAACTCTGTC ATTGGAGGCG TAGTACAAGA TAGTGCAGCA
CAACATGCAG GATTAGCTTC AGGGGATACA ATTCTAAAAA TCAACGACTA CCAGGTTAAA
TGGTTTGAAG AAATTAAACA GTATATAGAA AAATATGCAA AAGATAATCA AGAGCTAACT
ATAGAATATG CACGTGACGG GCACATTCAT GTTGTGAAAG TTAAACCAAG CATTAAGGAA
GAAAAAGGAC TTTTTGGAAG CATAAAGAAA AGTCCATTTT TAGGAGTTAC AATGAGTAAT
GTACTCAGCA ATTATGAATT TCAGAGATTA AGCATCACTA GTGCTTTTGT TCAGTCCATT
AATTACACTT ATTTACTGTC AAAGTCAATT TTTCAAGTAT TGGGACAAAT GTTGGTAGGG
AAACGCAGTA TTTCTGAGTT AGGTGGTCCT ATACGCATTG CTCAATATTC TGGAGAATCA
GTAAAACACA ACGAAGTACT ATTGTGCATG GCAATGATTT CCATTAACCT AGGTGTAATG
AATTTATTAC CAATTCCTAT GCTAGATGGT GGACATATTT TCCAATATTT TGTCCAAGCT
ATATTACGAC GCAAACAACT CAATCCTAAA TATCAGCGGT ATATATCTAC AATTGGGTTA
ATGCTTCTGC TATCTTTAAT GATTTTTGTC ACGTTTAACG ATATAAAAAG TATGTTTAAG
TAG
 
Protein sequence
MASIIDNLYH VLNNGSFYLL SFLIIMSIIV FVHEYGHYIV AKLCNVKVEV FSIGFGPELF 
GINDKSGTRW KFSVIPIGGY VKMLGDEDPA SVEANPNRLS EEDKLLAFCE KPLYQKFLIV
FAGPFANLVF AIVVLMMFFT TKGMMKHNSV IGGVVQDSAA QHAGLASGDT ILKINDYQVK
WFEEIKQYIE KYAKDNQELT IEYARDGHIH VVKVKPSIKE EKGLFGSIKK SPFLGVTMSN
VLSNYEFQRL SITSAFVQSI NYTYLLSKSI FQVLGQMLVG KRSISELGGP IRIAQYSGES
VKHNEVLLCM AMISINLGVM NLLPIPMLDG GHIFQYFVQA ILRRKQLNPK YQRYISTIGL
MLLLSLMIFV TFNDIKSMFK