Gene ECH_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH_1052 
Symbol 
ID3927606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEhrlichia chaffeensis str. Arkansas 
KingdomBacteria 
Replicon accessionNC_007799 
Strand
Start bp1080495 
End bp1081910 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content34% 
IMG OID637902166 
Productserine protease 
Protein accessionYP_507837 
Protein GI88658017 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID[TIGR02037] periplasmic serine protease, Do/DeqQ family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.72618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGAT TTCTATTACT AATATTAGTA CTAATATTGG CAAATGTTCC TATAGGTAGT 
TTTGCTGATC AGAAAGCAGA ACAATATGAT CCAAGGTCAG GCTTTTCTAA ATTAATCAAA
GAATCTACAC CAGCAGTAGT TAATATCAGT ATAGTACATG ATTTAATACA GGAACAATTT
CCTTTAATAA CTCTCGAAGA ACTTTTAAGG AATATATTAG AAGGCAAGCC AGTAAAAAAA
GACATACCAC AAGAAGTATT AAGCGCAGGG TCGGGATTTG TTGTAGATGA ATCAGGTATT
ATCGTCACCA ACTATCATGT TGTACACAAT GCAAAAGAAG TTTATGTCAC ATTTAGTGAC
AACAAGTCAA TTCCTGCTAA GATTTTAGGA GTTGACCCAC AAACAGACCT AGCAGTGTTA
AAAGTTGAAG TTAATGAAAA ACTTCCTTAT CTAGAATTTG GAGATTCTGA CAAGACTATG
GTTGGAGACT GGGTAGTTGC CATAGGTAAC CCATTTGGTC TTGGTGGTTC TGCAAGTATT
GGTATTATAT CTGCACGCGC AAGAGATCTT AACATTGGCA CAGCAACAGA ATTTTTACAA
ACTGATGCTG CAATTAATAA GGGTAACTCT GGTGGTCCTC TATTTAATAT AGATGGTAAA
GTAATTGGTA TTAACACAGC AATATTATCT ACACAAAAAG GTGGTGGTAA CATAGGAGTT
GGATTTGCTA TCCCATCAAA TAATGCTGTT TCTATAATAA AAGTTTTATC CCAAGGGAAA
AAAGTAGAAC ATGGTTGGCT CGGTGTAGTG ATGCAACCGA TAACTGAAGA ACTAGTAGAA
CCATTACAAC TAAAAGAAGT GGGTGGAGCT TTAATTACTA ATGTAGTCAA AGGTAGTCCA
GCAAGTAAAG CAAACTTGCT TCCAGGGGAT ATTATACTTG AGTTTAATGG TACTAAGATT
AATTCAATAT CACAACTACA TCAATTAGTA CTAAGATCAG AAGCAGACAA TGAAGTAAAA
TTACTTGTGT CACGTAATGG TAGTATTATA AGTATACTAG TCAAAATAGG GAAATTTGAG
AATCCTGATA TTTCAGAAAA CGGAATGCCT AAGGATGCTA TCCAATCCCC GGAGTTAGGG
TTAACAGTAG GTAGTGTACA ACGCAATAAC ATATATAATA ATATTGAGGA AGAAGAAGCA
AAAGGAGTGG TGATATTAGA TATAGATAGT ACAAGTAATG CTTCAACCAG AAATATAAGA
AAAGGCGATA TAATATTACA AATCAATCAA TCACCAGTAA ATAATCTCGA AGATTTTAAA
AATGTTATGA AAAAAGTACG TAAAAATAAA TCTGTAGCAT TGTTAATCAG TAGGGATAAT
ATATCAGCTT TTGTGACTGT CAAGCTGAAG CAGTAG
 
Protein sequence
MKRFLLLILV LILANVPIGS FADQKAEQYD PRSGFSKLIK ESTPAVVNIS IVHDLIQEQF 
PLITLEELLR NILEGKPVKK DIPQEVLSAG SGFVVDESGI IVTNYHVVHN AKEVYVTFSD
NKSIPAKILG VDPQTDLAVL KVEVNEKLPY LEFGDSDKTM VGDWVVAIGN PFGLGGSASI
GIISARARDL NIGTATEFLQ TDAAINKGNS GGPLFNIDGK VIGINTAILS TQKGGGNIGV
GFAIPSNNAV SIIKVLSQGK KVEHGWLGVV MQPITEELVE PLQLKEVGGA LITNVVKGSP
ASKANLLPGD IILEFNGTKI NSISQLHQLV LRSEADNEVK LLVSRNGSII SILVKIGKFE
NPDISENGMP KDAIQSPELG LTVGSVQRNN IYNNIEEEEA KGVVILDIDS TSNASTRNIR
KGDIILQINQ SPVNNLEDFK NVMKKVRKNK SVALLISRDN ISAFVTVKLK Q