Gene ECH_1098 details


Gene Information

Locus tag: ECH_1098
Symbol: ftsH
ID: 3927708
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ehrlichia chaffeensis str. Arkansas
Kingdom: Bacteria
Replicon accession: NC_007799
Strand:
Start bp: 1121571
End bp: 1123403
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 35%
IMG OID: 637902212
Product: ATP-dependent metalloprotease FtsH
Protein accession: YP_507882
Protein GI: 88657587
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0465] ATP-dependent Zn proteases
TIGRFAM ID: [TIGR01241] ATP-dependent metalloprotease FtsH
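
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention explicitly, and the function names are illustrative only.

```python
# Values copied from the record; the conventions below are assumptions.
START_BP = 1121571
END_BP = 1123403
GENE_LENGTH_BP = 1833
PROTEIN_LENGTH_AA = 610


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length_aa(gene_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))     # 1833
    print(coding_length_aa(GENE_LENGTH_BP))  # 610
```

Under these assumptions, 1123403 - 1121571 + 1 = 1833 bp, and 1833 / 3 = 611 codons, of which one is the stop codon, leaving 610 residues.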


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0627303
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAAAA TAATTGAAGG GTTAGCAATT TGGATTATAG TAGTTGTTCT GGTTGCTGCA 
GCATATGTTC AATTTAATGA TAAAATCATT AACGGTAATA CCATCAAATT GCCATTCTCA
GAGTTTTTAA ACAAGATTGA TAATAATGAA GTCGAAAGCA TTAATATAGG CGAACATAAT
ATTACTGGAA AATTGAAAGA TGGTTCTAAA TTTCAAACAA CTGCCATAGT ATACGACAGT
TTAATAAAAA CATTACATAA TCAGCAAGTG ACTTTTACTT TTTTACCTGA AGATACATTC
TTTGGGGTAT TAAGTAACAT ATTGGTATCC TGGTTCCCTA TGTTATTACT TGTTATAATA
TGGTTTATAT TCCTCAAACG TATGCAAATA GGAGGTAACC GCACTATAAA CTTTAGCAAA
TCACGAGCGA AATTAATGAC CGAAAATCGT AATAAAGTAA CTTTTAACGA TGTAGCAGGT
ATTGATGAAG CTAAAGAAGA ATTAATCGAA ATAGTAGATT TCTTGAAACA CAGACAAAGG
TTTCAAAAAT TAGGAGGTAA AATACCAAAA GGATGCTTAT TAATAGGTTC ACCAGGTACA
GGAAAAACTT TATTAGCACG AGCAATTGCA GGTGAAGCAA ATGTTCCGTT TTTTAGTATA
TCTGGGTCAG ATTTTGTAGA AATGTTTGTA GGTGTGGGTG CAAGCCGTGT GCGCGACATG
TTTGAACAAG GGAAAAAAAA TGCCCCATGT ATCATATTCA TTGATGAGAT AGATGCTGTA
GGAAGACACA GAGGAATTGG CTTAGGTGGA GGAAATGATG AACGCGAACA AACATTAAAC
CAATTACTTG TTGAAATGGA TGGATTTGAA TCTAATGAGG GAGTAATCAT CATCGCTGCA
ACTAACCGTC CAGATGTATT AGATTCAGCA CTTCTTAGAC CAGGAAGATT TGATAGACAA
GTTACTATTA GCATACCTGA TATTAACGGG CGAGAAAAAA TAATTAATGT ACATATCAAA
AAAGTACCTA CAGCACCAGA TGTCAATATT AGAACCATTG CACGTGGTAC TCCAGGATTC
TCTGGAGCAG ATTTGGCAAA TCTAGTAAAT GAAGCAGCAC TAATAGCAGC AAGACTCAAC
AAAAAAATTG TCACAATGAG TGATTTTGAG TATGCAAGAG ACAAAGTTAT GATGGGAGCT
GAAAGAAAAT CACTAATGAT GACAGAAGAA GAAAGAAGGC TAACAGCTTA TCATGAAGCA
GGACATGCTA TTATCGCATT TTTCACAGAA GCATCCGATC CTATACATAA GGCAACAATC
ATTCCAAGAG GAAGAAGTTT AGGGTTGGTA ATGAGACTAC CAGAATCAGA TCGTGTCTCT
CATACAAGGG AAAAAATGAT TGCAGATTTG ACAGTAGCAA TGGGTGGAAG AGCTGCTGAG
GAATTGATTT TCGGATACCA CAAAGTAACA AGTGGAGCAT CATCAGACAT AAAACAAGCA
ACAGACCTTG CAAAAGCCAT GGTTATGAAG TGGGGAATGA GCGACAAAGT AGGTCCGTTA
TATCACAATG ACGATAAAAA TGACACTATT TCTAATAACT TAGCAAATTT GATAGATGAA
GAGGTCAAGC TTATAGTAAC ATCTGCACTT GAAAGAGCAA AAAGCCTATT AAATGAGCAT
TTAGAATCAT TACATATAGT TGCTAAAAAT TTATTGGAAT TTGAGACTCT AACTGGAGAA
GATATCAAAA ATATAATAAA TGGTAAAGAA CTGACCAAAG ATGATATTGA GGAATCTCAG
GTTTTAAAAA GATCATTCGC TTCAAAACAG TAA
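
A GC-content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming the sequence is pasted as space-separated 10-base blocks as shown above. The helper name `gc_content` is hypothetical, and how the database rounds its reported percentage is not known.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces and line breaks between 10-base blocks) is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First line of the gene sequence; paste the full block to check the whole gene.
    first_line = "ATGAGAAAAA TAATTGAAGG GTTAGCAATT TGGATTATAG TAGTTGTTCT GGTTGCTGCA"
    print(f"{gc_content(first_line):.1f}% GC")
```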
 
Protein sequence
MRKIIEGLAI WIIVVVLVAA AYVQFNDKII NGNTIKLPFS EFLNKIDNNE VESINIGEHN 
ITGKLKDGSK FQTTAIVYDS LIKTLHNQQV TFTFLPEDTF FGVLSNILVS WFPMLLLVII
WFIFLKRMQI GGNRTINFSK SRAKLMTENR NKVTFNDVAG IDEAKEELIE IVDFLKHRQR
FQKLGGKIPK GCLLIGSPGT GKTLLARAIA GEANVPFFSI SGSDFVEMFV GVGASRVRDM
FEQGKKNAPC IIFIDEIDAV GRHRGIGLGG GNDEREQTLN QLLVEMDGFE SNEGVIIIAA
TNRPDVLDSA LLRPGRFDRQ VTISIPDING REKIINVHIK KVPTAPDVNI RTIARGTPGF
SGADLANLVN EAALIAARLN KKIVTMSDFE YARDKVMMGA ERKSLMMTEE ERRLTAYHEA
GHAIIAFFTE ASDPIHKATI IPRGRSLGLV MRLPESDRVS HTREKMIADL TVAMGGRAAE
ELIFGYHKVT SGASSDIKQA TDLAKAMVMK WGMSDKVGPL YHNDDKNDTI SNNLANLIDE
EVKLIVTSAL ERAKSLLNEH LESLHIVAKN LLEFETLTGE DIKNIINGKE LTKDDIEESQ
VLKRSFASKQ
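
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand instead of relying on an external library. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored and a trailing partial codon is dropped.
    Ambiguous bases (for example N) raise KeyError in this sketch.
    """
    bases = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First and last stretches of the gene sequence shown above.
    print(translate("ATGAGAAAAA TAATTGAAGG GTTAGCAATT"))      # MRKIIEGLAI
    print(translate("GTTTTAAAAA GATCATTCGC TTCAAAACAG TAA"))  # VLKRSFASKQ
```

Both fragments match the corresponding start and end of the listed protein sequence, and passing the full gene sequence to `translate` would allow the remaining residues to be compared in the same way.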