Gene Information |
Locus tag | ECH_1098 |
Symbol | ftsH |
ID | 3927708 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ehrlichia chaffeensis str. Arkansas |
Kingdom | Bacteria |
Replicon accession | NC_007799 |
Strand | + |
Start bp | 1121571 |
End bp | 1123403 |
Gene Length | 1833 bp |
Protein Length | 610 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 637902212 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_507882 |
Protein GI | 88657587 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0627303 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGAGAAAAA TAATTGAAGG GTTAGCAATT TGGATTATAG TAGTTGTTCT GGTTGCTGCA GCATATGTTC AATTTAATGA TAAAATCATT AACGGTAATA CCATCAAATT GCCATTCTCA GAGTTTTTAA ACAAGATTGA TAATAATGAA GTCGAAAGCA TTAATATAGG CGAACATAAT ATTACTGGAA AATTGAAAGA TGGTTCTAAA TTTCAAACAA CTGCCATAGT ATACGACAGT TTAATAAAAA CATTACATAA TCAGCAAGTG ACTTTTACTT TTTTACCTGA AGATACATTC TTTGGGGTAT TAAGTAACAT ATTGGTATCC TGGTTCCCTA TGTTATTACT TGTTATAATA TGGTTTATAT TCCTCAAACG TATGCAAATA GGAGGTAACC GCACTATAAA CTTTAGCAAA TCACGAGCGA AATTAATGAC CGAAAATCGT AATAAAGTAA CTTTTAACGA TGTAGCAGGT ATTGATGAAG CTAAAGAAGA ATTAATCGAA ATAGTAGATT TCTTGAAACA CAGACAAAGG TTTCAAAAAT TAGGAGGTAA AATACCAAAA GGATGCTTAT TAATAGGTTC ACCAGGTACA GGAAAAACTT TATTAGCACG AGCAATTGCA GGTGAAGCAA ATGTTCCGTT TTTTAGTATA TCTGGGTCAG ATTTTGTAGA AATGTTTGTA GGTGTGGGTG CAAGCCGTGT GCGCGACATG TTTGAACAAG GGAAAAAAAA TGCCCCATGT ATCATATTCA TTGATGAGAT AGATGCTGTA GGAAGACACA GAGGAATTGG CTTAGGTGGA GGAAATGATG AACGCGAACA AACATTAAAC CAATTACTTG TTGAAATGGA TGGATTTGAA TCTAATGAGG GAGTAATCAT CATCGCTGCA ACTAACCGTC CAGATGTATT AGATTCAGCA CTTCTTAGAC CAGGAAGATT TGATAGACAA GTTACTATTA GCATACCTGA TATTAACGGG CGAGAAAAAA TAATTAATGT ACATATCAAA AAAGTACCTA CAGCACCAGA TGTCAATATT AGAACCATTG CACGTGGTAC TCCAGGATTC TCTGGAGCAG ATTTGGCAAA TCTAGTAAAT GAAGCAGCAC TAATAGCAGC AAGACTCAAC AAAAAAATTG TCACAATGAG TGATTTTGAG TATGCAAGAG ACAAAGTTAT GATGGGAGCT GAAAGAAAAT CACTAATGAT GACAGAAGAA GAAAGAAGGC TAACAGCTTA TCATGAAGCA GGACATGCTA TTATCGCATT TTTCACAGAA GCATCCGATC CTATACATAA GGCAACAATC ATTCCAAGAG GAAGAAGTTT AGGGTTGGTA ATGAGACTAC CAGAATCAGA TCGTGTCTCT CATACAAGGG AAAAAATGAT TGCAGATTTG ACAGTAGCAA TGGGTGGAAG AGCTGCTGAG GAATTGATTT TCGGATACCA CAAAGTAACA AGTGGAGCAT CATCAGACAT AAAACAAGCA ACAGACCTTG CAAAAGCCAT GGTTATGAAG TGGGGAATGA GCGACAAAGT AGGTCCGTTA TATCACAATG ACGATAAAAA TGACACTATT TCTAATAACT TAGCAAATTT GATAGATGAA GAGGTCAAGC TTATAGTAAC ATCTGCACTT GAAAGAGCAA AAAGCCTATT AAATGAGCAT TTAGAATCAT TACATATAGT TGCTAAAAAT TTATTGGAAT TTGAGACTCT AACTGGAGAA GATATCAAAA ATATAATAAA TGGTAAAGAA CTGACCAAAG ATGATATTGA GGAATCTCAG 
GTTTTAAAAA GATCATTCGC TTCAAAACAG TAA |
Protein sequence | MRKIIEGLAI WIIVVVLVAA AYVQFNDKII NGNTIKLPFS EFLNKIDNNE VESINIGEHN ITGKLKDGSK FQTTAIVYDS LIKTLHNQQV TFTFLPEDTF FGVLSNILVS WFPMLLLVII WFIFLKRMQI GGNRTINFSK SRAKLMTENR NKVTFNDVAG IDEAKEELIE IVDFLKHRQR FQKLGGKIPK GCLLIGSPGT GKTLLARAIA GEANVPFFSI SGSDFVEMFV GVGASRVRDM FEQGKKNAPC IIFIDEIDAV GRHRGIGLGG GNDEREQTLN QLLVEMDGFE SNEGVIIIAA TNRPDVLDSA LLRPGRFDRQ VTISIPDING REKIINVHIK KVPTAPDVNI RTIARGTPGF SGADLANLVN EAALIAARLN KKIVTMSDFE YARDKVMMGA ERKSLMMTEE ERRLTAYHEA GHAIIAFFTE ASDPIHKATI IPRGRSLGLV MRLPESDRVS HTREKMIADL TVAMGGRAAE ELIFGYHKVT SGASSDIKQA TDLAKAMVMK WGMSDKVGPL YHNDDKNDTI SNNLANLIDE EVKLIVTSAL ERAKSLLNEH LESLHIVAKN LLEFETLTGE DIKNIINGKE LTKDDIEESQ VLKRSFASKQ |
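The length fields in this record can be cross-checked against the coordinates with a few lines of arithmetic. The sketch below is illustrative only and is not part of the source database. It assumes that Start bp and End bp are 1-based inclusive coordinates, and that the CDS is unspliced and ends in a single stop codon, as the record indicates. The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 1121571
END_BP = 1123403


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (both ends count)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                   # compare with the Gene Length field
    print(protein_length(length), "aa")   # compare with the Protein Length field
```

Passing the Gene sequence field to `gc_fraction` gives a value to compare with the reported GC content, which the record rounds to a whole percent.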