Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_3248 |
Symbol | |
ID | 8138605 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | - |
Start bp | 3777926 |
End bp | 3779776 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 644870857 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_003023032 |
Protein GI | 253701843 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 120 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGACAAG GCATCTGGAA ACCCCTGATG ATCACCGCAT TGTTCGTGCT GCTGGTTGAC CTGTTCTACG GTGCCGTCGT GAAGCAGACG GCGGAGCGCG GGGCCGACAT CAGCTACACC CGTTTCAGGG ACGAGGTGGC AGCGAACAAC GTGACCAAAG TCACGCTCAG GGGGAAGAAC GTCAAGGGGC AGTTCCGCAA TGCGATCAAG GTTCCTGCGG CCACGGTCGA TGAAAAGGGG GGGGCCGTCA GCGTCACCAA CTTCACCACC ACGCTGCCGT CGATAGAAGA CATGACCCTC CTGCCCGAGC TTAACAGCAG ACGGGTTGAC GTCTCGGTCG TTTCCACGGA AGGGTCAGCG ATTGGGACCG TGTTCCTGTA TCTCCTGCCG TGGCTCATCA TCCTTGGGGT CTGGTGGTTG GTGGTGCGTG GGATGAGAAA ACAGGGACCG ACCGGGATGA TGGGCGGGTT CGCCCGCTCC GGCGCCAAGG CGTACACCTC CGAGCGGATC GAGGTGACCT TTACCGACGT GGCCGGCATG GAAGAGGCCA AGCAGGAACT GCGCGAGGTG GTGGACTATC TGAAGGAGCC TAAGAAGTTC CAGCAGATTG GGGGGAAGGT ACCCAAAGGC GTGCTCCTGG TGGGGCCTCC TGGAACCGGC AAGACGCTTT TGGCGAGGGC AGTGGCGGGG GAGGCTGGGG TACCTTTTTT CTCCATTTCC GCGTCGGCCT TCATAGAGAT GTTCGTCGGG GTTGGCGCAA GCCGGGTGCG CGATCTTTTC GCCACGGCAA GGAAATCGCT CCCCAGCATC ATCTTCATCG ACGAGTTGGA TGCCGTCGGC AGGAGCCGCG GAGCCGGTTT CGGCGGTGGC CATGACGAGC GCGAACAGAC GCTGAACCAG CTGCTCTCGG AGATGGACGG CTTCGATCCC CATACCGAGC TGGTTGTTAT TTCGGCCACG AACAGGCCCG ACGTCCTCGA CCCGGCCCTG CTTCGTCCCG GGCGCTTCGA CCGCACGGTG GTGATAGAGC GGCCGGACTG GCGCGACCGC GAGAAGATCC TGAGGGTTCA TACCAAGAAG GTCCCGCTGG GGGCGGATGT CGATCTTGCG GTCATCGCCA AGGGAACCCC AGGAATGACT GGCGCGGACC TGGAGGGACT GGTCAACGAA GCGGCGATTC TCACAGCCCG CGAGAACAAG CACATCGTCG GTCTGGATGA GTTGGAACGT GCCAAGGACA AGATCCTGAT GGGCGGGGAG CGGCACATGG TGATCTCGGA TGAGGAGCGG CGGATCACAG CCTACCATGA GGCGGGGCAC GCCCTGGTGG CGCGGCTTCT TCCGAGCACC GATCCGGTCC ACAAAGTAAC TATACTGCCG CGGGGGCGTG CTCTTGGGGT AACCCAGCAG TTGCCCGAGG ACGACCGCTA CCACTACCCG CGCGCCTACC TCGTGAACCG TCTTTGCGTG GCGCTGGGAG GTAGGGTTGC CGAGCGGATC GTCTTCAACG ACGTATCATC GGGTGCCCAG AGCGATCTGA AGCATGTCAC GGAACTGGCC GAAAAGATGG TATGTCAGTG GGGGATGAGC GAGAAGATAG GGCCGATGAC CTTCTCCAGA GGAGAGGAGC ATCCTTTTCT GGGCATGAAG TTGGCGGAAG AAAAGACCTT TTCCGAAGAG ATGGCTTGGC TGATAGACCA GGAGATAGCA TCGTTCATCA GGGCTGCCGA AGGGAAATCC CTGGAGCTCT TGGGTGCCAA TCGGAGCAAG CTGGACGCAT TGGCGGCGGC GCTTCTGGAG GAAGAGACCC TTGACGGCCT GCGCGTCGAC GAGATTTTGT CGGGAGCGTG A
|
Protein sequence | MGQGIWKPLM ITALFVLLVD LFYGAVVKQT AERGADISYT RFRDEVAANN VTKVTLRGKN VKGQFRNAIK VPAATVDEKG GAVSVTNFTT TLPSIEDMTL LPELNSRRVD VSVVSTEGSA IGTVFLYLLP WLIILGVWWL VVRGMRKQGP TGMMGGFARS GAKAYTSERI EVTFTDVAGM EEAKQELREV VDYLKEPKKF QQIGGKVPKG VLLVGPPGTG KTLLARAVAG EAGVPFFSIS ASAFIEMFVG VGASRVRDLF ATARKSLPSI IFIDELDAVG RSRGAGFGGG HDEREQTLNQ LLSEMDGFDP HTELVVISAT NRPDVLDPAL LRPGRFDRTV VIERPDWRDR EKILRVHTKK VPLGADVDLA VIAKGTPGMT GADLEGLVNE AAILTARENK HIVGLDELER AKDKILMGGE RHMVISDEER RITAYHEAGH ALVARLLPST DPVHKVTILP RGRALGVTQQ LPEDDRYHYP RAYLVNRLCV ALGGRVAERI VFNDVSSGAQ SDLKHVTELA EKMVCQWGMS EKIGPMTFSR GEEHPFLGMK LAEEKTFSEE MAWLIDQEIA SFIRAAEGKS LELLGANRSK LDALAAALLE EETLDGLRVD EILSGA
|
| |