Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0642 |
Symbol | |
ID | 7406983 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 727918 |
End bp | 729768 |
Gene Length | 1851 bp |
Protein Length | 616 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643715023 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_002572539 |
Protein GI | 222528657 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAGGAACC TATTTAAAAC TGCAACAATT TATATTCTGA TTGCTCTTGT GATTTTGTTA CTTGTAGATA TTTTTAGTGG AGGGCTTTCG TACAATCAGT TCTTTTCAAA TTTGAGCGAA AGAAGAGAGG TAATTTATTC AGAGCTTATA AACGATATAA ACGATGGCAA GGTGACAAGG ATTGTTTTGA GCTATAACAA TGTGTCTGGA CAGTATGCAG ACGGCACCAA GTTTGACAAT GTGTTTGTAC CATCTCCAGA TAAGTTTTTA GACCAGATAC AGCCAGCCAT TCAGGCAAAG AAGATTCAAA TAGTGACAAA AGAACCACCG CAGGTTCCAT GGTGGCTTTC GACCTTTTTG CCAATGCTTA TATTTGCAGG ACTGATGATT TTTGTATGGA TATTCATGCT ACAGCAGACC CAGGGTGGCG GCAGCAAGAT AATGTCGTTT ACAAAATCGC GTGCAAAGAC AATCCAGGAC CTCAAAAAGA AGGTCACATT TGCAGACGTT GCAGGTGCAG ATGAAGAAAA AGAAGAACTC AAAGAGGTTA TTGATTTTCT CAAAAATCCA AGAAAGTATA TCGAACTTGG TGCGAGAATT CCAAAAGGGA TTTTGCTTGT CGGACCGCCT GGAACAGGTA AAACCCTTTT AGCAAAAGCA GTTGCAGGCG AGGCAGGAGT TCCATTTTTC AGCATATCGG GTTCTGACTT TGTTGAGATG TTTGTTGGTG TTGGCGCAGC AAGAGTGAGA GACCTTTTTG ACCAAGCAAA GAGAAATGCT CCATGTGTTG TGTTTATAGA TGAGATTGAC GCAGTTGGTC GTCACAGGGG AGCAGGGCTT GGCGGAGGTC ATGACGAAAG AGAACAGACT TTAAACCAAC TTCTTGTTGA GATGGACGGG TTTGGAACAA ATGAAGGAAT AATTGTAATG GCGGCAACAA ACAGACCTGA CATATTGGAC CCTGCACTTT TGCGACCTGG CAGATTTGAC AGGCAGATTG TTGTAAATGT TCCAGACGCA AAGGCAAGAG AAGAGATTTT AAAAGTCCAT GCACGAAACA AGCCTCTGGG TGAAGATGTT GATTTATCTC AAATAGCAAA GATAACAGCT GGGTTTACTG GTGCTGACCT TGAAAATCTT TTGAATGAGG CTGCACTTTT GGCAGCAAGG AAGGGTAAAA GACAGATTAA CATGGAAGAG GTTCAGGAAG CTGTAGCAAA GGTGTTGATG GGGCCTGAAA AAAGAAGCAG AGTTTATACT GAAAAAGAAA AGAAGCTTAC TGCATATCAT GAAGCAGGGC ATGCAATTGT TAGAACTATG ATTCCTGATT CTGAACCTGT TCATGAGGTT TCAATTATAC CAAGAGGGTA TGCCGGTGGG TACACTATGT ATCTTCCAAA GGAAGATAAG TTCTACGCAT CAAAATCTGA TATGATGAGA GAGATTGTAA CCCTTCTTGG TGGAAGAGTT GCAGAAAAGC TTGTTTTGGA AGATGTATCA ACAGGTGCAG CATCTGATAT AAAAAGAGCA ACCAAGATTG CAAGGGACAT GGTAACAAAA TATGGAATGT CTGACAAACT TGGTCCTATG ACCTTCGGAA CAGAGCAGGA AGAAGTGTTC TTAGGAAGAG ACCTTGCGCT TGCAAGGAAC TACTCAGAGG AAGTTGCTGC TGAAATAGAC AGGGAGATAA AAAGCATTAT TGAAGAGGCT TATAAAAAGG CCGAAGAGAT ACTAAAACAG AACATTGATA AGCTTCACAA GGTTGCAAAT GCACTTTTAG AAAAAGAAAA GCTCACGGGC GAAGAGTTCA GAAAACTTGT TTTTGAAGAT GCTCAGCCAC AGCTTGTTTA A
|
Protein sequence | MRNLFKTATI YILIALVILL LVDIFSGGLS YNQFFSNLSE RREVIYSELI NDINDGKVTR IVLSYNNVSG QYADGTKFDN VFVPSPDKFL DQIQPAIQAK KIQIVTKEPP QVPWWLSTFL PMLIFAGLMI FVWIFMLQQT QGGGSKIMSF TKSRAKTIQD LKKKVTFADV AGADEEKEEL KEVIDFLKNP RKYIELGARI PKGILLVGPP GTGKTLLAKA VAGEAGVPFF SISGSDFVEM FVGVGAARVR DLFDQAKRNA PCVVFIDEID AVGRHRGAGL GGGHDEREQT LNQLLVEMDG FGTNEGIIVM AATNRPDILD PALLRPGRFD RQIVVNVPDA KAREEILKVH ARNKPLGEDV DLSQIAKITA GFTGADLENL LNEAALLAAR KGKRQINMEE VQEAVAKVLM GPEKRSRVYT EKEKKLTAYH EAGHAIVRTM IPDSEPVHEV SIIPRGYAGG YTMYLPKEDK FYASKSDMMR EIVTLLGGRV AEKLVLEDVS TGAASDIKRA TKIARDMVTK YGMSDKLGPM TFGTEQEEVF LGRDLALARN YSEEVAAEID REIKSIIEEA YKKAEEILKQ NIDKLHKVAN ALLEKEKLTG EEFRKLVFED AQPQLV
|
| |