Gene Information |
Locus tag | HY04AAS1_0518 |
Symbol | |
ID | 6743314 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Hydrogenobaculum sp. Y04AAS1 |
Kingdom | Bacteria |
Replicon accession | NC_011126 |
Strand | + |
Start bp | 455434 |
End bp | 457344 |
Gene Length | 1911 bp |
Protein Length | 636 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 642750309 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_002121183 |
Protein GI | 195952893 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
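The coordinate and length fields in the record above are mutually consistent, which a little arithmetic can confirm. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the terminal stop codon (the record states neither convention explicitly). The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming exactly one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for locus tag HY04AAS1_0518.
length = gene_length_bp(455434, 457344)
print(length)                     # 1911
print(protein_length_aa(length))  # 636
```

Under these assumptions, 1911 bp corresponds to 637 codons, the last of which is the stop codon, leaving the 636 residues reported in the record.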
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000475149 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCAGACAG CTAAGAATAT ACTTATTTGG TTTTTAATTA TAGGTTTTAT GATAGTAGCT TTTAACTTGT TTGAGGAAAA AGGTTCATCC TCACCAAGTG CAACCCCTAT GTCTTTAACA ACATTGGTTG ACATGGTAAA ACAAAACAAA ATAGGTGAGG CTGATATCAA AGGCGACAAG ATAATAGCTT ATACCAAAGA CGGCCAGAAA GTAGAAACCT ATATTCCAAA AGGTTATACA TCTATTATAG ATGAAATGAT AAAAGATGGA GTAAAGGTAA AAGCTTCACC ATCTAGCGGA GGAGATATAT CCTCCAGTGG AAACTGGCTT GTATCAATGC TAATATCTTG GTTTCCGGTA CTTTTGTTTG CTGGTATTTG GATTTTAATG ATGAGACAAA TGGGAAACGG TGGACCCACA AGAGCGTTCT CTTTTGGAAA ATCTAAAGCG AAAGTGTATA TAGAAGAAAA ACCAAACGTA AAGCTAGACA ATGTAGCTGG TATGGATGAA GTTAAAGAAG AAGTGGCTGA AGTTATAGAA TATTTAAAAG ACCCGGCAAG ATTTAGAAAA TTAGGTGGTA GACCTCCAAA GGGAATACTG TTTTATGGAG AACCAGGCGT TGGAAAGACA CTTTTAGCTA AGGCTCTTGC AGGAGAAGCA CACGTACCTT TCATATCGGT ATCTGGATCA GATTTTGTTG AGATGTTTGT AGGTGTAGGT GCCGCTAGAA TGAGAGATAC GTTTGAAACA GCTCGTAAAA ATGCCCCTTG TATAGTATTT ATAGATGAGA TTGATGCGGT AGGAAGAAGC AGAGGGGCTA TCAACCTAGG TGGTAACGAC GAAAGAGAAC AAACATTAAA CCAGCTTCTG GTAGAAATGG ATGGTTTTGA CACTTCTGAA GGTATACTCA TAATAGCAGC TACAAATAGA CCAGATATTT TAGATCCAGC TCTTCTAAGA CCAGGAAGGT TTGATAGACA AATATTTATT CCAAAGCCAG ATGTAAAAGG AAGATACGAG ATATTAAAAG TACACGCTAA AAATAAACCG CTTGCAAAGG ATGTAGATTT AGAGCTTATA GCAAGAGCAA CGCCCGGATT TACAGGAGCT GACCTTGAGA ACATATTAAA CGAAGCAGCG CTTTTGGCAG CTAGAAAAAG AAAAGATCTT ATACATATGG AGGATTTGGA AGAAGCTATA GATAGAGTTA TGATGGGGTT GGAAAGAAGA GGTATGGCCA TATCGCCAAA AGAAAAAGAA AAAATAGCTG TTCATGAAGC AGGACACGCT TTAATGGGGC TTATGATGCC AGACGCAGAT CCTCTTCACA AAGTTTCAAT TATACCAAGA GGTATGGCTT TGGGAGTGAC TACTCAGCTT CCAATAGACG ATAAGCATAT TTACGATAAA GCAGATCTTC TTTCAAGAAT ACATATACTT ATGGGTGGAA GATGTGCGGA GGAAGTGTTT TACGGTAAGG ATGGTATAAC TACAGGAGCA GAAAATGACC TACAAAGGGC TACCGATTTA GCTTACCGTA TAGTGGCTAC TTGGGGAATG AGCGAGAATG TAGGACCAAT ATCTGTAAGA AGAAATATCA ATCCTTTCCT TGGGGGTTCT ACAGTCACGG AAGGAAGCCC AGATCTTCTA AAAGAGATAG ACAAAGAAGT GCAAAAACTC CTAGCATCTG CTTACGAAGA AACAAAAAGA GTTATAGCCG AGAACAAAGA AGCTTTAAGC AGTGTGGTAA AAAGGCTAAT AGAAAAAGAA ACCATAGATT GTAAAGAATT TGTGGAGATA 
CTTAGTTTAC ATGGCGTTGA GGTAAAAAAT GCTTGTAAAC AAGAGGAATC TTTGGAGAAA AAAGAAAATA ATGTAGAGCC TAAAATTGAT AAAAATGTTG TTAACGTATG A
|
Protein sequence | MQTAKNILIW FLIIGFMIVA FNLFEEKGSS SPSATPMSLT TLVDMVKQNK IGEADIKGDK IIAYTKDGQK VETYIPKGYT SIIDEMIKDG VKVKASPSSG GDISSSGNWL VSMLISWFPV LLFAGIWILM MRQMGNGGPT RAFSFGKSKA KVYIEEKPNV KLDNVAGMDE VKEEVAEVIE YLKDPARFRK LGGRPPKGIL FYGEPGVGKT LLAKALAGEA HVPFISVSGS DFVEMFVGVG AARMRDTFET ARKNAPCIVF IDEIDAVGRS RGAINLGGND EREQTLNQLL VEMDGFDTSE GILIIAATNR PDILDPALLR PGRFDRQIFI PKPDVKGRYE ILKVHAKNKP LAKDVDLELI ARATPGFTGA DLENILNEAA LLAARKRKDL IHMEDLEEAI DRVMMGLERR GMAISPKEKE KIAVHEAGHA LMGLMMPDAD PLHKVSIIPR GMALGVTTQL PIDDKHIYDK ADLLSRIHIL MGGRCAEEVF YGKDGITTGA ENDLQRATDL AYRIVATWGM SENVGPISVR RNINPFLGGS TVTEGSPDLL KEIDKEVQKL LASAYEETKR VIAENKEALS SVVKRLIEKE TIDCKEFVEI LSLHGVEVKN ACKQEESLEK KENNVEPKID KNVVNV
|
| |
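The gene sequence begins with TTG while the protein sequence begins with M. This is expected under translation table 11, where TTG is one of the alternative start codons and an initiating codon is read as methionine. The following is a minimal sketch of that translation, assuming the input is a complete CDS on the coding strand. `translate_cds` and the constants are illustrative names, not part of any library, and the codon assignments follow the NCBI definition of table 11.

```python
from itertools import product

BASES = "TCAG"
# Residues for the 64 codons in TCAG order. Table 11 assigns the same residues
# as the standard code and differs only in its set of start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace between 10-base blocks."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiating codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGCAGACAG CTAAGAATAT ACTTATTTGG"))  # MQTAKNILIW
```

The output matches the first ten residues of the protein sequence listed in the record. Passing the full 1911-base gene sequence should reproduce the 636-residue protein, since the final codon TGA is a stop.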