Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3029 |
Symbol | |
ID | 5734886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3824907 |
End bp | 3826862 |
Gene Length | 1956 bp |
Protein Length | 651 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280173 |
Product | ATP-dependent metalloprotease FtsH |
Protein accession | YP_001545795 |
Protein GI | 159899548 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0465] ATP-dependent Zn proteases |
TIGRFAM ID | [TIGR01241] ATP-dependent metalloprotease FtsH |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0253083 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGAAA ATCGCTTCTT ACGCAACAGT TTTGTGTATT TGATCATCGT AGTGGCGGCG TTGGCGCTCT TCTATCAATA CATCAATGGT AGTCCTGGCA ATACCAGCGA AAAGAGCTAC AGTGAAATTC TCGACTTAGC TCTCCAAGGT AAAATTGCCG AGATCGTCCA AACCGAAGGT CAAGTCGAGT TTCGGGCGAC CACCAACGAC GCGCCACCCA CCACCTATAT TTCTCGCAAG AACAGCACGA CCGAAAGTAT TGAAGAAATT CTTTCGCAAA AAGCCCAAGA TGCTAGCAAA TTAGCCGAGC TTAAAGATCC TGAGAATAAG ACGGCAATTG CAGCACCGTT GGAAAACGCT GCCAAAGTTA AGTTCAACCC TAAATCGGCT CCAGCTTGGG GCGGTATTTT AGGCGCGGCC TTGACCTTCT TGCTGCCCAC GCTTTTGCTG ATTGGCTTTT TCGTCTTTTT CATGCGCCAA GCCCAAGGCT CCAACAACCA AGCGATGTCG TTTGGCAAGA GCAAAGCTCG CATGTTCACT GGCGACAAAC CATCGGTAAC CTTCGCTGAT GTTGCTGGCC AAGAAGAAGC CAAACAAGAC TTAACCGAAG TTGTCGAGTT TCTTAAATTT CCTGAGAAAT TTGCCCAACT TGGTGCACGG ATTCCACGTG GGGTTTTGAT GGTTGGCCCT CCAGGTACTG GTAAAACTTT GCTCAGCCGT GCGGTTGCTG GCGAAGCAGG CGTGCCATTC TTCTCAATCA GCGGTTCGGA ATTCGTGGAA ATGTTCGTCG GGGTTGGCGC AAGCCGCGTG CGCGATTTGT TTGAACAAGC CAAACGCAAC GCTCCTTGTA TCGTCTTTAT CGACGAAGTT GATGCCGTTG GTCGCCAACG CGGTGCTGGC CTTGGTGGCT CGCACGACGA ACGCGAACAA ACGCTTAACC AAATCTTGGT TGAAATGGAC GGCTTCGATA GCAATACCAA TGTGATTGTG ATCGCCGCCA CCAATCGCCC TGACGTGCTC GACCCAGCCT TGGTTCGCCC AGGCCGTTTC GACCGCCAAG TTGTGCTTGA TGCTCCTGAT ATGCGCGGAC GGGTCGAAGT GCTCAAGGTC CATACCAAGG GCAAACCACT CTCCGAAGAT GTTAATTTGG AAGCAATTGC CAAATTAACG CCTGGCTCAT CGGGTGCAGA CCTCGCTAAC ATCGTCAACG AAGCTGCGAT CTTGGCTGCA CGGCGCTCGA AAAAACGCAT CGCCATGCAA GAAATGCAAG ATGCCACCGA ACGAATTATG CTTGGTGGCC CAGAACGCCG CTCACGCGTG ATGACTCCTA AGCAAAAAGA GTTGACTGCC TTCCACGAAG CTGGTCACGC GATTGTTGCT AAAGCGATGC CTGGCGCAAA CCCTGTGCAT AAAGTGACGA TTATTCCTCG TGGGATGGCT GGTGGCTACA CCTTGATGAT CCCCGATGAA GATCAAAGCT ATATGAGCGT TTCGCAGTTT GAAGCCCAAA TTGCTGTAGC CCTTGGTGGT CGCGCCGCCG AAGAATTGGT GTTGAGCGAC TTTACCACTG GCGCTTCGGG CGATATTCAA CAAGTGACGC GCATGGCACG AGCGATGGTT ACCCGCTATG GGATGAGTTC AGAGCTGGGG CCAATTGCCT TTGGCGAAAA AGAAGAGTTG ATCTTCTTGG GCCGCGAAAT CAGCGAACAA CGCAACTATT CGGAAGAAAC CTCACGCAAG ATCGATTCGG AAGTGCGGCG TTTGGTCAGC GAAGGCCACG AACGCGCCCG CGCAATCTTG GAACGCAACC GCGAGGTGAT GAACCGCATG GCCGAAGCCT TGATCGAGCA TGAAAACCTC GATGGTGAGC CATTGCGCCA ATTGCTCGAC GAAGTGATTA AGTATAACTC CAACAATGGG GTTTATAATG ACGCTTTGCC CAAGCAACGC ATCTAA
|
Protein sequence | MGENRFLRNS FVYLIIVVAA LALFYQYING SPGNTSEKSY SEILDLALQG KIAEIVQTEG QVEFRATTND APPTTYISRK NSTTESIEEI LSQKAQDASK LAELKDPENK TAIAAPLENA AKVKFNPKSA PAWGGILGAA LTFLLPTLLL IGFFVFFMRQ AQGSNNQAMS FGKSKARMFT GDKPSVTFAD VAGQEEAKQD LTEVVEFLKF PEKFAQLGAR IPRGVLMVGP PGTGKTLLSR AVAGEAGVPF FSISGSEFVE MFVGVGASRV RDLFEQAKRN APCIVFIDEV DAVGRQRGAG LGGSHDEREQ TLNQILVEMD GFDSNTNVIV IAATNRPDVL DPALVRPGRF DRQVVLDAPD MRGRVEVLKV HTKGKPLSED VNLEAIAKLT PGSSGADLAN IVNEAAILAA RRSKKRIAMQ EMQDATERIM LGGPERRSRV MTPKQKELTA FHEAGHAIVA KAMPGANPVH KVTIIPRGMA GGYTLMIPDE DQSYMSVSQF EAQIAVALGG RAAEELVLSD FTTGASGDIQ QVTRMARAMV TRYGMSSELG PIAFGEKEEL IFLGREISEQ RNYSEETSRK IDSEVRRLVS EGHERARAIL ERNREVMNRM AEALIEHENL DGEPLRQLLD EVIKYNSNNG VYNDALPKQR I
|
| |