Gene Information |
Locus tag | Hhal_1717 |
Symbol | |
ID | 4710221 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1882469 |
End bp | 1884874 |
Gene Length | 2406 bp |
Protein Length | 801 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856185 |
Product | peptidase S16, lon domain-containing protein |
Protein accession | YP_001003283 |
Protein GI | 121998496 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1067] Predicted ATP-dependent protease |
TIGRFAM ID | [TIGR00764] lon-related putative ATP-dependent protease |
| |
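The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it reproduces the listed gene length) and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table for Hhal_1717.
START_BP = 1882469
END_BP = 1884874
GENE_LENGTH_BP = 2406
PROTEIN_LENGTH_AA = 801


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Amino acids encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)   # True
```

Because the record lists the gene as unspliced, the whole span can be treated as one open reading frame: 2406 bp is 802 codons, and dropping the stop codon leaves 801 residues.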
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCTGCAC CCGAGCCCCT TTCACTGGAG CGCCTGTACC GGGTCTGTGA TCCGGAGCAG CTCGGTTTTC GCACCACCGA GGAGCTGGCG GGCATGGACC GCCCACCCGG GCAGGAGCGG GCCTTGGAGG CGATGGATCT GGGCGCGAAC ATGCGCGCCC CGGGCTTCAA CCTCTTCGTC ATGGGCCCGG AGGGCGACGG CAAGCTGGAG ATGGTCCAGC GCCTACTGGC CGAACGCGCC GCTCGCGAGC CGACGCCCTC GGACTGGTGC TACCTGAACA ACTTCGACGA GCCCACACAA CCCCGCCTTC TGCGGCTACC CCCGGGCCAA GGGGCACGCT GGCGCCACGA TCTGGAGCAG CTGATCGAGG AGCTGCGCAG CACCATCCCG GCCACCTTCG AGAGCGACGA GTACCAGAAC CGGCTGCAGG AGCTGCAGCA GCAGCTCAAC CGCCGCCAGC GTGAGGCCTT CGAGACCATC CAGAAGGAGG CCGAACAGTA CGACGTCACC CTGCTGCAGA CGCCCTCGGG ATTCAGCTTC GCCCCGGTGA AGGATGGCGA GGTGATCGAG CCGGAACAGT TCCAGCAGCT ACCCGATGAG GAGCGCAAGC GCTACCAGGA GGCCATCGAG TTCCTGCAGG AACGACTGCA GTCCGTGGTG CAGCAGATCC CCAAGTGGCG CAAGGAGATC CAGCAGCAGG TCCGCAAGCT CAACGAGGAG ATGACCCTGC TCGCGGTCGG TCAGCGCATC CAGGAGCTGC GCCAGCGCTA CGGCGAGCTA CCGGTGGCGG CGGCCCATCT GGACGCCATC CGCAACGACA TCATCGAGCA CGTGGACGCC TTCCGCTCCG GGGAGCAGGA CCACGTGGAG TACATCCTCG GCCGCTACCG GGCCAATCTA TTACTCGCCC ACGATCCAGC CGACGGCGCC CCGGTGGTCT ACGAGGACAT GCCCACCCAC CAGAGACTGG TGGGTCGAAC GGAGCACCAC GTCCATCAGG GCGCCCTGCT CACCGACTTC AATCTGATCC GCCCTGGCTC GCTGCATCAG GCCAATGGCG GTTACCTGGT CGTGGACGCC CACCGCATCC TCACCCAGCC ACTGGCCTGG CCGTCGCTCA AGCGCACCTT GTCTGCCGGA GAGATCCGCA TCGAATCCCT GGAGCAGGTC CACGGCTTCT GGACCACCGT CACCCTGGAG CCGGAGCCGA TGCCGCTGCG TACCAAGGTG GTGCTGCTCG GCGACCGGAT GGTCTACTAC CTGCTCTCCG CCTACGACCC GGACTTCCCG GAACTGTTCA AGGTTGAGGC CGACCTGGAG GACGACCTCC CCCGGGACAC CGAGACCCAG CAGCTCTACG CCCGCATGCT CGCCACCCTG GTTCGGCAGC GCCGTCTGCG CCACCTGGAC CGCTTCGCCG TGGCGCGGGT GATCGAGCAC GGCAGCCGCA TGGCCGATGA CAGCGAGCGG CTGGCCGCCG GCGGGCGGGC CATCACCGAT TTACTGCAGG AGGCGGATCA CTACGCCACC GGCGACGGCG CCGAGATCAT CGGCCAGGAT CACATCGAGC GCGCCCTCGC CGCCCAAGAG CGCCGCGCCG GGCGCATCCG CGATCGCAGC CAGGAGACCA TCGAGCGCGG CACCCTGGTG ATCCACACCG AGGGGCACCA CACCGCCTCG GTCAACGGGC TCTCGGTCCT GCAGCTGGGC GATTTCGGTT TCGGCCGTCC GACACGGATC ACCGCCACCG CCCGCCCCGG GCGCGGGCAG CTGGTGGATA TCGAGCGCGA GGCCAAGCTC 
GGTGGCAAGA TCCACTCCAA GGGGGTGATG ATCCTCTCGC GCTTTCTGGC CAGCCGCTTC GCCCCGGAGG GCGACCTGTC GCTGTCGGCA AGCCTCGCCT TCGAGCAATC CTACGGCGGA ATCGACGGCG ACAGCGCGTC GGTGGCCGAG CTCTGTGCAC TCTTCTCGGC CATCGGCCGC GTCCCACTGG ATCACGGCAT CGCCGTCACC GGCTCGCTGA ACCAGCTTGG CGAGGTCCAG GCCGTGGGCG GGGTGAACGA GAAGATTGAG GGCTTCTTCG AGGTCTGTCG GCGGCGCGGG CTGACCGGGC AGCAGGGCGT GGCGCTGCCG GCAACCAACG TGCCGCATCT GATGCTGCGC CAGGAGGTGC GCGATGCGGT GGCCGCCGGG CAGTTCCACA TTTACCCGCT GAGCCGCGTG GACGAGGCCC TGGAGCTGCT CACCGGTCTA CCCGCCGGTG TCTGCGACGA CGCCGGCGAG TACCCGCAGG GGTCGGTGAA CCGCGCTGTC GCCGACCGCC TGGTGCAGTT CGCCAAGAGC CAGCGTCGCC GCGGCGACGG CGACGCCGGA GACACCCCGG ACACCGCGGA GGATGACGAT GACTGA
|
Protein sequence | MAAPEPLSLE RLYRVCDPEQ LGFRTTEELA GMDRPPGQER ALEAMDLGAN MRAPGFNLFV MGPEGDGKLE MVQRLLAERA AREPTPSDWC YLNNFDEPTQ PRLLRLPPGQ GARWRHDLEQ LIEELRSTIP ATFESDEYQN RLQELQQQLN RRQREAFETI QKEAEQYDVT LLQTPSGFSF APVKDGEVIE PEQFQQLPDE ERKRYQEAIE FLQERLQSVV QQIPKWRKEI QQQVRKLNEE MTLLAVGQRI QELRQRYGEL PVAAAHLDAI RNDIIEHVDA FRSGEQDHVE YILGRYRANL LLAHDPADGA PVVYEDMPTH QRLVGRTEHH VHQGALLTDF NLIRPGSLHQ ANGGYLVVDA HRILTQPLAW PSLKRTLSAG EIRIESLEQV HGFWTTVTLE PEPMPLRTKV VLLGDRMVYY LLSAYDPDFP ELFKVEADLE DDLPRDTETQ QLYARMLATL VRQRRLRHLD RFAVARVIEH GSRMADDSER LAAGGRAITD LLQEADHYAT GDGAEIIGQD HIERALAAQE RRAGRIRDRS QETIERGTLV IHTEGHHTAS VNGLSVLQLG DFGFGRPTRI TATARPGRGQ LVDIEREAKL GGKIHSKGVM ILSRFLASRF APEGDLSLSA SLAFEQSYGG IDGDSASVAE LCALFSAIGR VPLDHGIAVT GSLNQLGEVQ AVGGVNEKIE GFFEVCRRRG LTGQQGVALP ATNVPHLMLR QEVRDAVAAG QFHIYPLSRV DEALELLTGL PAGVCDDAGE YPQGSVNRAV ADRLVQFAKS QRRRGDGDAG DTPDTAEDDD D
|
| |
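The GC content and protein sequence fields can be recomputed from the gene sequence. The following is a minimal sketch under stated assumptions, not the method used to build this record: it strips the whitespace that groups the sequence into blocks of ten, counts G and C bases, and translates codon by codon. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it allows; since this gene begins with ATG, the sketch does not handle alternative start codons. The names `clean`, `gc_content`, and `translate` are illustrative.

```python
# Codon table in the conventional TCAG ordering; the codon-to-amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove spaces and line breaks and convert to upper case."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGCTGCAC CCGAGCCCCT TTCACTGGAG"))  # MAAPEPLSLE
```

Passing the full gene sequence to `gc_content` and `translate` gives values to compare against the GC content and protein sequence fields of this record.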