Gene Information |
Locus tag | Hhal_0121 |
Symbol | |
ID | 4710620 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 137582 |
End bp | 139702 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 639854579 |
Product | carboxyl-terminal protease |
Protein accession | YP_001001717 |
Protein GI | 121996930 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
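The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(137582, 139702)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed Gene Length (2121 bp) and Protein Length (706 aa).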
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCCCGTA GTCACCTGTT GTGGACCCCC GTACTGGCTC TCGGCCTGGC CTTTTCCTCC GTGGCACCAC TCCCCGGAGC GCCGGGTGCC GCCCCGGCGA CCGCGGGTGA GAATGGCGCA GTACCCGGAC AGACCGAATT CGAGAAGGCG CGCGTCGTCG GCGACCTGCT CCAGCGCTAC CACTACGGTG GCCCCGAGTC CGACGAGCAG CTCATGGAGC AGGCCACCGA GACGTACCTC AAGCAGCTCG ACCACGGCCG CTTCTTTCTT CTGAAGGAGG ATGTCGAGGC GTTCCGCGAG CGCATGAGCG AGCTCGACCC CGGCCAGGGC GACGCAATCC TGGAGGCCGC CTACGACCTC CACGCCCGCT ATCGCGACCG GGTCGCCGAG CAGACGGAAT TCGCTCTGGC CCTTCTCGAG GAAGGGTTCG GCTTCGACGG CGAAGGCCGC TTCGAACAAG ACCGCAGTGA AGCCGAATGG GCCGCCGACC GTGAGGCCCT CGACGAGCTC TGGCGGCAGC GCGTGACCCA CGACGCACTG ACCCTGGAGC TGGCCGAGCG CAGCACCGAG GAGATCCGCG ACAACCTCGA ACGGCGGTAC ACCACGCTGC GCGACCGGGC CGTGGACGCG GAAAACAAGG ACATCATGGA TCAGTTCCTC AGTGCCTGGG CGGCGGCCTA CGACCCGCAC AGCACCTTCC TCTCGCCGCA GCGCTCCGAG GAGTTCGACA TGCAGATGTC GCTGCAGCTC GAGGGGATCG GGGCCAAGCT GACCATGGAT CAGGACTTCA CCGAGATCGT CGAACTCATC CCCGGCGGAC CTGCCGAGCA GTCCGGTGAA TTGCGCGAGG GCGAGCGCAT CATCGGCGTC GCCGACGGCG ATGACGGGGA GATGAAAGAC GTCGTCGGGT GGCGGCTGGA CGAGATCGTG GACATGATTC GCGGCCCGAA GGAGTCCGTG GTCCGACTCA ACGTGCTGCC GCCCGCCGGC GCCAGCGAGA GTTCACCGCG GGAGGTACGC CTGGTCCGCA ACAAGGTCGA CCTGGAAGAC CAGGCCGCCC GTAAGGAGGT CATCGAGAAG ACCAACGCCG AGGGTGAGCA GAAGCGTATC GGCGTGATCA CGATCCCCAA GTTCTACCGC GACTTTGAGG CGGCCCACTC CGGGCAGGAC GACTTCCGCA GCACCACGCG GGACGTCGAG CGCCTGCTCG GCGAACTGCT CGAGGACGGG GTCGACGGAC TGCTGATTGA CCTGCGCGGC AACTCCGGCG GGGCCCTGCG CGAGGCCACC GCCCTGACCG CCCTGTTCAC CGGCGGCGGG CCGGCGGTGC AGGTCCGCGA CTCCCGCGGC CACCCGGAGC AAGTCGGCGA GTCCAGCGGC GATCCGGCTT ACGACGGCCC GTTGGGGGTG CTGGTGGATC GACGCAGCGC ATCGGCCTCC GAGATCTTCG CGGCCGCGAT CAAGGATTAC GGGCGCGGGA TCGTGCTCGG CGATCAGACC TTCGGCAAGG GCACGGTGCA GCAGATGATC GGCCTGGACA ATTACGCCAT CCCCGGAGAG GAGCGTTCCG GTCAGCTCAA GCTGACCCTC GCCCAGTTCT ACCGGGTGAC CGGAGAGAGC ACCCAGCTCG AGGGCGTAAA ACCGGATATC CACCTGCCGT CTGAGTTCAG CCACGAGGAG TTCGGCGAAC GGGCCACCCG GAATCCGCTG CCGGCCACCC AGATCGACGG GCTCGACATC ACCGTTCAGT ACGAGCTGGA GACCATCATC GACGAACTGG CCCGCCGGCA CGAGGCACGG 
ATGGAGGAGA CCGAGACCTT CCGGGCCCTG GAGCGAAAAT TGGAGGCCCA ACGGGAGATC CGCGAGGACA CCACGGTCGC CCTGAGCAAA ACGACCCGGC AGGAGGAGCA GAAGGCCCGC GAGGAACGGC TGCTGGAGCT GCACAACGAT CGGCGCCGAG CCCACGGCAA GGATCCGGTG GAGAGCTACG CCGACGTCGA CGCCGATGAC CTACCGGACG CCCTGCTCGA TGCCAGTGCG GCGATCATTG CCGACTTCGC ACAGCTCCTG CGGGAGGCCG GCGACGAGGT ACTCACCGCC GAGGCTCGCA AGGAAGGCTG A
|
Protein sequence | MARSHLLWTP VLALGLAFSS VAPLPGAPGA APATAGENGA VPGQTEFEKA RVVGDLLQRY HYGGPESDEQ LMEQATETYL KQLDHGRFFL LKEDVEAFRE RMSELDPGQG DAILEAAYDL HARYRDRVAE QTEFALALLE EGFGFDGEGR FEQDRSEAEW AADREALDEL WRQRVTHDAL TLELAERSTE EIRDNLERRY TTLRDRAVDA ENKDIMDQFL SAWAAAYDPH STFLSPQRSE EFDMQMSLQL EGIGAKLTMD QDFTEIVELI PGGPAEQSGE LREGERIIGV ADGDDGEMKD VVGWRLDEIV DMIRGPKESV VRLNVLPPAG ASESSPREVR LVRNKVDLED QAARKEVIEK TNAEGEQKRI GVITIPKFYR DFEAAHSGQD DFRSTTRDVE RLLGELLEDG VDGLLIDLRG NSGGALREAT ALTALFTGGG PAVQVRDSRG HPEQVGESSG DPAYDGPLGV LVDRRSASAS EIFAAAIKDY GRGIVLGDQT FGKGTVQQMI GLDNYAIPGE ERSGQLKLTL AQFYRVTGES TQLEGVKPDI HLPSEFSHEE FGERATRNPL PATQIDGLDI TVQYELETII DELARRHEAR MEETETFRAL ERKLEAQREI REDTTVALSK TTRQEEQKAR EERLLELHND RRRAHGKDPV ESYADVDADD LPDALLDASA AIIADFAQLL REAGDEVLTA EARKEG
|
| |
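The gene sequence begins with TTG, yet the protein sequence begins with M. This is the expected behavior under translation table 11, where an alternative start codon is read as methionine at the initiation position. The sketch below is a minimal illustration and not the pipeline that produced this record. The codon table uses the standard code's amino acid assignments, which table 11 shares. The start-codon set is what NCBI lists for table 11 as far as I know, so treat it as an assumption. The function name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard amino acid assignments in NCBI's TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumed initiation codons for table 11; each is translated as M at position 1.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS; whitespace in the input is ignored."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if not codons or codons[0] not in START_CODONS:
        raise ValueError("sequence does not begin with a recognized start codon")
    protein = ["M"]  # the initiation codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("TTGGCCCGTA GTCACCTGTT GTGGACCCCC"))
```

Applied to the first 30 bases of the gene sequence, this yields the first ten residues of the listed protein sequence.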