Gene Information |
Locus tag | Hhal_2202 |
Symbol | |
ID | 4709550 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2416478 |
End bp | 2417938 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856677 |
Product | peptidase M48, Ste24p |
Protein accession | YP_001003768 |
Protein GI | 121998981 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
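The coordinate and length fields above are related by simple arithmetic. The minimal sketch below re-derives them, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The function names `gene_length` and `protein_length` are illustrative and are not part of the source database.

```python
# Values copied from the record above.
# Assumption: coordinates are 1-based and inclusive.
START_BP = 2416478
END_BP = 2417938


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with inclusive start and end coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1461
print(protein_length(length_bp))  # 486
```

Both results agree with the `Gene Length` and `Protein Length` rows of the record.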
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000223537 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTCGATC GCCTCCGCCG AACAGCCGCG GCCCTACTGA TCGCCGCGCT GACGCTCACC GCCCCGGCCC CGTCGCCGGT CCAGGCGGAG AGCGCCCAGC TGCCGCAGCT TGGCGTACCG GGGGCGGACG CCCTGCCCGT GCACAAGGAG CGCGAACTGG GGGCCAAAAT CATGCGCCAG GTCCGCCAGC ACCTACCCCT GCACGAGGAC CCAGAAACCA ATGAGTATCT CCAGAACCTC GGGCACCGCC TGGCGGCGCA CAGCAACGAG CCGGGATTCG GCTACAGTTT CTTCCTGGTG GAAGACGACC AGATCAACGC CTTCGCCCTG CCCGGCGGAT ACATCGGCCT CCACACCGGG CTGATCCGCG AAACCCGGAC GGAGAGCGAA CTCGCCGGCG TGCTCGCCCA CGAGATCGCC CACGTCACCC AGCGCCACAT CGCTCGGCAG TACGCTCAGT CGCAGCAACT CAACCTGCAG ACCGCCGCCG CCGTCCTGGC CGCCATCCTG ATCGGCTCGC AGAGCCCGCA GGCCGGCAGC GCCGCCGCCA TGGCGGGCAT CGCCGCGCCC ATCCAGCAGC AGCTGAGCCA CTCCCGCACC CACGAGCAGG AGGCGGACCG GGTCGGCCTC CACAACCTGG TGGCCGCCGG GCTCGACCCC TACGGCATGC CCGGGTTTTT CGAGCGCCTG GCCGACGCCT CCCGTTTTGC CGAGGATCCG CCGGAGTACC TGAGCACCCA CCCGCTCACC GAGCGCCGCC TGAACGAGGC CCAACGCCTG GCCGAACGCC TGGAGGGCGG CACCGTCTAC GAGAGCGGCC ATCACGCCTT CATTCGCGCC CGGCAGCAGG TCCTCACTAA TAGGGAGCGC AGCACCAGCG CAGTGGCGTT TATGCGGGAT CAGTTGCGGC GCAGCACCGA CGACCCCACC GAGCGTGCGG CCGCCCTCTA CGGCCTCGCC CTGGCGCTCT CTTGGGAAGA GGGGCAGCAC GCCCAGGCCC TGGCCCTGCT CAACACCCTG AGCGCCATCG AGGGGGAGCG CCTCTACGTC CTCCTCGGGC GCGGCGAGAT CTTGCGCGCC ATCGGCGACA CCGAGGAGGC ACTGGCCACC TACCGGGAGG CCCGCTCCCT GTACCCGGGT AGCTGGGCCG CCACCTACCG ACTGGCCGAG ACCCTGCTGG CCGACGACGA CGCCAAGGAG GCGCGCCGGG TGCTGGCCCG CGCGACCCGC GGGTCGTCCG GATCGCCTCA GCTCCTGCGG CTGCTGGCCG ACGCCGCCCA CGCCGCCGGC CGCGAGGCCG AGGGTTACAT CGCGCTGGCC GAACACTACC GGGGCCGCGG CGAACACCGA CTGGCCGTGG CGCAACTGAA CAATGCCATC CGCCACGCCG GCGAAGACCG CTACCAGCGC GCCCGCGCCG AGGCACTCAA GGCGCGCTGG ACGCAGCACG CCGCGGACTA G
|
Protein sequence | MLDRLRRTAA ALLIAALTLT APAPSPVQAE SAQLPQLGVP GADALPVHKE RELGAKIMRQ VRQHLPLHED PETNEYLQNL GHRLAAHSNE PGFGYSFFLV EDDQINAFAL PGGYIGLHTG LIRETRTESE LAGVLAHEIA HVTQRHIARQ YAQSQQLNLQ TAAAVLAAIL IGSQSPQAGS AAAMAGIAAP IQQQLSHSRT HEQEADRVGL HNLVAAGLDP YGMPGFFERL ADASRFAEDP PEYLSTHPLT ERRLNEAQRL AERLEGGTVY ESGHHAFIRA RQQVLTNRER STSAVAFMRD QLRRSTDDPT ERAAALYGLA LALSWEEGQH AQALALLNTL SAIEGERLYV LLGRGEILRA IGDTEEALAT YREARSLYPG SWAATYRLAE TLLADDDAKE ARRVLARATR GSSGSPQLLR LLADAAHAAG REAEGYIALA EHYRGRGEHR LAVAQLNNAI RHAGEDRYQR ARAEALKARW TQHAAD
|
| |
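The gene and protein sequences above can be cross-checked against each other. The sketch below is a hedged illustration, not the portal's own method: it strips the 10-base grouping spaces, translates with the amino acid assignments of NCBI translation table 11 (which are the same as those of the standard code), and computes GC content. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Table 11 also allows alternative start codons that are read as M, which this sketch does not handle; that is harmless here because the gene starts with ATG.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, listed in the
# conventional T, C, A, G codon order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the grouping whitespace used in the record and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence in the record.
fragment = "ATGCTCGATC GCCTCCGCCG AACAGCCGCG"
print(translate(fragment))  # MLDRLRRTAA
```

The fragment's translation matches the first ten residues of the protein sequence in the record. Passing the full `Gene sequence` value to `translate` and `gc_percent` would allow the same comparison against the `Protein sequence` and `GC content` fields.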