Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0821 |
Symbol | |
ID | 4709097 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 902573 |
End bp | 903871 |
Gene Length | 1299 bp |
Protein Length | 432 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 639855280 |
Product | peptidase M48, Ste24p |
Protein accession | YP_001002399 |
Protein GI | 121997612 |
COG category | [R] General function prediction only |
COG ID | [COG4783] Putative Zn-dependent protease, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.371859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGGCA GACGGGCGTT TCAATGGCTG ATCCCGGCGG TGCTATTGGC GCTGAGCGTG GCTAGCTGCG CCACCAACCC GGTCACCGGC GAGCGGGAAC TGCGGCTGAT CTCCGAGGGC GAGGAGGTCG CCATGGGCGA GCAGCACTAC GAGCCCACCC TGCAGAGCAT GGGCGGACGC TACAACGCCG ACCCGGACCT GGTCGCCTAC GTCGATGAGG TCGGCCAGCG GGTGGCCGCC GAGAGCCACC GCCCGGGGCT GCCCTACGAG TTCGTGGTGC TCAACGACGG CACCCCCAAC GCCTGGGCCC TGCCCGGCGG TAAGATCGCC ATCAACCGTG GCCTGCTCAC CGAGATGGAG AATGAGGCCG AGCTGGCCGC GGTGCTTGGC CACGAGATCG TCCACTCCGC CGCCCGCCAC GGCGCCCAGC GCGTCGAGCG CGGGATGATG ATGCAGGCCG GGGTGGCCAC CGTCGGCCTG GCCACCCAGG ACCACCAGCT CTCCGGACTG CTGGTGGCCG GGGCCAGCGT CGGTGTGGGC TTGATCAGTC AGCGCTACTC GCGGCAGGCG GAGCTAGAGG CGGACGACTA CGGCACCCGC TACATGGCCC AGGCCGGCTA CGACCCCGAG GCCGCCGTCA CCCTGCAGGA GAAGTTCGTG CGCCTGGCCG GGGGCGGGGA GTCGAGCTGG CTCGAGGGGC TGTTCGCCAG CCACCCGCCG TCCCGGGAGC GCGTGCGCGC CAACCGCGAG ACCGCCCAGA CCCTGCGCGA GGAGCTCGGC GGCGAAGACT GGACCCTGGG CGAGGAACGC TACGCCCGGC ACATGCGGGT CCTGGAGGAG AACCGGGAGG CCTACGCCCA GCTGGATGAG GCGCAGCAGG CGCTACGCGC CAAGGAGCCC GAGCGGGCCC TGGAGCTGGC CGACGCGGCC ATCGACGCCT ATCCCGAGGA GGCCGCCTTC CACGCCGTCC GCGGCCAGGC CCTGGCCCGC ATGGGCGAGG AGGCATCGGC CATCGCCGCC CTGGATGCCG CCATTGAGCG CAACGACGGC TACTTCAGCT ACCACCTCGA CCGCGGCCTG CTGCACCGGG CCCGCGGCGA CGACGAGCGC GCCCGCACGG ACCTGGAGCG CTCGGCCAGC CTGCTGCCCA CCGCGCCCGC CCACCTGGCC CTGGGCCAGC TGGCCGAGGC CGACGGCGCC CGGGCGGACG CCATCGGCCA CTACGAGAAG GCGGCCAGTG CGGAAGGCTT CTTCGGGGAG CGGGCGCGGG AGGCGCTCAG CCGTCTGCAG GACGGCTGA
|
Protein sequence | MDGRRAFQWL IPAVLLALSV ASCATNPVTG ERELRLISEG EEVAMGEQHY EPTLQSMGGR YNADPDLVAY VDEVGQRVAA ESHRPGLPYE FVVLNDGTPN AWALPGGKIA INRGLLTEME NEAELAAVLG HEIVHSAARH GAQRVERGMM MQAGVATVGL ATQDHQLSGL LVAGASVGVG LISQRYSRQA ELEADDYGTR YMAQAGYDPE AAVTLQEKFV RLAGGGESSW LEGLFASHPP SRERVRANRE TAQTLREELG GEDWTLGEER YARHMRVLEE NREAYAQLDE AQQALRAKEP ERALELADAA IDAYPEEAAF HAVRGQALAR MGEEASAIAA LDAAIERNDG YFSYHLDRGL LHRARGDDER ARTDLERSAS LLPTAPAHLA LGQLAEADGA RADAIGHYEK AASAEGFFGE RAREALSRLQ DG
|
| |