Gene Information |
Locus tag | Hhal_2368 |
Symbol | |
ID | 4709083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2601254 |
End bp | 2603200 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856843 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001003933 |
Protein GI | 121999146 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
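The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes that Start bp and End bp are 1-based inclusive positions and that the final codon is a stop codon that is not counted in Protein Length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 2601254
end_bp = 2603200

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1947 bp) and Protein Length (648 aa) fields of the record.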
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGGC CGAGCCGGTT GTCCCTGGCC CTTGGCGGCG CATTGCTGCT GGCCGGCCCC GGCCCGGTGA GTGCCCCCGA GGGCGCGCCG CCGAACTCCC CGTCGTCGTC CGCTGAATCA CCCCAAGAGC TGCTGGTACG GTTCGCGCCG GATGTGGCGG AGGGCGTCCG CACTGCCACC CACCGCGCCT ACGGCGGCCA CACCAAGCGT CGCCATGAGC GGGGTCGGTT CGAGGTCGTG CCGCTACCGC CTTACGCCGA CCGCGAGGAG GCCCTGCGCA AATACCGCGA CGATCCCCAC GTGGAGCACG CCGAGCCCAA CCGGTACGTG GAGCGCGTCG CGGCGCCTGC CGACGCCGCG AGCGATGCCA ACGGGTGGTG GCAGGAGCGC ATCCGCCTCA CCGAGCTGGA GACAGCAGAA CGCAAGGCCG GGGACACCAT CGTCGGCATC CTGGATACCG GCATCCAGTG CGACCACCCG GCCCTGGCCG ACAACACCTG GGACGACGGC GACGGTCAGT GCGGGAAGAA CTTCATCGAT CCGGACACCC CCCCGGACGA CGACTCGGAT CGGGGACACG GGACCCACGT GGCCGGCATC ATCGGCGCGA ACAGCGACGA GATGACCGGC GTCGCCCGAT CCGTCCAACT CCAGGCCCTG AAGTTCCTGG GGTCACTGGA CGACGGCACC CTCGCCGATG CCATCGAGGC CATCGACTAC GCCATCGAGC AGGGGACGGA CGTCCTCAAT GCCAGCTACG CCTACACGGC GAGCCGTACC GACGACGGCC CCCTGCCTAC CAGTTGCGCG GATCTCGCCG ATACCATGGA GGGGGCTTCG CGCCTGCATT GCGAGGCCGT TGCCGACGCC GGCGAGGCCG GGATCCTCTT CGTGGCAGCG GCACACAACT CCGGTAACGA CAACGACACC GGCACGGTCG CGCTGCCGGC AGGCTACCCG CTGGATAACG TGATCGCGGT GGCCGCCAGC AGGGAGACCG CGGCGGGCGA GCCGAGCGAC CAACTGGCCG ATTTCTCGAA CTTCGGGCGG CAAACGGTCC ACCTGGCAGC GCCCGGCGTG GGCATCCACA GCACCGTGGC CGGGGATGAC TACGACGAGC TTTCGGGCAC CTCCATGGCC ACCCCGATGG TCGCCGGGGT TGCGGCGCTG CTGCTCGATC AGGCCGGGTC GGAGGCCTCG CACCTCACGA TCCGTGAGCG GCTACTCGGG TCCGTGGCCT GCGACCGCGA CGCCGACCCG AGCCTGCCAC GCTGCGATGG GAACCCGCTC GGCGACAGCG CCGGCCAACA GCTGGCCGCC AAGACCCTGT CCGGGGGCCG GCTCGACGCC GCCGCAGCCC TGGGCGCTGA TCCGGATACG GTGCCCCCCG TTCCGCCGAG CCACGTCAGC ATCGAGACCG CCTCCGGCAT CCCGGAGCTG CGTTGGCTGT CCTCCAGCCC CACCGCCCGG GGCTACCGCA TCGAGCGTTT CGACACCGCC GAGGGGACTT TCCGGCATGT GGCCACCGTC GAGCCGGACC GGGATCGGTA CCGGGATCGC AGCGCCCCGA GCGATACCCT GCTCGGCTAC CGCATCCGCG CCCTGGGCAG CGACGGGCAA CCGAACTCGC GCTGGACGGA GGCCGGAACC GTCGAGACCG ACCAGGCCCT GAACCGAGTG ACGGAGCAGT TCGCAGGCTT CAGCAGTGCC GAGCGGGATG AGCGGTGTTT CATCGCCACG GCCGCCTACG GCTCCGAGCA GGCGCACCAG GTCGAGGCGC TGCGCGATTT CCGGGACGAC 
TACCTGATGC CCCACCCGCC CGGCCGGGCG CTGGTGGCGG CGTACTACGC GGTCAGCCCG CCCATCGCCG AGTGGGTCGC CGCCGAGGAA CACCGCCAGC GCTGGGTCCG GCGGCTCCTG GCCCTGTTGC CGACTCAAAA GGAGTAG
|
Protein sequence | MNRPSRLSLA LGGALLLAGP GPVSAPEGAP PNSPSSSAES PQELLVRFAP DVAEGVRTAT HRAYGGHTKR RHERGRFEVV PLPPYADREE ALRKYRDDPH VEHAEPNRYV ERVAAPADAA SDANGWWQER IRLTELETAE RKAGDTIVGI LDTGIQCDHP ALADNTWDDG DGQCGKNFID PDTPPDDDSD RGHGTHVAGI IGANSDEMTG VARSVQLQAL KFLGSLDDGT LADAIEAIDY AIEQGTDVLN ASYAYTASRT DDGPLPTSCA DLADTMEGAS RLHCEAVADA GEAGILFVAA AHNSGNDNDT GTVALPAGYP LDNVIAVAAS RETAAGEPSD QLADFSNFGR QTVHLAAPGV GIHSTVAGDD YDELSGTSMA TPMVAGVAAL LLDQAGSEAS HLTIRERLLG SVACDRDADP SLPRCDGNPL GDSAGQQLAA KTLSGGRLDA AAALGADPDT VPPVPPSHVS IETASGIPEL RWLSSSPTAR GYRIERFDTA EGTFRHVATV EPDRDRYRDR SAPSDTLLGY RIRALGSDGQ PNSRWTEAGT VETDQALNRV TEQFAGFSSA ERDERCFIAT AAYGSEQAHQ VEALRDFRDD YLMPHPPGRA LVAAYYAVSP PIAEWVAAEE HRQRWVRRLL ALLPTQKE
|
| |
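The relation between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (the two tables differ in which start codons are permitted, and that is not modeled here). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example, and only short fragments from the start and end of the gene sequence are used.

```python
from itertools import product

# Standard codon assignments, enumerated with bases in T, C, A, G order.
# Assumption: table 11 uses these same assignments for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 18 bases and last 15 bases of the gene sequence in the record.
head = "ATGAACCGGC CGAGCCGG"
tail = "ACTCAAAAGG AGTAG"

print(translate(head))
print(translate(tail))
```

The two fragments translate to the first six and last four residues of the protein sequence listed in the record. Passing the full gene sequence to `translate` would be the natural way to compare against the full protein sequence.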