Gene Information |
Locus tag | Hhal_2110 |
Symbol | |
ID | 4710050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2314790 |
End bp | 2316043 |
Gene Length | 1254 bp |
Protein Length | 417 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856584 |
Product | peptidase S1 and S6, chymotrypsin/Hap |
Protein accession | YP_001003676 |
Protein GI | 121998889 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR02037] periplasmic serine protease, Do/DegQ family; [TIGR02038] periplasmic serine peptidase DegS |
|
|
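The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; neither is stated in the record itself. The function names are illustrative and do not belong to any database API.

```python
# Consistency check for the coordinate and length fields of this record.
# Assumption: Start bp / End bp are 1-based and inclusive.
# Assumption: the final codon is a stop codon and is not counted in the protein.

def span_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive coordinate range."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(2314790, 2316043)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 1254 417
```

Under these assumptions the result agrees with the Gene Length (1254 bp) and Protein Length (417 aa) fields.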
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0560997 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGCTAG CAGGACGATC CGGTTGGCGG CCATTGGGCC TGGAGGGCAC CGAGTGGATT CGGTTTCTCC TCGGATACAC CGGGCTGGGC GTGGTGTTGG CCCTGGTCAT CGTCTGGATC AACCCTGACC TGCTGGGCCC CCTGACGCCC CGGGTCGAGA TCACCGAGAG TGAAGATTGC CAGACCGTCG CCCCGGGGCG ACAAAACGAC GCAGCCGATG CGCCGCCTCG ACGGGAACCG GTCTCCTACG CCGACGCGGT GGAACGGGCC GCGCCGGCGG TGGTGAACAT CTTCACGGTC AAGCAGGTCA CCGAACAGCT CACCCCGCCC GGATTCGACG ATCCCCTATT CCGCCGCTTC TTCGGCGACC CTCCGACCCG CGAGCGCCAA CGCACCGAGA CCAGCCTCGG CTCGGGGGTC ATCGTCGCCG AGGAAGGCTA TGTGGTCACC AACCACCACG TCATCGACGA CGCCGACCAG ATCCAGGTAC TGCTGGCCGA CGGTCGCCAG AGGGCGGCCA CGGTGGTGGG GCGCGACCCG GAGACGGACC TGGCAGTGCT GCGCATCGAG GCCGAACGGC TGCCAGTGAT CACCTTCGCC CGGGACGAGC GGGTGCGGGT GGGCGACGTG GTGCTGGCCA TCGGCAACCC GTTCGGGGTC GGACAGACGG TCACCCAGGG GATCATCAGC GCCACCGGGC GCGACCAGCT GGGGCTGTCG ACCTTCGAGA ACTTCCTGCA GACCGATGCC GCCATCAACC CGGGCAATTC CGGTGGCGCC CTGATCGACG CCGAGGGGCG GCTGGTGGGG ATCAATACCG CCATCTTCAG CGGCACCGGC GGCTCACAGG GGATCGGCTT CGCCATCCCG GCGGGCATTG CCCAGGCGGT GATGTCCGAC CTGATCCAGT ACGGACGCGT GGTGCGCGGC TGGCTGGGGG TACAGGCACA GCGGCTGACG CCGGCCCTCG CCGAGTCCTT CGGCCACCCC CCGGATACCG AGGGCGTAGC GGTCACCCAC ATCCTGCCCC GCGGACCGGC GGACCAGGCC GGCCTGGAGG CCGGCGACAT CATCGTCGAG CTGGGCGGTC AGCGCATCCG CGACGTCCAG GATCTGCTGC AGGTGGCCAG CGCGGCGGCA CCGGGCACCG AGATGGAGAT CTCCGGCTAC CGCGATCAGG AGCCTTTCTC AACCTCCGTC ACCCTCGGTG AGCGGCCGGA CATGCAGCAG CCCAGCCACC CCGGGCGGCG CTAA
|
Protein sequence | MRLAGRSGWR PLGLEGTEWI RFLLGYTGLG VVLALVIVWI NPDLLGPLTP RVEITESEDC QTVAPGRQND AADAPPRREP VSYADAVERA APAVVNIFTV KQVTEQLTPP GFDDPLFRRF FGDPPTRERQ RTETSLGSGV IVAEEGYVVT NHHVIDDADQ IQVLLADGRQ RAATVVGRDP ETDLAVLRIE AERLPVITFA RDERVRVGDV VLAIGNPFGV GQTVTQGIIS ATGRDQLGLS TFENFLQTDA AINPGNSGGA LIDAEGRLVG INTAIFSGTG GSQGIGFAIP AGIAQAVMSD LIQYGRVVRG WLGVQAQRLT PALAESFGHP PDTEGVAVTH ILPRGPADQA GLEAGDIIVE LGGQRIRDVQ DLLQVASAAA PGTEMEISGY RDQEPFSTSV TLGERPDMQQ PSHPGRR
|
| |