Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0147 |
Symbol | |
ID | 4710700 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 167453 |
End bp | 171034 |
Gene Length | 3582 bp |
Protein Length | 1193 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639854605 |
Product | peptidase S41 |
Protein accession | YP_001001743 |
Protein GI | 121996956 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0793] Periplasmic protease |
TIGRFAM ID | [TIGR00225] C-terminal peptidase (prc) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTAGCG TCATCCCGTT CCTGCTCGTG TTGCTCGCAG GCCTTCTCGG GTTGGTCGGC CCCTTGACGG TGGCGGCGGC TGACGAGGGT GAAACGGCCG TCGAGCTGCC CCGTTTCCCG TCCATCAGCC CGGATGGAGA GAAGCTGGTC TTCAGCTGGG GGGGCGACCT CTGGCGGGTC CCCGCCGAGG GTGGCACAGC GACCCGGCTG ACCGGCCATC GTCTGGACGA TCTCCACTCC GCCTGGTCCG GCGACGGGGA TACGCTGGTG TTCTCCTCCT TGCGGGACGG TTACCTCAAC CTCTGGCGCA TGCGGCCGGA CGGGACGGGA CTCAGGCAGG TCACTTACGG CGATCGTTTC CTGCGCCACC CCGGTTTCGG AGTGGGCCGC GATGGCGAGC CGCTGGTGAC CTTCTCGGCG CAGCTTGAGG CGGATGTCTA CCGGGACCAG CGCCCCTACG GCGTCGCGCC GGGGGGCGGC GAGCCGCAGC GCCTGCATGA CGCCTTTGGC TCCGAGCCCA GCACCTCCCC CGATGGGCGA TACGTAGCCT TCACCCGTGG CGGGGCCTAT CACGGTTGGA GTCGCCGCCA CTACCACGGG CCCGATGCCC GCGATGTGTG GCTCTACGAC CGCGACGAGG AGGCGTTCTC CCCGTTGACC CAGCGTCGTG CCGGGGACGA CGGCCAGGCA CAGTGGTTGG ATGAGCGGAC CCTGATCTTC CTCTCGGACC GTGAGGACAA CACCGTGAAC CTCTTCCGCG CGAGGGTGGA GGGGGCGTCC GGTGGCGTGG AGCGCCTGAC CCACTTCGAT GAGCGGGATG TACAGGCGTT CGATGTGGCG CCCGAAGCGG GTCGGGCGGT GTTGCAGGTC TGGGATCGGC TCTACCTTCT CGATCTGGAT AGGCTCGATG CCGAGCCGGA GCCGGTGGCG TTGCGGGCCG GCGACCCGGG GCACGACCGC TACGAGCTGC GCTCGGTGGA CCGCCAGGTC AGTGAGGCCG CGCTGAGCCC CGACGGCGAA GTGATGGCGT ACATCGCCTA CGGCCGGGTC TACGTGCGAC ATATGGATGA GCACAGTCCG ACTCGCCCCG TGACCCCGGA TACCCACGCC CGTCATCGGG ACCTGCGCTG GTCGCCGGAT GGTCTGCGGT TGTACTTCAC CCGGGATGCC GACGGCACTG AATCGATCTA CACCGCCGGG GTGGCCCTGA CCCGCGAGGA GGTACGCCGC GGCTGGGTGG CCTCCGGGGA AACGGGGGGT GCACCGCGCC CGGCGCGACC CGAGACCGGG GCGGACGCGT CAGAGGGGCA GGAGTCAAGC GGCGGGGATG GCGCAGCCGC CGACCCGGAC GACCCCTTCG CCCCCACCGA CCCCATCGAG CCGCCGTCGG ATCCCGACCC CGGCGACCCG GGTCTCGAGC CCGCCCTGAC GGAACCGGAT ACGCCGCCGG AGCCCGATCC GCAGCCGGCC CCGGACCCGG AGCCCGTGCC GGAGGAAGCC GATCCGGTGG CACCGGCCGC GGATCCGGAT CGCTGGCACG ACGCCCTGCA GTTCACTCTG CGTCCTCTGG TGCAGAGCGA GGAGCACGAG CGCGATGCCA GGCCGTCCCC GGACGGTACC CGGATCGCAT TCCGCCGCGG GCGCGGCGAT CTGGTGGTCC AAGACCTCGA CGGGGGCGGC GAGCGCACCC TGGTCGAGGG GTGGGACGCC ACAATCGACT GGCGCTGGTC GCCCGACGGC CGCCACATCG CCTATACGCA GAACGACCTG GATTTCAGCG CCAACGTCTT CATCGTCCCT GCGGACGGGT CGGCGGAGCC GGTCAACATC ACCCGTCACC CACGCAACGA TCTCAATCCG CACTGGTCAG CCGACGGTCG GGTGCTGACC TTCATCTCCA ACCGCAGCGG GGATAGCTAC GATCTCTACC GGGTCTACCT GGATCCGGAG CTGGCGCGTT ACAGCCGCCT GGAGCTTGAC CGCTACTATC GCCAGGGGCG CGAAGCGGCG GAGCGGCGTG AGCCGCTGCC GGTGCTCACC CCGGGTGCCA CGGATGTGAC TCGGACGGAG ACGGTGACCC AGGAAGAGCC GCCGGAGCTG GAGCTCGAGG GTGCTTGGCG GCGCGTGGAG CGAGTCAGCA GCGCGGCCGG CAACGAGTAC GCGAGCCGGA TGACCCCGGC CGGCGACCGC TACGTCTTCA ATAGCGCCGG CGAGGGGCTG ATGGTGATGA ACTGGGACGG CAGCGAGCGG CGCCGCCTGG GCCCGGTGGC CGATATCCAG CATCTGACCC TGCGCGGCGA TCGGGTGGTC TACGTCGCCG GAGGGCGGGC CGGGGTGGTG CGGCTCGGCG GTGGCGAGCA TCGGCGCCCT GACATCAGCG ACCGGATCCG CATCGACCGA AAGGCCCAGG GCGTGCAGAA GTTCCGCGAG GCGGCGCGCA TCATCGAAGA GGGGTTCTAT CGGCCGGACC TCAAGGGCCT GGACTGGGAG GCGCTGGTAT CCGATTACGA GGCCTTGATC CGCCGGGCGC GGACGGCCAG CGAATTCAGC GACATCGCCA ATCGCCTCAT GGGCGAACTG GCCGCCTCAC ACACCGGGGT GAGCAATCCC GGCCCCGGGT CGGCCCTGCG CGAGCCCTCC GGGCGGCTGG GGATTCGACA TGAGCGCGTC GAACTGGCAG ACGGGCGCCC GGGATACCGG GTCGAGTCGG TGGTTCCCAA CGGGCCGGCG GCCCACGGGC CTATGCCCCT GCAGAGCGGC GATACCATCG TGGCCATCGA CGGCCGCGGA ATCGACCGCG ATGAGACCCT GCTCCAGCGC CTGCGGGGCC GGGTCGGTGA TGAGCTGCTC ATCGCCTTCC GTCGGCCCGA ACCGGAGGGG GACGGCAGGC AGCTTTTGCA CACCCTGGTA ACGCCCGTCG ATTATCGGGG TATGGCCGAG CTGCGCTACG ACGCGTTCCG GGAGGAGCGG CGCCGGCTGG TGGACGAGCG CTCCGATGGC CGGCTCGGGT ACATCCATAT CCAGGCGATG AACCAGGCCT CCCTGGAGGC CTTCCAGGGC AGTCTCTACG CGGCGGCGGA GGGCAAGGAG GGGCTGATCA TCGACGTCCG CAACAACGGC GGCGGGCATA CCACCGATCG GATCCTCACC TCGATCATGG CGGCCGAGCA CGCCTACACC ATCCCCGCGG GCGCCGACCC CTCGCGCACT GGTCACTACC CGCAGGACCG CCTGGATGCC CCGCGCTATA CGCTGCCCAT CAACATGGTG GCCAACGAGA AGAGCTACTC CAACGCCGAG ATCCTGGCGC ACGCCTTCAA TACCCTCGAA CGGGGCACCC TGGTGGGCGA ACAGACCTAC GGTGGCGTGA TCTCCACCGG TCGGCACGCC CTGATCGACG GTGCCACCGT GCGCCGTCCC TTCCGCGGCT GGTATCTGCC GGACGGCACC GACATGGAGC ACCACGGTGC CGAGCCGGAT ATCCGGGTGC GCCAGCGCCC GGAGGATGAA GTGGCGGGCC GCGATCGGCA GCTCGAGGCG GCGGTGGACG ACATGCTCGA GCAGCTCGAC GACCGGGAGT GA
|
Protein sequence | MRSVIPFLLV LLAGLLGLVG PLTVAAADEG ETAVELPRFP SISPDGEKLV FSWGGDLWRV PAEGGTATRL TGHRLDDLHS AWSGDGDTLV FSSLRDGYLN LWRMRPDGTG LRQVTYGDRF LRHPGFGVGR DGEPLVTFSA QLEADVYRDQ RPYGVAPGGG EPQRLHDAFG SEPSTSPDGR YVAFTRGGAY HGWSRRHYHG PDARDVWLYD RDEEAFSPLT QRRAGDDGQA QWLDERTLIF LSDREDNTVN LFRARVEGAS GGVERLTHFD ERDVQAFDVA PEAGRAVLQV WDRLYLLDLD RLDAEPEPVA LRAGDPGHDR YELRSVDRQV SEAALSPDGE VMAYIAYGRV YVRHMDEHSP TRPVTPDTHA RHRDLRWSPD GLRLYFTRDA DGTESIYTAG VALTREEVRR GWVASGETGG APRPARPETG ADASEGQESS GGDGAAADPD DPFAPTDPIE PPSDPDPGDP GLEPALTEPD TPPEPDPQPA PDPEPVPEEA DPVAPAADPD RWHDALQFTL RPLVQSEEHE RDARPSPDGT RIAFRRGRGD LVVQDLDGGG ERTLVEGWDA TIDWRWSPDG RHIAYTQNDL DFSANVFIVP ADGSAEPVNI TRHPRNDLNP HWSADGRVLT FISNRSGDSY DLYRVYLDPE LARYSRLELD RYYRQGREAA ERREPLPVLT PGATDVTRTE TVTQEEPPEL ELEGAWRRVE RVSSAAGNEY ASRMTPAGDR YVFNSAGEGL MVMNWDGSER RRLGPVADIQ HLTLRGDRVV YVAGGRAGVV RLGGGEHRRP DISDRIRIDR KAQGVQKFRE AARIIEEGFY RPDLKGLDWE ALVSDYEALI RRARTASEFS DIANRLMGEL AASHTGVSNP GPGSALREPS GRLGIRHERV ELADGRPGYR VESVVPNGPA AHGPMPLQSG DTIVAIDGRG IDRDETLLQR LRGRVGDELL IAFRRPEPEG DGRQLLHTLV TPVDYRGMAE LRYDAFREER RRLVDERSDG RLGYIHIQAM NQASLEAFQG SLYAAAEGKE GLIIDVRNNG GGHTTDRILT SIMAAEHAYT IPAGADPSRT GHYPQDRLDA PRYTLPINMV ANEKSYSNAE ILAHAFNTLE RGTLVGEQTY GGVISTGRHA LIDGATVRRP FRGWYLPDGT DMEHHGAEPD IRVRQRPEDE VAGRDRQLEA AVDDMLEQLD DRE
|
| |