Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4417 |
Symbol | |
ID | 5736268 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5639571 |
End bp | 5646119 |
Gene Length | 6549 bp |
Protein Length | 2182 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641281580 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001547177 |
Protein GI | 159900930 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAAAT TCCCCTTCAT TCGGCAAGCA GCCCTCATTG CAATGACGCT CTCATTGGGA CTTCAGGGTG CGGCGCAACC AGCCCGTGCC GAAGTAATGC GTGCTCCGAC TGCGACGACC AAGTTCCTCG TTGAACCAAA CAGCCCAGCC AACAAATTTG TCCGCACGAA TGGCACATTG ATTGCCGATT ATGGTTCTTT TTCAGTCTGG CAAGTTGCCA ATAACAACAC CAGCAGTCTT AGTCAATTGG CTGGTGCACA AGCCGCCGAT TTGGATACGA TTGGCTTACG GGGAATCAGC TTCAATCCTT TAAAGAGCGT TCCCACGCCG CAACCAAGCC TGAGCCAAAG CCCAACTGCC GACCAACAAT TGTGGTTGGT GCAATTTGCG GGGCCGCTCA AGGATGCTTG GTTGGATGAA CTAACCAAAG CTGGCGCGGA ACGCGTCATA TATATGCCCA ACAACGCTTA CCTCGTTTGG GCAAATGGCC AAACTGTGGC CAAACTTGAT GCACTCAGCA AGAGCAATGG GGCGATTCAA TGGGCTGGTG CCTACCACCC CGAATATCGC TTAGCTCCTG AGTTCCGCCA AAAAGCCAGC AACCCTGATG CGAAGGGGCT GGTCGATGTT ACGATTCAGT TCTACAACAG TGCTACGGTT GAACGTGATG TGCGCGAAGT GCTTGATGCC AGCGCCCAAG TGTACGCGAC CCCATGGCAG GTGCTCAATT TCACTACAAT TTCGGTGCAA GTCAGCGAAG CCGCTTTGGC TCCAATTGCT CGCCGCGCCA ATGTCTACAA CATTGAAGCG TGGGAAGCTC CTCAAAAAAT GGATGAGCGC CAAGGCCAAA TTATTGCTGG TAACGTAACC ACCGCTGGTG GCAAAGTGGT GCCGAACGGC CCAGGCTACT TAAGCTGGTT GCAAAGCCAA GGCGTGCCAA CCGACCCCAA CCAATATCCA ATTGTCGATG TTGTCGATGA TGGTCTGGAT AATGGGACAA CCTCACCATT GCATCCTGAC TTCTATGTCA ATGGGGTTCG GCCTGGTACT TCACGGATTA CAGCTGTTGG TAACTGTACT GCCGATGCCA CTGGTAATGG TCAGGCAGGC CACGGTAACC TGAATATTGG GATCGTTGGT AGCTACAACA ACCTGACTGG CTCGCCCCAC GTTGACGGTG CTAGCTCAGG GATTGCTGGC GGCTATCGCG TTGGCTTAGG CATTTCGCCA TTCAGCCGAA TTGCCAGCAC CAAGATCTTC AACAATGCTG GGAGCTTTGA TCTGACCAAT TGTGGTGGCG GTAGCAACTA CATTGGAATT ATTGCAATGG CCTACAGCCA TGGCGCTGCC ATCTCGTCCA ACAGTTGGGG TTCAAACAGT GGTGGTGCCT ATACCGCTTC ATCACAAGCC TTCGACCAAG GAACCCGCGA TGCAAGCAGC ACCACTGCTG GCAACCAAGA AATGACTCAC ATCATTGCCG CTGGTAACGC TGGTTCAGGT ACGAACACAG TTGGCTCACC TGGTACAGCC AAAAACGTTA TTACCGTAGG TGCAACCGAA AACGTGCGCG ACGAAGGCGT GCTCGACGGC TGTAACGAAG GCAATGCTGA TAGTGCTGAT GATATTGCCA GTTTCTCCAG CCGTGGTCCA ACCGATGATG CACGGGTAAA ACCTGACATC ATGGCTCCCG GGACGCACGT GATGGGGCCA GCTTCACAAA TCGCCACCTT TGATGGGACT GGCCTTTGTG GGCCAAGCAG TGGCACTCAC TACCCCAATG GTCAAACCCT CTATAGCTGG TCGAGTGGTA CGAGCCACTC AACCCCAGCA GTTGCAGGCT CGGCTTCATT GCTCTACACC AAATATCGCA CCAGCTTCGG CGGTGGGGTT GCTCCAAGCC CAGCAATGTT GAAAGCCTAT ATGTTGGCCT CAACACGCTA CCTCAACGGG AATGGGACTG GCGGCACCTT GCCATCAAAT AACCAAGGCT GGGGCGATGT AAACTTGGCT CCGGCCTTGG ACAGCACCCC ACGGATCGTG GTTGACCAAA CCCGCACCTT CGGCGCAACG GGTGAAGAAT TTACCCAAGT TGGTCAGATT GCCGATTCAG CCAAACCAAT TCGGATTACC TTGGCTTGGA CTGACGCAGT TGGCAGCACC ACTGGCAACA GCTATGTCAA CGACCTTGAT CTCGAAGTGA CGGTTGGTGG CCAAACCTAC AAAGGTAACG TCTTCAACGG CGCACTTTCA ACCACTGGCG GCACTGCTGA TAACAAGAAC AACATTGAAG GTGTGTACTT GCCAGCAGGC ACAAGTGGCT CGATCCAAGT CAAAGTCATT GCCCGCAACA TCGCTGGCGA TGCCATCCCA GGCAATGCTG ATACCACCGA CCAAGACTTT GCCTTGTATG TCTACAACGG CTCGGTTGGC CCAACCGGTA CCTTGACTGG CCGCGTTGCC CAAAGCAACA ACAACCCAGT TGTTGGCGCA ACGGTTCGCA CCAGCGGTGG CTCAAGCGCT TTGACCAACG CCAGTGGTAA CTACAGCATG ATCGTACCAG TTGGTACCTA TGCGGTAACC GCTAGCTTGC AAGGTGCTTT CCAAACTGTT CCAAGCGTAT CGATTACCGA AAATGTCACC ACGACCCAAA ACTTTGTATT GAGCTACGGC TCAATTACAG GGACGGTACG TGATGCCTTT ACCCCCAGCC GCCCAATTGT GGGAGCAAAT GTTTCGGTCA TTGGCTATAG CACCACAACC AATGCTGCTG GTCAATACAC CATTCCAGTA GCCAATGTTG GCTCAACCTT GGTAACGGTC AGTGCACCAA AATACGCCAC CCAATCACAA TCGGTAACGG TGGTTGATGC AGGCACGGCG ACCCAAGACT TTACCTTGGC TGCTGGCGCG GTCGCAGGGA TTGTGACCAA CAGCGCTAAC GGTGCACCAG TTGGCGATGC AACGGTCGCC ATCAGCAACC AATACGTCAA GACCAAAGCT GACGGTAGCT TTGCCATTCG CTTGGAGCCA GGCAGTCATA CCCTGACCGC TTCAAAGATT GGTTTCGCTG CTGATAGCGC CAGCTTGACG ATTAGCAATG GGGTTACGAC AACCCAAGAT CTGGAAATTA CACCAGCTTT TGGTTATACC CCAGCCAGCT TGACCCGCAC CTTTACCTTT GGTGATGCGC CATTCACCGA TACGGTTGGC TTAGAATTAA CCAATAATGG CACTCAGCCA TTCACCTACA CCATTCGCGA AAATGGTGCG ACTGGCTTTA CCCCAGCGGT CAACCGCGCT GGTGGCAACG TCTTGGTGGT TCAACGCAAC TCAGCAACCT CGGCAACAGC TGCAACTACG GCTTTGACCG CGCTTGGCTA TACCTTTGAA TCAATCGATA ACATTGCATT CGAAGCCCGT AGCTTGGCCA ATTTGCAAAC TTTTGATGCA GTACTCTTCC TTGGTTCAAC TAACGCAACC GCCAATAATG CCAGCGAAAC CAAACTTACT GAATACCTCA ACGCAGGTGG TAAGCTGTTT ATCGCTGACA ACGACCTTGG TTTCTTCACC AATGCAGGCA CGTTCTACCG AACCATGCTT GACTCAACCT ATGGTGGTGA TGATCCAGGT GCTGCGAATC GGGCCTTGAC TGGCTTCGAT TTCATGGCTG GCGTAAATGC TGCCTCAGTG GATGCATACC CTGACTTCTA TACTCCAGGT AGCTCATCGA CGGTAATCTT CCGCTATGGC AATAACGCAG TTGGCGGTAG TTTCATCCAA CGTAATGGCT ACAAAGCTGT CTACCTCGCA ACCGACTTCA TTAACTTGGG CACTGGTGCC AGTGGCGAAG CGATCGAGCG CACCGTGCTT GAGCGTGTGT TGCAAGCAAT TGTTGGTAGT GGCGATAGCA TTGCCTGGTT GCAGGAAGCT CCTGCGACTG GCGTAATTGC TGGTGGTGAT AGCACCAACG TAGCAATTGG CTGGAATCCC GGTGTGATTA CCCAACCTGG CACCTACACT GGCAGCCTGA ACCTTGGCTA CTCCTCAGTG ATTAGCCAAA CCTTCAATAT TCCAGTGACG ATGGTGGTCA ACCCAGCCGC AACTCAAGCA CGAATTAGTG GTACGGTTAC CTCATCAGGG GTCTGTGACA CTGTTCCAGC TCCAGCTGCT GGCGCTACAA TTTTGATCAC CTATACCAGC GGGATGACTG CAACAGTTAC CACCAATGCC AATGGTGAAT ATGCCTTCTT CGTGCCTCAA GGTGGCACCT ATACTGTTGC CGCTAGCGCC GTTGACCACC TTGGTCAATC GCAAAGCGTC ACAGTTGCTG ATGGTGCTGA AGTTACCCAA AACTTCACCT TACGCTTGAA TAAGCCTTGT GTCACGGTTA GCCCAGGCTC ATTGAGCGCT TCGCTCCAAT TGGGCGCTGC TCCGGTTAAC CAAACCTTGG TTGTCACCAG CAAGGGTGCT GCCCCGCTCA ATGCAACCAT TAGTGAGCAA AGCCGCAGCG CAATCAGCCA AGCTGGCTAT ACCATCTCGG AAGTTCCCTA TAGCTGGCTC GAAGCCAACG ATGGGACGAA CTTGAACTTA ACCGATGATG CTGAAGCCAA CATTACTAGC CCATTCACTG TAACCATCTT CAACACCAGC AGCCGTAACT TGCGGGTTGC TAACAATGGG GTTGTCTTGG TCGGTGCAAC CACTGGTGAT GTAGCCGCAA CTAACGAATC CTTGCTAACT GGCGCAACCA ACAATGTTAT CTCGCCATTC TGGGATGATG TTGATGATGA ATCAGGGGCA ACCTACTGGA AAGTTGTTGG TACTGCGCCA AACCGCTCAT TGATCGTTCA GTGGGAAAAT CGCCCGCACT ATAACGGTAT TGGTAATGCA ACCTTCCAAG TTGTAATCAA CGAACAAGGT GGCATGATCT ACCAATATAA AGATCTCGAT TACGGCGATC CGTTGTACAA CAATGGTTTG AGTGCAACCG TTGGTGTGCG TGGAGCAAGC ACGGCTCAAG CTGCTCAATT CAGCTTCAAC CAAGCTCGCT TGCGCAGCGA AATGGCCTTG TGTGTTGGCG CTACCTGTGA TGCCATTTCA TGGTTGACTG AAACGCCAAA CACGATTACT GGTTTGGCTG GTACGCCAGT CAGCACCCAA GTCGTCAATG TGAGCTTCTC GACCAACGGT CTGACCCAAG CTGGGGTTTA TACCGGTAAC TTGGTCTTGA ACCACAACGC TCCTCAACCA GTGGTAACTA TCCCTGTTAC TTTGACGGTT ACTGCTCCAC CGAACTTCGG TACGATCAGT GGTACGATTC AAGGCTTAGC TGCATGTGAT GTCTCACCTG CTCCATTAAG TGGCGCAACC GTGACGATCA ACACCACGCC AGCAACTGTC TTGACCACCA GTGGTTCGGG GACATATAGC TATAGTTTGG CTGCTGGTAG CTACACGGTT ACCGTTGCCA AGGCTGGCTA TGTCACCCAA ATCTACAACG TGACGGTTCC TACCGCTGGC ACAGTGACCC AAAATGGTCA ACTGCGCTTG AATGCGCCAT GTTTGGATGT TGATACCACT CCAATTAGCG AAACGGTTGC CATCAACAGC ATTACCACCC AAACCCTCAC CTTGGATAAC AACGGTGCAG CGGCCTTGAC GTGGACAATC GCTGAATTGG CTGCAAGTGG TCGCCAAGCC CCAAGTGGCG AAGCAGCACC AACCCGCAAG CCAGTTCCAG CCAACTTACG CTCTGCTGGT AGCAAGCAAA CTGTTCCAGC CGATGAAGTT GTTCAAGATG GCAGCTTCGA GGAAACCAAT GCTGGCGATA GTTCGAATCC AAACTGGAGC CAAACCTCAA CCAACTTCGG GGTTGTGCTC TGTACCACCG GATGTTCAAC TGGCACCGCC GACGTTCCAC ACACTGGAAC TTGGTACGCT TGGTTTGGTG GTCTGACCCC ATCACAAAGC CCACCAGATG AGTTGGGCAC GATCTCGCAA ACCTTTACCG CTGCTGGCAA TGCCTCAGGT ACCTTATCGT TCTGGTTGTT TATGAACGAA CAAAGCGGTG TTGATAGCGA CTATCTCAAG GTTAAGATCG ACGGTACTAC GGTCTTCACG GTTACCAACG CAGAAGTTGC CGACTTCCCA ACCTACACGT TGGTCGAAGT TCCAATCAGC GCAGCCGTTT TGGCCGATAC CACCAACCAT ACCTTGTTGT TTGAATCGTT TGTTGATGGC AGTGGTAACA CCAACTTCTT CCTCGACGAT GTTTCACTCG ATATTGGTGG CGGTGCATCA TGTGTTGCTG ATGCTGCTCC ATGGGTTCGC GCAGTACCAA ATAGCGGTAC GATTGCAGCC GATGGCTCAG CCAATGTGGC ACTTGAATTC AACACCAATG GCTTGGCCGC TGGTCTCCAC ACAGTCAACC TGTGTATCCA AACCAACGAC ACAACCCGAC CAAATGTTCA AATTCCAGTC ACTATCAACG TGACTGAGAA TGTTGACCCA ACCCGTAAGC TCTACTTACC AATGGTCAGC AAGAACTAA
|
Protein sequence | MAKFPFIRQA ALIAMTLSLG LQGAAQPARA EVMRAPTATT KFLVEPNSPA NKFVRTNGTL IADYGSFSVW QVANNNTSSL SQLAGAQAAD LDTIGLRGIS FNPLKSVPTP QPSLSQSPTA DQQLWLVQFA GPLKDAWLDE LTKAGAERVI YMPNNAYLVW ANGQTVAKLD ALSKSNGAIQ WAGAYHPEYR LAPEFRQKAS NPDAKGLVDV TIQFYNSATV ERDVREVLDA SAQVYATPWQ VLNFTTISVQ VSEAALAPIA RRANVYNIEA WEAPQKMDER QGQIIAGNVT TAGGKVVPNG PGYLSWLQSQ GVPTDPNQYP IVDVVDDGLD NGTTSPLHPD FYVNGVRPGT SRITAVGNCT ADATGNGQAG HGNLNIGIVG SYNNLTGSPH VDGASSGIAG GYRVGLGISP FSRIASTKIF NNAGSFDLTN CGGGSNYIGI IAMAYSHGAA ISSNSWGSNS GGAYTASSQA FDQGTRDASS TTAGNQEMTH IIAAGNAGSG TNTVGSPGTA KNVITVGATE NVRDEGVLDG CNEGNADSAD DIASFSSRGP TDDARVKPDI MAPGTHVMGP ASQIATFDGT GLCGPSSGTH YPNGQTLYSW SSGTSHSTPA VAGSASLLYT KYRTSFGGGV APSPAMLKAY MLASTRYLNG NGTGGTLPSN NQGWGDVNLA PALDSTPRIV VDQTRTFGAT GEEFTQVGQI ADSAKPIRIT LAWTDAVGST TGNSYVNDLD LEVTVGGQTY KGNVFNGALS TTGGTADNKN NIEGVYLPAG TSGSIQVKVI ARNIAGDAIP GNADTTDQDF ALYVYNGSVG PTGTLTGRVA QSNNNPVVGA TVRTSGGSSA LTNASGNYSM IVPVGTYAVT ASLQGAFQTV PSVSITENVT TTQNFVLSYG SITGTVRDAF TPSRPIVGAN VSVIGYSTTT NAAGQYTIPV ANVGSTLVTV SAPKYATQSQ SVTVVDAGTA TQDFTLAAGA VAGIVTNSAN GAPVGDATVA ISNQYVKTKA DGSFAIRLEP GSHTLTASKI GFAADSASLT ISNGVTTTQD LEITPAFGYT PASLTRTFTF GDAPFTDTVG LELTNNGTQP FTYTIRENGA TGFTPAVNRA GGNVLVVQRN SATSATAATT ALTALGYTFE SIDNIAFEAR SLANLQTFDA VLFLGSTNAT ANNASETKLT EYLNAGGKLF IADNDLGFFT NAGTFYRTML DSTYGGDDPG AANRALTGFD FMAGVNAASV DAYPDFYTPG SSSTVIFRYG NNAVGGSFIQ RNGYKAVYLA TDFINLGTGA SGEAIERTVL ERVLQAIVGS GDSIAWLQEA PATGVIAGGD STNVAIGWNP GVITQPGTYT GSLNLGYSSV ISQTFNIPVT MVVNPAATQA RISGTVTSSG VCDTVPAPAA GATILITYTS GMTATVTTNA NGEYAFFVPQ GGTYTVAASA VDHLGQSQSV TVADGAEVTQ NFTLRLNKPC VTVSPGSLSA SLQLGAAPVN QTLVVTSKGA APLNATISEQ SRSAISQAGY TISEVPYSWL EANDGTNLNL TDDAEANITS PFTVTIFNTS SRNLRVANNG VVLVGATTGD VAATNESLLT GATNNVISPF WDDVDDESGA TYWKVVGTAP NRSLIVQWEN RPHYNGIGNA TFQVVINEQG GMIYQYKDLD YGDPLYNNGL SATVGVRGAS TAQAAQFSFN QARLRSEMAL CVGATCDAIS WLTETPNTIT GLAGTPVSTQ VVNVSFSTNG LTQAGVYTGN LVLNHNAPQP VVTIPVTLTV TAPPNFGTIS GTIQGLAACD VSPAPLSGAT VTINTTPATV LTTSGSGTYS YSLAAGSYTV TVAKAGYVTQ IYNVTVPTAG TVTQNGQLRL NAPCLDVDTT PISETVAINS ITTQTLTLDN NGAAALTWTI AELAASGRQA PSGEAAPTRK PVPANLRSAG SKQTVPADEV VQDGSFEETN AGDSSNPNWS QTSTNFGVVL CTTGCSTGTA DVPHTGTWYA WFGGLTPSQS PPDELGTISQ TFTAAGNASG TLSFWLFMNE QSGVDSDYLK VKIDGTTVFT VTNAEVADFP TYTLVEVPIS AAVLADTTNH TLLFESFVDG SGNTNFFLDD VSLDIGGGAS CVADAAPWVR AVPNSGTIAA DGSANVALEF NTNGLAAGLH TVNLCIQTND TTRPNVQIPV TINVTENVDP TRKLYLPMVS KN
|
| |