Gene Information |
Locus tag | Haur_4652 |
Symbol | |
ID | 5736499 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5942748 |
End bp | 5945708 |
Gene Length | 2961 bp |
Protein Length | 986 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281816 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001547411 |
Protein GI | 159901164 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
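The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return cds_length_bp // 3 - 1


length = gene_length_bp(5942748, 5945708)
print(length)                     # 2961
print(protein_length_aa(length))  # 986
```

Under these assumptions the values reproduce the listed Gene Length (2961 bp) and Protein Length (986 aa).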
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0586324 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTGGCGCG TTTTAACAAT CGGCCTGCTC GTTTTTGGCC TTTTACCCTC GGTTAGTGCC CGAACAACCC TGCCCATTCA GTTTGTTATG CCAATTGCCA GCGTAACCAA TCGCGGCCTA AGCTTCGATA ATCGTGGCTT GCCTCAACGC ACGCTTCAGT TGTACGAGGC TTGGCCTGCA ACCCAAGCCA AACGCCAGCC TGATCCGTTG TTGCGGGTGG CTGTGCCAAG CTTTCCCCAA AAAATTGACC CAAATTTACA AAACTATTGG CATGATGCTC CGAGCCAGCC GCAAACGCTG CTGGTATTTT TGGCCGAGCA AGCCGATTTG AGCTTGGCCA GCACCTTTGA TGATTGGGCT GCACGTGGTG ATTACGTCTA CAAAACCCTG ACTGACCATG CCCAACATAG CCAAACCCCT TTGCTAAACG CATTACGGGC GCAAGGCCAC AACCCACAAT CGTTGTGGAT TGTCAATAGT TTGATTGTTG AGGGTGATCA GCAGTTGGCC TTGGATCTGG CGCAACATCC AGCGGTTGCT AGCATTGGGG CTAACCATGT TTTTAATCTC CCAAGCGTAG CGACGACTCT TGTAACTGAG CCTGAGAATG TGGCGTGGGG GGTGGCGGCG GTTGATGCGC CCCATGTTTG GTCTGATTGG GGCGTGCGTG GTCAGGGGAT TGTGGTTGCC AATATTGATA CTGGCGTTAC CGTAAGCCAT ACAGCTTTGC TGAATAATTA TCGTGGTTGG TCAGCCAATG GCTTGAGCAA TGATTACAAC TGGTTTGATC CGCTGTATCA GTATCGTTTG CCAACCGATC CGGCGGGCCA TGGTACGCAC ACGATGGGCA GTTTGGTGGG AGCGAATGAC CAGCAGGGCA TGGCCTTGGG TGTTGCACCA GCCGCTCGTT GGATTGCCGC CCGCGCGTGT GGGGCGTTGA CCTGCGATGA CTTGAGTTTG ATCAGAAGTG CCCAATGGAT GCTTGCCCCA ACCCGGGTTG GCTGCGAACG CAATCAGCAA ATTGCTTGCG ATCCGCGGCC CGATTTACGT CCACACATCA TCAATAATTC GTGGGGTGGG CCAGGCGAAA GCACGTGGTA CAGCGGGTAT ATTACTGCGT GGGATGCAGC AGGCATTTTG AGTGTGTTTG CGGCGGGCAA TTTTGGGCGC TCTGGCTGCT ATACGAGCAC TGCACCTGGT AATAATGCCA ATGTGTTCAG TGTTGGTGCT GTCGATATTA ATAATCTGAT CGCCGATTTT TCTTCGCGTG GGCCAACCAG CGATGGTCGT ACCAACCCCG ATTTGAGTGC GCCAGGTGTG CGTGTTCCTT CAGCTTGGCC GAATGGATCG ACAGCCTTGC TCGATGGCAC ATCGATGGCG GCTCCTCATG TGAGCGGTAT TGCTGCGCTA ATTTGGTCGG CGAACCCGCA GTATATTGGC GATTTAGCTG CAACCCAAGC CTTGTTGACG AACACAAGCG AAGCGCGTTA CTCGGCCCAA TGTGGCGATG CGCCAACAGC ACGGCCCAAT AATGTGTATG GTTGGGGCAG TGCCGATGCC TATGCAGCGG TACGTCAGGC ACGAGTTGAT GTGGCTTGGC TGAGTTTGCC TGAGCAGTTG CTTGTTCCAG CGAATACCCT CGTGACGATT CCGATCACTT TGGATACCCG CCAAGTCAGC GCAGCGGGGA GCTATCGGGC GAATGTTTTG GTGGTCGCTA GTTCAGGTAC AAACACCTTT GAATTAGAAC TGATCGTCGA GGCCGCAGCC AATACTAGCC AATTTACCGG CCAATTAGTT 
GATCGGTGGC ATGGACGTGG GGTGTATGGG CGGGTCAGCA TTGGTGGCGG GCCTTCTAGT TATACCGATC CAACCGGCCA TTACACCATG ACCCTGACAA CCAGCAGCCA TGAGATCTCG ACCCAAGCTA CTGGCTATCA TCCAGCGGCT ACGATGGTTG ATTTAAATCT CCAGCAGACC AATGTGCTGA CATTAACGCC TGATATTCCG CATATGCTGG CTGAAATTCC GCCGATCAGT GCTAGCTTGG CTTTTGCTGA ACAACGCACA TTTGCGGTAA CCTTGACCAA TGCGGGCACT CAGCCGCTAG TCGTTTCGCC GCATGTGCCA AATCAGGAAT GGCAGATTAC CCCAGTCCCG AGAACCGCCC TATACGATAC AACTGGCTTG GCCGAATTAA AGCTCGATGA TGACCAAGTG TATACCGATG CCTTGGATTT GGGCTTTAGT GCGCCATTGT TTGGCACTTT GGCAAACAAG GTTTATCTGA GTTCAAATGG CTGGGTTTCA TTGAATCAAA CGCGGAGTGC TGCCCCTAGC GCTAACTGCT TTCCGGCCAA CAATTTGCCC AATGCCACGC TTGCCCCCTT CTGGACTGAT CTTGATCCTT CAGAGGGTGG CATTATCCGT GCTGGAAGGG TTGATGCTGA TACGTTTGTG GCGAGTTATG AACAGGTTCC AATCTGGCAA GAAGAACATC TTCCCACGGC TGCCCCAACC TACACCTTCC AATTAATTAT TGAGCGTAGT GGGCAGGTTG AGTATCGTTA TGGGGCGATG GGCTACTTTC CAGGCCGCTG GGGAGTTGGC ACGCATACCA ATAGTAGTGT TGGGCAGGCT TTGGGTTGCC ATCAAAGCCA TGAATATTTG GCGGCCCACA ATTGGCAATT GCTCAATCAG CCAAGTAGTC AGCAATGGTT GAGTGCTACG CCAAGTAGCC TGACGATCGC GCCCAATCAA CAAGCCACCG TGTTGGTTCA GCTTAAGGGC TTTGGGGCAA TCAGTTGGTT GCAACATCCT GCGGTCAGCA TTGTGCAGAT CAACAGCAAC GATCCGCGCC AGCCGCAACG TGAAATAACC GCGAGCGTAG GGTTGCAACT AGCGCCTTAT CAAACCTATG CCAATACGAT TGTGATTAGT AATCCATTAG CAAACCCTTG A
|
Protein sequence | MWRVLTIGLL VFGLLPSVSA RTTLPIQFVM PIASVTNRGL SFDNRGLPQR TLQLYEAWPA TQAKRQPDPL LRVAVPSFPQ KIDPNLQNYW HDAPSQPQTL LVFLAEQADL SLASTFDDWA ARGDYVYKTL TDHAQHSQTP LLNALRAQGH NPQSLWIVNS LIVEGDQQLA LDLAQHPAVA SIGANHVFNL PSVATTLVTE PENVAWGVAA VDAPHVWSDW GVRGQGIVVA NIDTGVTVSH TALLNNYRGW SANGLSNDYN WFDPLYQYRL PTDPAGHGTH TMGSLVGAND QQGMALGVAP AARWIAARAC GALTCDDLSL IRSAQWMLAP TRVGCERNQQ IACDPRPDLR PHIINNSWGG PGESTWYSGY ITAWDAAGIL SVFAAGNFGR SGCYTSTAPG NNANVFSVGA VDINNLIADF SSRGPTSDGR TNPDLSAPGV RVPSAWPNGS TALLDGTSMA APHVSGIAAL IWSANPQYIG DLAATQALLT NTSEARYSAQ CGDAPTARPN NVYGWGSADA YAAVRQARVD VAWLSLPEQL LVPANTLVTI PITLDTRQVS AAGSYRANVL VVASSGTNTF ELELIVEAAA NTSQFTGQLV DRWHGRGVYG RVSIGGGPSS YTDPTGHYTM TLTTSSHEIS TQATGYHPAA TMVDLNLQQT NVLTLTPDIP HMLAEIPPIS ASLAFAEQRT FAVTLTNAGT QPLVVSPHVP NQEWQITPVP RTALYDTTGL AELKLDDDQV YTDALDLGFS APLFGTLANK VYLSSNGWVS LNQTRSAAPS ANCFPANNLP NATLAPFWTD LDPSEGGIIR AGRVDADTFV ASYEQVPIWQ EEHLPTAAPT YTFQLIIERS GQVEYRYGAM GYFPGRWGVG THTNSSVGQA LGCHQSHEYL AAHNWQLLNQ PSSQQWLSAT PSSLTIAPNQ QATVLVQLKG FGAISWLQHP AVSIVQINSN DPRQPQREIT ASVGLQLAPY QTYANTIVIS NPLANP
|
| |
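The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The sketch below is illustrative only: it strips the grouping, computes GC content, and translates a CDS. The helper names (`strip_grouping`, `gc_percent`, `translate`) are hypothetical. The codon table is written out by hand. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the start codons it permits, so this sketch does not handle an alternative start codon (for example GTG or TTG) that would be read as M at the first position. This gene starts with ATG, so that case does not arise here.

```python
# Standard codon-to-amino-acid assignments in NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def strip_grouping(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = strip_grouping(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = strip_grouping(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGTGGCGCG TTTTAACAAT CGGCCTGCTC"
print(translate(prefix))  # MWRVLTIGLL
```

The translated prefix matches the first ten residues of the protein sequence in the record. Running `gc_percent` and `translate` on the full gene sequence would be one way to check the listed GC content and protein sequence.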