Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3587 |
Symbol | |
ID | 5735448 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4514514 |
End bp | 4516313 |
Gene Length | 1800 bp |
Protein Length | 599 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280736 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001546351 |
Protein GI | 159900104 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAG AACATCGCAG AGGATTTTTT CGCAGTTTAT TGCTGGCAAG CCTCTTTTTG AGTGCGATTG GCCTTGCACA GCCTGTTGCA ACCACCCAAC CGCAATCAAT GTTAGTGCAA GGTGCTGCGC TAAGCAACCA AGCTAGGGCA ATTCAAGCAG TTGGCGGCAC TGTCGAACGC CAATTTGCCC GCTTTGAAAC CAGCGTAGCC CAATTAACCA AACCACAAAT TGCTCAACTG CAAGCGCAAG GCTTGCGGGT TTTTGCCGAT CAAGCGGTTG CTAGCTCAGC GCAAGGAGCT GTTGATCAGC CAATTGCTGT GCAGGCTGGC CCCAAAAGTA GCAACCCTGC GCCATTGCAA TATCCATCGA TTGTAACCGG CGCTGATGTG GTGCAACGCC GTGGTATCGA TGGACGCGGC GTGACCGTGG CAGTGATCGA TTCGGGTTTG CCAGCGATCG AACGACCTGA GCGTTGGCAG CGCATCGATA ACACGACTGC TCGCTACAAC CAAGGCAGCC GTTTTTTGAT TTATAAAGAC ATGTTGGCAA GCAACCAAAT TACGAATAGC AGCGACCCCT ACGGCCATGG AACGCATGTA TTTGCCACTT TAGCCGATAA TCGCGCTTTG CCAGCAGGCT ATAGCGGAGC CAAAGTCGGG ATTGCCCCAG GCGCGAATTA TGTGGTCGTT CGGGCCTTAA ATTCGCAAGG CCAAGCCTCG TATAGCACGA TTATTGCGGC GATTGATTGG GTGATTGAGC ATGCTGACGA GTACGATATT CGGGTGCTGA ATCTCTCGTT GCAAGCCGAA GTGATTGCGC CCTACTGGTA TGATCCGCTC AATCAAGCGG TGATGCATGC TTGGAATGAG GGCATCACGG TGGTGGCAGC GGCTGGCAAC ACTGGCCCAA ATCCTGCCAC GATTATGGCT CCAGCCAACG TGCCCTATAT CATCAGTGTC GGGGCAATTC GGCCTGCAAT CTACAACGAC AATGGCAGCG ATAGTTTAGC AGCCTATTCA TCGGCTGGGC CAACCGAAAG TCGTTTTGTT AAGCCCGATG TGGTGATTGC TGGCTCGCGG GTCATTGCAC CCCTGCCCGC CGATAGTGTG TTAGCTAATT CTGGGTCCGC AGGTTTGGTT AGCGAACGCG CCAAACTCGA ATGGGCTGAT TTGAAAACCT CGCAACAGCT TAATTATTAT GCCTTGAGTG GCACATCGAT GGCCGCCGCC GAAGTCAGTG GAATTGTCGC CTTGTTGTTG CAAGATGAGC CAAATTTGAC CAATAACCAA ATTAAGGCTC GTTTGACTGG CACGGCCCAA CTTGCCACTT TGGATGATGG TCAAGCCGCT TACAGCACGT GGCAACAAGG CGCAGGCAAA GTTGTGCCAA CTGCGGTGAT CGATGGTAGC AACACTGGCG CTGCCAACGC GGGAATGGAT TTGCCGACCG ACCTGATTGT GAGCAGCAAC CCCGATGATA CCAGCAACCA TTACCAAGGC TCGACTGAGT ATGATGCGGT GACCAACACT TTTACTGATA ACACCACGAC CTATAGCTCA ACCCTTGCGA CCAGCTACAG CACCTGGGCG GGGAGCTATA GTACATGGGC TGGCAGTTAT CTAACATGGG CTGGTAGTTA CAGCACGTGG GCTGGCAGCT ACAGCACGTG GGCTGGCAGT TACAGCACGT GGGCTGGCAG TTACAGCACG TGGGCTGGCA GTTATAGCAC GTGGGCTGGC AATAACGCTG CTGCGAGTGG AACCACCAAT GTCAACGCTG GAAGCCTCGT GAGCGACTAA
|
Protein sequence | MDREHRRGFF RSLLLASLFL SAIGLAQPVA TTQPQSMLVQ GAALSNQARA IQAVGGTVER QFARFETSVA QLTKPQIAQL QAQGLRVFAD QAVASSAQGA VDQPIAVQAG PKSSNPAPLQ YPSIVTGADV VQRRGIDGRG VTVAVIDSGL PAIERPERWQ RIDNTTARYN QGSRFLIYKD MLASNQITNS SDPYGHGTHV FATLADNRAL PAGYSGAKVG IAPGANYVVV RALNSQGQAS YSTIIAAIDW VIEHADEYDI RVLNLSLQAE VIAPYWYDPL NQAVMHAWNE GITVVAAAGN TGPNPATIMA PANVPYIISV GAIRPAIYND NGSDSLAAYS SAGPTESRFV KPDVVIAGSR VIAPLPADSV LANSGSAGLV SERAKLEWAD LKTSQQLNYY ALSGTSMAAA EVSGIVALLL QDEPNLTNNQ IKARLTGTAQ LATLDDGQAA YSTWQQGAGK VVPTAVIDGS NTGAANAGMD LPTDLIVSSN PDDTSNHYQG STEYDAVTNT FTDNTTTYSS TLATSYSTWA GSYSTWAGSY LTWAGSYSTW AGSYSTWAGS YSTWAGSYST WAGSYSTWAG NNAAASGTTN VNAGSLVSD
|
| |