Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3197 |
Symbol | |
ID | 5736899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4043339 |
End bp | 4045588 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280343 |
Product | peptidase S8/S53 subtilisin kexin sedolisin |
Protein accession | YP_001545962 |
Protein GI | 159899715 |
COG category | [C] Energy production and conversion [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases [COG3794] Plastocyanin |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0061261 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCTGA TCAAACCATT CATCCTCCTA ATGGTGTTAG CAGGCTTTTG TCTGCCAAGC GCCTCCGCCA ACCCCCCCGA CCGTCTCGCA AGCAAAGCCG ATTCAGCCCT GCTGGCAAAG CTGAATGCTG GCCAACAAGT GCCAACCCTC GTGCTATTGC AAGCCCAAGT CGATACCAAC TTCGCTGATC GTTTGGCCAG TAAAGAAGCC AAGGGTGCGG CGGTGGTTGC CGCCCTGCGC CAACAAGCCC AACGCGATCA AACTCCACTG CTTGCCGAAC TCAGCAACCG CGGGATTCAA TCCGAGGGCT TTCTCTCGGT TAATGCCTTG TATGCGACGC TTGATTTAGC CAGCGCTCAA TGGCTGGCCG AACAAGCCAG CGTCAAGCAA TTGATCGAAG ATTCGATTGT GGTCAAGGTC GAAAAACCAG TTGCCGAAAC TAACCCAGCA CCCCAAGCGG TGAATACCAC GACCTGGGGG GTTAACTATG TCAAAGCCCC CGAAGTGTGG GCTAAAGGCA TTACTGGCCA AGGCATCGTA ATTGCTGGCG AAGATACTGG GGTGCGCTGG ACGCACGCCG CATTGAAGAG CAAATATCGC GGCTGGGATG GCACAAATGC TAGTCACGAT TACAACTGGT ACGACGGCAT TCGCACTTCG TTAGGTGGCA CGAATCCTTG TGGTTTAGCG CTGAACGTGC CTTGTGACGA TAACTCGCAT GGCACTCACA CCGTTGGTAC GGTTGTTGGC GATAATGGCA CGGGCGAACA AATTGGGGTT GCGCCTGGCG CAAAATGGAT CGCCTGCCGT AACATGGATG CTGGCAACGG TACGCCTGCA ACCTACATTC GCTGTATCGA CTGGATGTTG GCTCCGTTCC CCACTGCTGG CACCAGTGCT CAAGGCGACC CCAGCAAAGC GCCGCACGTT GTCAACAATT CATGGGGCTG CCTAGCCTCA GAAGGTTGTA CCAATACCCC TTCTGATGGC ATTCAGACCT CAGTTCAAAA TGTGACCAAT GCAGGGATTA TGTTTGTGGC CTCAGCAGGG AATGATGGGA GCGGTTGTGC CACCATCACC ACGCCAGTCG CGATCTACCC TGAAAGCTTT GTGGTTGGCT CACATACCTC AACTGGGGCA ATTTCTGGGT TTAGCAGCCG TGGTCCAGCT ACCAACAATG GTGCCAGCCG AATCGGCCCA GACATTTCAG CCCCAGGCTC ATCTGTGCGT TCGGCAACCA ATGGTGGTGA TGATGTTTAT GGTTCAAGCT CGGGCACGAG TATGGCTAGC CCGCACGTAG TTGGGGTTGT GGCCTTGTTG TGGTCGGCTC GCCCAGAACT GCAAGGCCAA GTTGATCTCA CTCGTGCGAT CTTGCAAGAA ACCGCTACTG CTGCGCCTTC AACCCAAACC TGTGGTGGTG TTGCTGGTAG CAGCATCCCC AACAATACCT TTGGTCATGG CTATGTCAAT GCTTTGAATG CGATTGCCCC AACCTTGCAA GGCAGCATCA CGGTCGATGG CACGGCGGCA ACCTCAGCCA CCATTCGCTT GGAAAATAGC GTTGGCGTGG TTGAATTTGG CAAAACCACT GGTGTCTATA GCACCAGCTT GCCAGCAGGC CTCTACAGTG CAACCGTTAC TGTGCCAAAC GAAACGCCAA TTACTCGCCA AGTAACGATT GTCAAAGGCC AAGTTGCGAC TGAAAACTTT GAATTTGGCG ATGTGACTGG TACTGTTAGT GGTCATGCAA CCCTCAATGG CACGGGCCGC GCTGGTGTCA GCATCACCGC CAACCCCGGC AACTTCACCA CCGAAACCAA CGCTAACGGC GATTACAACT TAGCCTTAGC CCCGGGAACC TATACGATTT CTAGCGAATT TATGGCCTTG GAAACCCAAG TGGCAACGGT AACGGTGGTT CTCAATCAGA CCGTCATCCA AGATTTTGAT TTCGCGACCA CCCAAACCAT CAATATTCAG AACTTCCGGT TTAGCCCTAG CCCAATTACG GTTACCTTGG GAACCAGCAT CTTGTGGCGT AACCTTGATG CTTCGACCCA CACCACGACC CGTGGTCAAA TGCCGTTTAT TTGGGATTCA GGCGATCTGA GCCAGAACCA AGATTATGCC GTAACCTTTG ATCAAGTTGG TACTTTCAGC TATGTTTGTA GCTTGCATGG CAGTATGCAA GGTACGGTTG TGGTAACTCC ACCAATGCAA AATACCTATC TGCCGTGGAC AACTAAATAA
|
Protein sequence | MRLIKPFILL MVLAGFCLPS ASANPPDRLA SKADSALLAK LNAGQQVPTL VLLQAQVDTN FADRLASKEA KGAAVVAALR QQAQRDQTPL LAELSNRGIQ SEGFLSVNAL YATLDLASAQ WLAEQASVKQ LIEDSIVVKV EKPVAETNPA PQAVNTTTWG VNYVKAPEVW AKGITGQGIV IAGEDTGVRW THAALKSKYR GWDGTNASHD YNWYDGIRTS LGGTNPCGLA LNVPCDDNSH GTHTVGTVVG DNGTGEQIGV APGAKWIACR NMDAGNGTPA TYIRCIDWML APFPTAGTSA QGDPSKAPHV VNNSWGCLAS EGCTNTPSDG IQTSVQNVTN AGIMFVASAG NDGSGCATIT TPVAIYPESF VVGSHTSTGA ISGFSSRGPA TNNGASRIGP DISAPGSSVR SATNGGDDVY GSSSGTSMAS PHVVGVVALL WSARPELQGQ VDLTRAILQE TATAAPSTQT CGGVAGSSIP NNTFGHGYVN ALNAIAPTLQ GSITVDGTAA TSATIRLENS VGVVEFGKTT GVYSTSLPAG LYSATVTVPN ETPITRQVTI VKGQVATENF EFGDVTGTVS GHATLNGTGR AGVSITANPG NFTTETNANG DYNLALAPGT YTISSEFMAL ETQVATVTVV LNQTVIQDFD FATTQTINIQ NFRFSPSPIT VTLGTSILWR NLDASTHTTT RGQMPFIWDS GDLSQNQDYA VTFDQVGTFS YVCSLHGSMQ GTVVVTPPMQ NTYLPWTTK
|
| |