Gene Information |
Locus tag | SAG2053 |
Symbol | |
ID | 1014864 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Streptococcus agalactiae 2603V/R |
Kingdom | Bacteria |
Replicon accession | NC_004116 |
Strand | + |
Start bp | 2026877 |
End bp | 2031589 |
Gene Length | 4713 bp |
Protein Length | 1570 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 637317219 |
Product | serine protease |
Protein accession | NP_689039 |
Protein GI | 22538188 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain [TIGR01168] Gram-positive signal peptide, YSIRK family |
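
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2026877
end_bp = 2031589
gene_length_bp = 4713
protein_length_aa = 1570


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid count of a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 4713, matches Gene Length
print(coding_codons(gene_length_bp))   # 1570, matches Protein Length
```

Under these assumptions the three fields agree: 2031589 - 2026877 + 1 = 4713 bp, and 4713 / 3 = 1571 codons, one of which is the stop codon.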
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.322612 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACACAA AACAGCGTTT TTCAATCCGG AAATATAAGT TAGGTGCCGT ATCTGTACTT TTGGGAACCC TATTTTTTTT AGGTGGTATC ACAAATGTAG CTGCTGATTC TGTCATAAAT AAGCCATCTG ATATTGCAGT TGAACAGCAA GTAAAAGACA GTCCAACGAG CATAGCAAAT GAGACACCTA CTAACAACAC GTCATCAGCC CTTGCGTCGA CAGCTCAAGA CAATCTTGTT ACAAAGGCTA ATAATAGTCC AACAGAAACA CAACCAGTAG CTGAGTCTCA CTCTCAAGCC ACCGAGACAT TTTCCCCAGT CGCAAATCAA CCGGTTGAAA GCACTCAAGA AGTTTCTAAA ACTCCTTTAA CCAAACAAAA TTTAGCAGTC AAATCTACAC CAGCTATTTC TAAAGAAACC CCTCAAAACA TTGATAGTAA TAAAATTATC ACTGTCCCCA AAGTATGGAA CACAGGCTAC AAAGGAGAGG GAACTGTTGT AGCAATTATT GACTCAGGAC TAGATATCAA TCACGATGCT CTCCAATTAA ATGATTCGAC AAAAGCAAAA TACCAAAACG AACAGCAAAT GAATGCTGCT AAAGCAAAAG CTGGTATAAA CTATGGAAAA TGGTATAACA ACAAAGTAAT CTTTGGTCAC AACTATGTTG ATGTCAATAC AGAGCTAAAA GAGGTGAAAA GCACTTCTCA TGGTATGCAC GTAACCAGTA TCGCAACAGC TAATCCTAGC AAGAAAGATA CAAATGAATT AATCTATGGT GTTGCTCCTG AAGCACAAGT AATGTTTATG AGAGTCTTCT CTGATGAAAA AAGAGGAACT GGACCAGCCC TTTATGTTAA AGCTATTGAA GATGCCGTTA AACTCGGTGC TGACAGCATT AATTTAAGTT TAGGTGGAGC TAATGGGTCT TTAGTTAATG CCGATGACCG ACTTATAAAA GCTTTAGAGA TGGCTAGACT CGCTGGCGTT TCTGTTGTTA TAGCAGCAGG TAACGACGGT ACATTTGGGA GTGGAGCATC AAAGCCTTCT GCTCTTTATC CTGATTATGG TTTAGTTGGT AGTCCATCAA CAGCTCGTGA GGCCATTTCT GTAGCATCAT ATAATAATAC AACACTGGTT AATAAAGTCT TCAACATTAT CGGATTAGAA AACAACAGAA ATCTCAACAA CGGATTAGCT GCTTATGCAG ATCCTAAAGT TAGTGATAAG ACCTTTGAAG TAGGGAAACA ATATGATTAT GTTTTCGTAG GAAAAGGAAA CGACAATGAT TATAAGGACA AAACTTTAAA TGGTAAAATC GCCTTAATTG AACGTGGAGA TATTACTTTT ACAAAAAAAG TCGTCAATGC TATTAATCAC GGTGCTGTGG GAGCTATTAT CTTTAATAAC AAAGCTGGGG AAGCTAATCT AACAATGAGT TTAGATCCTG AAGCAAGTGC TATTCCTGCT ATTTTTACCC AAAAAGAGTT TGGAGATGTT TTAGCTAAAA ACAACTATAA AATTGTATTT AACAATATCA AAAATAAACA AGCCAACCCT AATGCAGGTG TCCTATCTGA CTTTTCAAGC TGGGGGTTAA CAGCAGACGG ACAATTAAAA CCTGACTTAT CTGCTCCTGG AGGCTCTATT TACGCCGCTA TCAATGATAA TGAATATGAT ATGATGAGTG GGACAAGTAT GGCTTCTCCC CATGTCGCTG GTGCTACTGC TCTAGTTAAA CAATACTTAT TGAAAGAACA TCCAGAACTT AAAAAAGGTG ACATTGAAAG AACTGTCAAA 
TACCTTCTTA TGAGTACTGC TAAAGCACAC CTAAACAAAG ATACAGGCGC TTACACCTCA CCACGCCAAC AAGGAGCAGG TATTATCGAT GTCGCAGCAG CAGTTCAGAC AGGATTATAC CTAACTGGTG GGGAAAACAA CTATGGCAGC GTTACATTAG GAAATATTAA AGATAAAATT TCCTTTGATG TTACTGTTCA TAATATCAAT AAAGTTGCAA AAGATTTACA CTATACAACC TATTTAAATA CTGATCAAGT TAAAGATGGC TTTGTCACAT TGGCTCCTCA ACAACTTGGT ACATTTACAG GGAAAACGAT ACGGATTGAA CCAGGGCAAA CCCAAACGAT TACAATTGAT ATAGATGTTT CGAAATACCA TGACATGTTA AAAAAAGTAA TGCCAAACGG CTATTTCCTA GAAGGCTACG TACGTTTTAC AGACCCTGTT GATGGTGGGG AAGTTCTTAG TATTCCTTAT GTTGGATTTA AGGGAGAATT CCAAAACTTA GAAGTTTTAG AAAAATCCAT TTATAAGCTT GTTGCTAACA AAGAAAAGGG ATTTTATTTC CAACCAAAAC AAACAAACGA AGTTCCTGGT TCAGAAGATT ATACTGCCTT AATGACTACA AGTTCAGAGC CTATCTACTC AACAGACGGT ACTAGTCCTA TCCAATTGAA AGCCTTGGGA AGCTATAAGT CTATAGATGG AAAATGGATC TTACAACTAG ATCAAAAAGG CCAGCCTCAT CTAGCCATTT CACCTAATGA TGACCAAAAT CAAGATGCCG TTGCAGTGAA AGGTGTTTTC TTACGTAATT TCAATAATTT AAGAGCCAAA GTCTATCGTG CAGATGATGT TAATTTACAA AAACCACTAT GGGTAAGTGC TCCCCAAGCA GGAGATAAAA ATTACTACAG CGGAAATACT GAAAATCCAA AATCTACATT TTTATATGAC ACAGAATGGA AAGGAACCAC TACTGATGGT ATTCCTTTAG AAGATGGAAA ATACAAATAC GTTTTAACTT ATTACTCTGA TGTCCCTGGC TCTAAGCCAC AACAAATGGT GTTTGATATC ACTTTGGATA GACAAGCTCC TACACTAACA ACAGCAACTT ATGACAAAGA TAGACGTATC TTCAAAGCTC GTCCTGCAGT AGAACACGGG GAATCTGGTA TCTTTAGAGA ACAAGTTTTT TACTTAAAAA AAGATAAAGA TGGTCATTAT AATAGCGTCT TACGTCAACA AGGAGAAGAC GGTATCCTTG TTGAAGATAA CAAAGTATTT ATCAAACAAG AAAAGGATGG TAGCTTTATT CTACCTAAAG AGGTTAACGA TTTCTCTCAT GTCTACTATA CTGTTGAAGA TTATGCAGGC AATCTAGTAT CAGCAAAACT CGAAGATTTG ATCAATATTG GCAATAAAAA TGGTTTAGTA AACGTCAAAG TGTTTAGCCC TGAGCTTAAC AGTAATGTCG ATATTGATTT CTCTTACTCT GTCAAAGATG ACAAAGGTAA TATCATCAAA AAGCAACATC ACGGGAAAGA CCTCAATTTA TTGAAATTGC CTTTTGGTAC CTATACGTTT GATCTATTCT TATACGATGA GGAACGAGCA AATCTAATCA GTCCCAAAAG TGTCACTGTA ACTATTTCTG AAAAAGATAG CCTTAAAGAC GTCTTATTTA AAGTTAACTT ACTCAAGAAA GCAGCCTTAC TCGTTGAATT TGACAAGCTT TTACCAAAAG GAGCAACAGT CCAGTTGGTT ACTAAGACAA ATACTGTTGT TGATCTACCA AAAGCAACTT 
ATTCTCCTAC TGACTATGGT AAAAACATAC CTGTAGGAGA CTATCGTTTA AACGTAACGC TGCCTAGTGG GTATAGCACT TTAGAGAACT TAGATGATTT ACTTGTATCC GTAAAAGAAG ATCAGGTAAA CCTAACAAAA TTGACGCTGA TTAATAAAGC TCCTCTGATT AATGCCCTAG CAGAACAAAC TGATATTATT ACCCAGCCTG TGTTTTATAA TGCTGGAACT CACTTAAAAA ATAATTACCT AGCTAATCTT GAAAAGGCAC AAACTTTAAT TAAAAATAGA GTGGAACAAA CAAGTATTGA TAATGCTATT GCTGCTTTGA GAGAAAGTCG CCAAGCTCTT AACGGTAAAG AAACAGATAC TTCTTTACTG GCAAAAGCTA TTTTAGCTGA AACAGAAATC AAGGGAAACT ATCAATTTGT TAATGCTAGT CCATTAAGCC AATCAACTTA TATCAATCAA GTCCAATTGG CGAAAAACCT TCTACAAAAA CCTAACGTCA CTCAATCAGA AGTAGACAAA GCCTTAGAAA ATCTTGATAT TGCTAAAAAT CAATTAAATG GTCATGAAAC TGATTACTCT GGTTTACACC ATATGATAAT TAAAGCAAAC GTTCTGAAAC AAACATCATC TAAATATCAG AACGCCAGTC AATTTGCTAA AGAAAATTAT AATAACCTTA TCAAGAAAGC AGAATTGCTG CTTTCCAATA GACAAGCTAC ACAAGCTCAA GTTGAAGAGT TATTAAACCA AATAAAAGCA ACCGAACAAG AGCTTGATGG CCGTGATAGA GTTTCTTCCG CAGAGAATTA TAGTCAATCA CTCAATGATA ATGACTCTCT CAATACCACA CCTATCAATC CGCCAAATCA GCCCCAGGCG TTGATATTCA AAAAAGGCAT GACTAAAGAA AGTGAGGTTG CTCAGAAGCG TGTCTTAGGG GTGACTAGCC AAACCGATAA TCAAAAGGTA AAGACAAACA AGCTTCCTAA AACAGGCGAA AGCACTCCTA AAATAACCTA TACAATATTG CTATTTAGTC TCTCTATGCT AGGTCTGGCA ACAATCAAAC TAAAGTCTAT CAAAAGAGAA TAA
Protein sequence | MNTKQRFSIR KYKLGAVSVL LGTLFFLGGI TNVAADSVIN KPSDIAVEQQ VKDSPTSIAN ETPTNNTSSA LASTAQDNLV TKANNSPTET QPVAESHSQA TETFSPVANQ PVESTQEVSK TPLTKQNLAV KSTPAISKET PQNIDSNKII TVPKVWNTGY KGEGTVVAII DSGLDINHDA LQLNDSTKAK YQNEQQMNAA KAKAGINYGK WYNNKVIFGH NYVDVNTELK EVKSTSHGMH VTSIATANPS KKDTNELIYG VAPEAQVMFM RVFSDEKRGT GPALYVKAIE DAVKLGADSI NLSLGGANGS LVNADDRLIK ALEMARLAGV SVVIAAGNDG TFGSGASKPS ALYPDYGLVG SPSTAREAIS VASYNNTTLV NKVFNIIGLE NNRNLNNGLA AYADPKVSDK TFEVGKQYDY VFVGKGNDND YKDKTLNGKI ALIERGDITF TKKVVNAINH GAVGAIIFNN KAGEANLTMS LDPEASAIPA IFTQKEFGDV LAKNNYKIVF NNIKNKQANP NAGVLSDFSS WGLTADGQLK PDLSAPGGSI YAAINDNEYD MMSGTSMASP HVAGATALVK QYLLKEHPEL KKGDIERTVK YLLMSTAKAH LNKDTGAYTS PRQQGAGIID VAAAVQTGLY LTGGENNYGS VTLGNIKDKI SFDVTVHNIN KVAKDLHYTT YLNTDQVKDG FVTLAPQQLG TFTGKTIRIE PGQTQTITID IDVSKYHDML KKVMPNGYFL EGYVRFTDPV DGGEVLSIPY VGFKGEFQNL EVLEKSIYKL VANKEKGFYF QPKQTNEVPG SEDYTALMTT SSEPIYSTDG TSPIQLKALG SYKSIDGKWI LQLDQKGQPH LAISPNDDQN QDAVAVKGVF LRNFNNLRAK VYRADDVNLQ KPLWVSAPQA GDKNYYSGNT ENPKSTFLYD TEWKGTTTDG IPLEDGKYKY VLTYYSDVPG SKPQQMVFDI TLDRQAPTLT TATYDKDRRI FKARPAVEHG ESGIFREQVF YLKKDKDGHY NSVLRQQGED GILVEDNKVF IKQEKDGSFI LPKEVNDFSH VYYTVEDYAG NLVSAKLEDL INIGNKNGLV NVKVFSPELN SNVDIDFSYS VKDDKGNIIK KQHHGKDLNL LKLPFGTYTF DLFLYDEERA NLISPKSVTV TISEKDSLKD VLFKVNLLKK AALLVEFDKL LPKGATVQLV TKTNTVVDLP KATYSPTDYG KNIPVGDYRL NVTLPSGYST LENLDDLLVS VKEDQVNLTK LTLINKAPLI NALAEQTDII TQPVFYNAGT HLKNNYLANL EKAQTLIKNR VEQTSIDNAI AALRESRQAL NGKETDTSLL AKAILAETEI KGNYQFVNAS PLSQSTYINQ VQLAKNLLQK PNVTQSEVDK ALENLDIAKN QLNGHETDYS GLHHMIIKAN VLKQTSSKYQ NASQFAKENY NNLIKKAELL LSNRQATQAQ VEELLNQIKA TEQELDGRDR VSSAENYSQS LNDNDSLNTT PINPPNQPQA LIFKKGMTKE SEVAQKRVLG VTSQTDNQKV KTNKLPKTGE STPKITYTIL LFSLSMLGLA TIKLKSIKRE
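
The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that, then computes a GC fraction and scans for an LPXTG-like motif of the kind named in the TIGRFAM annotation. The helper names and the simple `LP.TG` pattern are assumptions made for illustration, not the method used by the database, and only short excerpts of the record are used as input.

```python
import re


def normalize(seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string."""
    dna = normalize(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# Simplified pattern (an assumption): L, P, any residue, T, G.
LPXTG = re.compile(r"LP.TG")


def find_lpxtg(protein: str) -> list[tuple[int, str]]:
    """Return (0-based offset, matched text) for each LPXTG-like hit."""
    protein = normalize(protein)
    return [(m.start(), m.group()) for m in LPXTG.finditer(protein)]


# Short excerpts copied from the record: the first two blocks of the gene
# sequence and the last four blocks of the protein sequence.
dna_head = "ATGAACACAA AACAGCGTTT"
protein_tail = "KTNKLPKTGE STPKITYTIL LFSLSMLGLA TIKLKSIKRE"

print(round(gc_fraction(dna_head), 2))   # 0.35 for this 20-base excerpt
print(find_lpxtg(protein_tail))          # [(4, 'LPKTG')]
```

Passing the full gene sequence to `gc_fraction` would give a value to compare with the GC content field, and the offsets returned by `find_lpxtg` are relative to whatever string is passed in, not to the full protein.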