Gene SAG2053 details


Gene Information

Locus tag: SAG2053
Symbol:
ID: 1014864
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Streptococcus agalactiae 2603V/R
Kingdom: Bacteria
Replicon accession: NC_004116
Strand:
Start bp: 2026877
End bp: 2031589
Gene Length: 4713 bp
Protein Length: 1570 aa
Translation table: 11
GC content: 37%
IMG OID: 637317219
Product: serine protease
Protein accession: NP_689039
Protein GI: 22538188
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID: [TIGR01167] LPXTG-motif cell wall anchor domain
            [TIGR01168] Gram-positive signal peptide, YSIRK family
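
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function name `check_lengths` is hypothetical and not part of any database tool.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check the coordinate and length fields of a CDS record.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    that is not counted in the protein length.
    """
    span = end_bp - start_bp + 1          # inclusive span on the replicon
    codons, remainder = divmod(gene_length_bp, 3)
    return {
        "span_matches_gene_length": span == gene_length_bp,
        "length_is_multiple_of_3": remainder == 0,
        "protein_length_matches": codons - 1 == protein_length_aa,
    }


# Values copied from the record for SAG2053
result = check_lengths(2026877, 2031589, 4713, 1570)
print(result)
```

With the values from this record, all three checks pass: 2031589 - 2026877 + 1 = 4713 bp, and 4713 / 3 = 1571 codons, one of which is the stop codon, leaving 1570 aa.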


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.322612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACACAA AACAGCGTTT TTCAATCCGG AAATATAAGT TAGGTGCCGT ATCTGTACTT 
TTGGGAACCC TATTTTTTTT AGGTGGTATC ACAAATGTAG CTGCTGATTC TGTCATAAAT
AAGCCATCTG ATATTGCAGT TGAACAGCAA GTAAAAGACA GTCCAACGAG CATAGCAAAT
GAGACACCTA CTAACAACAC GTCATCAGCC CTTGCGTCGA CAGCTCAAGA CAATCTTGTT
ACAAAGGCTA ATAATAGTCC AACAGAAACA CAACCAGTAG CTGAGTCTCA CTCTCAAGCC
ACCGAGACAT TTTCCCCAGT CGCAAATCAA CCGGTTGAAA GCACTCAAGA AGTTTCTAAA
ACTCCTTTAA CCAAACAAAA TTTAGCAGTC AAATCTACAC CAGCTATTTC TAAAGAAACC
CCTCAAAACA TTGATAGTAA TAAAATTATC ACTGTCCCCA AAGTATGGAA CACAGGCTAC
AAAGGAGAGG GAACTGTTGT AGCAATTATT GACTCAGGAC TAGATATCAA TCACGATGCT
CTCCAATTAA ATGATTCGAC AAAAGCAAAA TACCAAAACG AACAGCAAAT GAATGCTGCT
AAAGCAAAAG CTGGTATAAA CTATGGAAAA TGGTATAACA ACAAAGTAAT CTTTGGTCAC
AACTATGTTG ATGTCAATAC AGAGCTAAAA GAGGTGAAAA GCACTTCTCA TGGTATGCAC
GTAACCAGTA TCGCAACAGC TAATCCTAGC AAGAAAGATA CAAATGAATT AATCTATGGT
GTTGCTCCTG AAGCACAAGT AATGTTTATG AGAGTCTTCT CTGATGAAAA AAGAGGAACT
GGACCAGCCC TTTATGTTAA AGCTATTGAA GATGCCGTTA AACTCGGTGC TGACAGCATT
AATTTAAGTT TAGGTGGAGC TAATGGGTCT TTAGTTAATG CCGATGACCG ACTTATAAAA
GCTTTAGAGA TGGCTAGACT CGCTGGCGTT TCTGTTGTTA TAGCAGCAGG TAACGACGGT
ACATTTGGGA GTGGAGCATC AAAGCCTTCT GCTCTTTATC CTGATTATGG TTTAGTTGGT
AGTCCATCAA CAGCTCGTGA GGCCATTTCT GTAGCATCAT ATAATAATAC AACACTGGTT
AATAAAGTCT TCAACATTAT CGGATTAGAA AACAACAGAA ATCTCAACAA CGGATTAGCT
GCTTATGCAG ATCCTAAAGT TAGTGATAAG ACCTTTGAAG TAGGGAAACA ATATGATTAT
GTTTTCGTAG GAAAAGGAAA CGACAATGAT TATAAGGACA AAACTTTAAA TGGTAAAATC
GCCTTAATTG AACGTGGAGA TATTACTTTT ACAAAAAAAG TCGTCAATGC TATTAATCAC
GGTGCTGTGG GAGCTATTAT CTTTAATAAC AAAGCTGGGG AAGCTAATCT AACAATGAGT
TTAGATCCTG AAGCAAGTGC TATTCCTGCT ATTTTTACCC AAAAAGAGTT TGGAGATGTT
TTAGCTAAAA ACAACTATAA AATTGTATTT AACAATATCA AAAATAAACA AGCCAACCCT
AATGCAGGTG TCCTATCTGA CTTTTCAAGC TGGGGGTTAA CAGCAGACGG ACAATTAAAA
CCTGACTTAT CTGCTCCTGG AGGCTCTATT TACGCCGCTA TCAATGATAA TGAATATGAT
ATGATGAGTG GGACAAGTAT GGCTTCTCCC CATGTCGCTG GTGCTACTGC TCTAGTTAAA
CAATACTTAT TGAAAGAACA TCCAGAACTT AAAAAAGGTG ACATTGAAAG AACTGTCAAA
TACCTTCTTA TGAGTACTGC TAAAGCACAC CTAAACAAAG ATACAGGCGC TTACACCTCA
CCACGCCAAC AAGGAGCAGG TATTATCGAT GTCGCAGCAG CAGTTCAGAC AGGATTATAC
CTAACTGGTG GGGAAAACAA CTATGGCAGC GTTACATTAG GAAATATTAA AGATAAAATT
TCCTTTGATG TTACTGTTCA TAATATCAAT AAAGTTGCAA AAGATTTACA CTATACAACC
TATTTAAATA CTGATCAAGT TAAAGATGGC TTTGTCACAT TGGCTCCTCA ACAACTTGGT
ACATTTACAG GGAAAACGAT ACGGATTGAA CCAGGGCAAA CCCAAACGAT TACAATTGAT
ATAGATGTTT CGAAATACCA TGACATGTTA AAAAAAGTAA TGCCAAACGG CTATTTCCTA
GAAGGCTACG TACGTTTTAC AGACCCTGTT GATGGTGGGG AAGTTCTTAG TATTCCTTAT
GTTGGATTTA AGGGAGAATT CCAAAACTTA GAAGTTTTAG AAAAATCCAT TTATAAGCTT
GTTGCTAACA AAGAAAAGGG ATTTTATTTC CAACCAAAAC AAACAAACGA AGTTCCTGGT
TCAGAAGATT ATACTGCCTT AATGACTACA AGTTCAGAGC CTATCTACTC AACAGACGGT
ACTAGTCCTA TCCAATTGAA AGCCTTGGGA AGCTATAAGT CTATAGATGG AAAATGGATC
TTACAACTAG ATCAAAAAGG CCAGCCTCAT CTAGCCATTT CACCTAATGA TGACCAAAAT
CAAGATGCCG TTGCAGTGAA AGGTGTTTTC TTACGTAATT TCAATAATTT AAGAGCCAAA
GTCTATCGTG CAGATGATGT TAATTTACAA AAACCACTAT GGGTAAGTGC TCCCCAAGCA
GGAGATAAAA ATTACTACAG CGGAAATACT GAAAATCCAA AATCTACATT TTTATATGAC
ACAGAATGGA AAGGAACCAC TACTGATGGT ATTCCTTTAG AAGATGGAAA ATACAAATAC
GTTTTAACTT ATTACTCTGA TGTCCCTGGC TCTAAGCCAC AACAAATGGT GTTTGATATC
ACTTTGGATA GACAAGCTCC TACACTAACA ACAGCAACTT ATGACAAAGA TAGACGTATC
TTCAAAGCTC GTCCTGCAGT AGAACACGGG GAATCTGGTA TCTTTAGAGA ACAAGTTTTT
TACTTAAAAA AAGATAAAGA TGGTCATTAT AATAGCGTCT TACGTCAACA AGGAGAAGAC
GGTATCCTTG TTGAAGATAA CAAAGTATTT ATCAAACAAG AAAAGGATGG TAGCTTTATT
CTACCTAAAG AGGTTAACGA TTTCTCTCAT GTCTACTATA CTGTTGAAGA TTATGCAGGC
AATCTAGTAT CAGCAAAACT CGAAGATTTG ATCAATATTG GCAATAAAAA TGGTTTAGTA
AACGTCAAAG TGTTTAGCCC TGAGCTTAAC AGTAATGTCG ATATTGATTT CTCTTACTCT
GTCAAAGATG ACAAAGGTAA TATCATCAAA AAGCAACATC ACGGGAAAGA CCTCAATTTA
TTGAAATTGC CTTTTGGTAC CTATACGTTT GATCTATTCT TATACGATGA GGAACGAGCA
AATCTAATCA GTCCCAAAAG TGTCACTGTA ACTATTTCTG AAAAAGATAG CCTTAAAGAC
GTCTTATTTA AAGTTAACTT ACTCAAGAAA GCAGCCTTAC TCGTTGAATT TGACAAGCTT
TTACCAAAAG GAGCAACAGT CCAGTTGGTT ACTAAGACAA ATACTGTTGT TGATCTACCA
AAAGCAACTT ATTCTCCTAC TGACTATGGT AAAAACATAC CTGTAGGAGA CTATCGTTTA
AACGTAACGC TGCCTAGTGG GTATAGCACT TTAGAGAACT TAGATGATTT ACTTGTATCC
GTAAAAGAAG ATCAGGTAAA CCTAACAAAA TTGACGCTGA TTAATAAAGC TCCTCTGATT
AATGCCCTAG CAGAACAAAC TGATATTATT ACCCAGCCTG TGTTTTATAA TGCTGGAACT
CACTTAAAAA ATAATTACCT AGCTAATCTT GAAAAGGCAC AAACTTTAAT TAAAAATAGA
GTGGAACAAA CAAGTATTGA TAATGCTATT GCTGCTTTGA GAGAAAGTCG CCAAGCTCTT
AACGGTAAAG AAACAGATAC TTCTTTACTG GCAAAAGCTA TTTTAGCTGA AACAGAAATC
AAGGGAAACT ATCAATTTGT TAATGCTAGT CCATTAAGCC AATCAACTTA TATCAATCAA
GTCCAATTGG CGAAAAACCT TCTACAAAAA CCTAACGTCA CTCAATCAGA AGTAGACAAA
GCCTTAGAAA ATCTTGATAT TGCTAAAAAT CAATTAAATG GTCATGAAAC TGATTACTCT
GGTTTACACC ATATGATAAT TAAAGCAAAC GTTCTGAAAC AAACATCATC TAAATATCAG
AACGCCAGTC AATTTGCTAA AGAAAATTAT AATAACCTTA TCAAGAAAGC AGAATTGCTG
CTTTCCAATA GACAAGCTAC ACAAGCTCAA GTTGAAGAGT TATTAAACCA AATAAAAGCA
ACCGAACAAG AGCTTGATGG CCGTGATAGA GTTTCTTCCG CAGAGAATTA TAGTCAATCA
CTCAATGATA ATGACTCTCT CAATACCACA CCTATCAATC CGCCAAATCA GCCCCAGGCG
TTGATATTCA AAAAAGGCAT GACTAAAGAA AGTGAGGTTG CTCAGAAGCG TGTCTTAGGG
GTGACTAGCC AAACCGATAA TCAAAAGGTA AAGACAAACA AGCTTCCTAA AACAGGCGAA
AGCACTCCTA AAATAACCTA TACAATATTG CTATTTAGTC TCTCTATGCT AGGTCTGGCA
ACAATCAAAC TAAAGTCTAT CAAAAGAGAA TAA
 
Protein sequence
MNTKQRFSIR KYKLGAVSVL LGTLFFLGGI TNVAADSVIN KPSDIAVEQQ VKDSPTSIAN 
ETPTNNTSSA LASTAQDNLV TKANNSPTET QPVAESHSQA TETFSPVANQ PVESTQEVSK
TPLTKQNLAV KSTPAISKET PQNIDSNKII TVPKVWNTGY KGEGTVVAII DSGLDINHDA
LQLNDSTKAK YQNEQQMNAA KAKAGINYGK WYNNKVIFGH NYVDVNTELK EVKSTSHGMH
VTSIATANPS KKDTNELIYG VAPEAQVMFM RVFSDEKRGT GPALYVKAIE DAVKLGADSI
NLSLGGANGS LVNADDRLIK ALEMARLAGV SVVIAAGNDG TFGSGASKPS ALYPDYGLVG
SPSTAREAIS VASYNNTTLV NKVFNIIGLE NNRNLNNGLA AYADPKVSDK TFEVGKQYDY
VFVGKGNDND YKDKTLNGKI ALIERGDITF TKKVVNAINH GAVGAIIFNN KAGEANLTMS
LDPEASAIPA IFTQKEFGDV LAKNNYKIVF NNIKNKQANP NAGVLSDFSS WGLTADGQLK
PDLSAPGGSI YAAINDNEYD MMSGTSMASP HVAGATALVK QYLLKEHPEL KKGDIERTVK
YLLMSTAKAH LNKDTGAYTS PRQQGAGIID VAAAVQTGLY LTGGENNYGS VTLGNIKDKI
SFDVTVHNIN KVAKDLHYTT YLNTDQVKDG FVTLAPQQLG TFTGKTIRIE PGQTQTITID
IDVSKYHDML KKVMPNGYFL EGYVRFTDPV DGGEVLSIPY VGFKGEFQNL EVLEKSIYKL
VANKEKGFYF QPKQTNEVPG SEDYTALMTT SSEPIYSTDG TSPIQLKALG SYKSIDGKWI
LQLDQKGQPH LAISPNDDQN QDAVAVKGVF LRNFNNLRAK VYRADDVNLQ KPLWVSAPQA
GDKNYYSGNT ENPKSTFLYD TEWKGTTTDG IPLEDGKYKY VLTYYSDVPG SKPQQMVFDI
TLDRQAPTLT TATYDKDRRI FKARPAVEHG ESGIFREQVF YLKKDKDGHY NSVLRQQGED
GILVEDNKVF IKQEKDGSFI LPKEVNDFSH VYYTVEDYAG NLVSAKLEDL INIGNKNGLV
NVKVFSPELN SNVDIDFSYS VKDDKGNIIK KQHHGKDLNL LKLPFGTYTF DLFLYDEERA
NLISPKSVTV TISEKDSLKD VLFKVNLLKK AALLVEFDKL LPKGATVQLV TKTNTVVDLP
KATYSPTDYG KNIPVGDYRL NVTLPSGYST LENLDDLLVS VKEDQVNLTK LTLINKAPLI
NALAEQTDII TQPVFYNAGT HLKNNYLANL EKAQTLIKNR VEQTSIDNAI AALRESRQAL
NGKETDTSLL AKAILAETEI KGNYQFVNAS PLSQSTYINQ VQLAKNLLQK PNVTQSEVDK
ALENLDIAKN QLNGHETDYS GLHHMIIKAN VLKQTSSKYQ NASQFAKENY NNLIKKAELL
LSNRQATQAQ VEELLNQIKA TEQELDGRDR VSSAENYSQS LNDNDSLNTT PINPPNQPQA
LIFKKGMTKE SEVAQKRVLG VTSQTDNQKV KTNKLPKTGE STPKITYTIL LFSLSMLGLA
TIKLKSIKRE
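
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal sketch of that translation, not the tool used to produce this record. It assumes the input is the coding strand in frame, uses the table 11 codon assignments (which match the standard genetic code), and stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (first base varies slowest,
# bases ordered T, C, A, G); '*' marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record
print(translate("ATGAACACAA AACAGCGTTT TTCAATCCGG"))  # prints MNTKQRFSIR
```

The output matches the first ten residues of the protein sequence listed above.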