Gene Information |
Locus tag | HMPREF0424_0423 |
Symbol | |
ID | 8709948 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 456120 |
End bp | 462248 |
Gene Length | 6129 bp |
Protein Length | 2042 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 646482538 |
Product | PA domain protein |
Protein accession | YP_003373670 |
Protein GI | 283782916 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain |
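The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers are consistent with that convention) and that the CDS is complete and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 456120
END_BP = 462248
GENE_LENGTH_BP = 6129
PROTEIN_LENGTH_AA = 2042


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of residues encoded by a complete, unspliced CDS.

    Assumes the CDS length is a multiple of three and that the last
    codon is a stop codon, which is not translated into a residue.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

With these values the span is 6129 bp, which gives 2043 codons, or 2042 residues once the stop codon is excluded.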
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0201874 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCA AACGAACGTC ATATCTGACT CGTTTAGCAG CATTAGGCGT GGCTGCGGCG ACGCTTATGG TACCGCTTGG GCAAGTTCCT GCGGCATTTG CAAAAAATGA TAATGAAACT AAAAAAGATG GCAATGCACT TGTCTCTAAG ATGCTTGATT CATCTTTAAA AGCTAATAGT GAGATTAAGT CTATAGAAGA TTTACGCCAT GTTAATCCAA CTCTTGCTAA AGCTTTTGAG AATTACAGTA AAAAAGATCA ATTTAATAAA AATTCAAATG CTCAAGTAAC TGTTGTTGTT ACGTTGAAAA ATTATCGTTC GCAACTCACG GATGCTGACG AAAAAGCCAA CATTCATGAG CAGAATGTTT TAATTAACCG CGTAAAGTCA AAGTACAATA TGACGGTTCG TCGCCAGGTT GGTTATTTAA TGAGTGCTTT TGAAGCTACT CTCCCTGAAA AACACGTTCA AGATTTGAAA CGTGAGCCAG GTGTAGTTTC TGTCGCAAAA GAACGTTTAT ATCATCCAAT GGAAAATTAT GCTCGCGATT TACAAGGCGT GCAGACGGTG TTTAAAAAGC ATCACCTCGA TGGAACCGGA ATGCTTGTTT CTATTATCGA TACGGGCATC GATCCAAATC ATCAAGATAT GAAGCTTGAC GCTTCAGCAA AAAAGAATAT TCGCTTGAAG CCAACTGGTA AAGGAAATAC AACAGATAAA GTTCCTGCAG GTTTCAATTA CGCTGATGAA AATACTAATT TCACAGACGC TAATGGTGAA CAGCACGGTA TGCACGTTGC CGGTATTGTG GCTGCTAACG GCGATGAAAA TGGTGCTCCA GCATCTGAAA ATCATCGTGT TGATGGCATA GCTCCAAACG CTCAGCTGCT CGCTATGAAA GTCTTCTCAA ATGTGCCAGG TTCGCATGGT GCTCGTGATG TTGATATTGT TGCTGCAATC GAAGATTCTG TAAAGCTTGG CGCAGACGTT ATTAACATGT CTCTTGGCTC AGATAACGGT TTCGGTGGAA CGTCAAGTGC TACTAGTGTC GCGTTAAAGA AGGCTCGCGA AGCTGGTGTG CTTCCAGTTA TTTCTGCAGG AAACTCCGGC TTGAACTTCT CTGAAAGCGG TGGCATAGAT GATAGTTTAG GCAAGTGGGA TGATGCAACT CTCGGCTCGC CATCTTCTTA TCCTTCTGCT TTCAGCGTTG CTTCTGTAGA AAATTCCAAC ATTACTCAGC AGTCTGCAAA TTGGGTTGGA AAAGATGGCA AGTCACATAC TTTGCCGTAC AGCTATTCTA TTGGTTCTTT AACAGATGCA CTATCTCAGC ATGAACTTGT GGATGCTAAA AAAGCTACTA AAGCTGATGT TAAAGATCTT GATCTTACCG GTAAATATGC TCTTGTAGAA CGTGGTGGAA TCAGTTTTAC TGAAAAGTTT AATAATGCGA TTTCTAAAGG TGCTAAAGGT GTGATTGTTT ACAATCACGA CAAAGATTCA TCCTTATTCA TAGGAATGGG CGGATTGGAG AAAATCAAAA ATTGCTTTGG TGCATCTATT CCTCGTTCTT ATGCATTGGA AATTAAAGCA GCATTAGATA AAGGTGAGAA AGTAAAGATA GCTTTCACTG AACAATTCGT TACTATCGCC AATCCGGATG ATGGGAAGCC TTCTAGCTTT ACTTCTTGGG GTCCAACTCC AGAATTTGAT TTTAAACCTC AGATTGCTGG TATCGGTGGT AATGTTTGGT CCACGCAGAA TGGAAATAAG TACACCAGCA TGTCTGGTAC TTCCATGGCT 
GCTCCAAACG TTTCCGGTCT TTCAGCGTTG GTGATGGAAA GCTATAAGAA TCGTTTCCCT GAGCTTAACA AGTCTGAAAT GGCTACCCGT GTAAGCCAAG CATTAATGAA TACAGCAAGT ATTATTGGAC ATAAAGATTC TGCAGATAGT AAGAAAATTC CTTATGCTCC TCGTCAAATC GGTGCTGGCT TAGCTCAGGT AGATAAAGCT GTTGCTACTG ACGTGATTGC AACAGTAAAC GGCAATTCGT ATGTGGCTCT TAAAAACGTG AATTCTAATC GCGAGTTTAC GGTTACTTTA CATAATTATG GTAAGAAGCC AGTAAAATTT GCTGTCCCAG ATCAGGAAGT TGTAAATGAG TCTAATGAGG TTAATAAAGA TACAGTAACA TCTATTAGTA ATACTGAAAC GTTGCAATCA GAAACGCGTA ATGTTACAGT AAAGCCTGGA AAAACCGCTG ATGTTAAGTT TACGTTAAGC CCAAATCGTT CTAAAGGCAA TCATTATATT GAAGGTTGGG TGCGATTTGA ATCGCTTACT GATACTCAGC CAGATGTTTC TGTGCCATAT CTTGGCTTTG TTGGCGATTG GAACGATGAA CCTATAATGG TAAAACCGGG AGAAAGCTAT AGTGCAAAAC ATAGTGATCT TACGACATCG TTGCTTTCGG TAGCTGGTTT TATGGGTAAA CTTCCGGTTA ATAATGAAAC TAAACCTTAT GTACTTGGCG AATTTTCACC TGCAAATCAA GATGGCTGGA TGGATTATGT TGTTCCATCT ATGGCTATAT TCCGCGGTGC ATCTGACATA AAGTATTCTG TATTGGATTC TCATAATAAA ACTAAGATTG TTCTTGGAAC GGAACACAAT GTGCATCGTT CTACTATTGC TAATGCTGGA ACAACAGATG GTTTAGCTGG AAATTTCGAC GGGTCTGTGT GGAATCCAAA AACAAGTAGG TTTGACATTC TTCCTGATGG ATGGTACACG TATCGTATTC AAGCTCGCTT AGGAGATAAG TTTGATTGGC AAACTTATGA TATGAAGGTA GCAATCGATA ATCATGGACC TAAGGTCACA GTATCTGATC GTGATGATAA AGGCAATGTG AAGATTGTTA TTAGTGATGA ATTGAGTCCT GATCCTGGCG TGCCAACTAT TCATCTTCCT GGTGTAAAGG ATGCTTTGAA CTATGGTGAT GAAGACACTC AATGCCCGTT GAATGAAGCA ACTCGTACGC GAGTGTGCAA GTCTATTCAC ATTGGTAATG AAGCTCAATA CATAGATGTG CTTGTTCGAG ACAATGCTAT GAATCCAACT GAGGTTAAGA AGGTATTCGA CACATATGTC CCGAATGGTT CAGGAAAAGC TGAGAAGAAC AAGAAGTTTA TTATTCCAAA TGAACAAAAA CTTATTTCTC ACCAAATTAA AGCAAATCAG TTAGAAGAAA GTAAGACTAA CCCAGATATT AATGGTAAGT TCTATCTTGA GGGATATCTC ACTAAAGATG TTGCTTCTAT AAAAGCTTAT GTGACTCCTA AAGACGGTAA AAAGCAGGAA GTCAAATTAG CTAAAAAGCG CAAGGGCGAG AATACTTTCT ATGCGTTTGT GCCACTGAAA AATGGCAAAA ATACTGTAAT GCTTAAAGCT TTTAATAGTA ATGGTGGCCG TATTGGCAAT AAGAAACTTG AGTTAACGTT CAATGCGCAA ACACCTAAGA TTACTGTTAA TGGCTTAGAT GATAAGGGTA ATTTGCCTGT AGGTGCAGAT GGTAAAGTAA CTGTTGCTGG TAAAGTAACT GATACTCCTG 
GAGATACTGT AAAGCTAAAG CTTACTTATC AAAAGGCTGC TGACAACACT ACTCCAGCTA ATGCAGTTGC TGGTTCTGTA AACAATCATG CTTCTGGTTC TGCTTCTGGA TCTACTATTA AAGCTGTTTC TGGTGCTAAT ACAACTTCAG CAACGAGTAT TGATGCAAAT GTAAAGAAAG CAGATACAGC TGCAGCTAGT AAAGAAACTG AAGAAGTAGT AACTGTTGGC AAAGACGGTA GCTTCAAAAC AGTTATTACT CCTGCCGCTA AAGCTGTTAT GGTAACTCTT GTAGCTACTG ATTCTGCTGA AAACACTACA ACTATAGGTT TGGCTCTCGC TGGTCGTTCA ACTCCTACAC CTAAACCTGC AGCTGCAACA AAAGATTTCG TGCTTACTAA TGCTGGCTCT ATGGGTGCTT ATGCGTGGTT AATTCATAAG AAGAATTATG GTCCAGACGT TATTGGCTCT GATTACTTTG TTGCTCAAGG TGATGTGAGT TCCAAGGTGA CTAGCGTGGT GTTTACTCCA GCTTCGCGTT TTGATGCTAC AACTAAGAAT CTTGTAACTC CAAACCCATT GCGTGCTGAA ATTAAGAACG GAAAGTTTAG TGTAAAGCTT CCAATGCATC CGGGTATAAA TGATTTCCGC TTGCAGGTGA ATACTAAAGC AGATGACGAA GATGCTGATG AAGATGATGA AACGAATGTA ATCGATACTC CTGCAGCATT CTACTTCGAC ATTACGCCTC CAACGGCGCA TTTCGATACT CCAAAGCTTT ATGGTCACAC GTTGTTCACC AATCGTGACA CCGTGACTTT CAGTGGAAAT GTATCGGATG ATGCTTTCGG TCAAAGTTTG CGAATTAACA ACGATTCCGT TGGTGATTTC TACACTTTGG ATACAAATGG TAAAGAAACA ACTCGTCGCC AATTCTCAAC TGACATTCCA GTAGACAATG GTGATAAGTT GCTGCTGCAC TTAAGTGATC AGGTTGAATC TCATTTACTT AGTGTGATTC CAGTTGTTCT TGACGAAACG GATCCTACTG TTGAAGCAGG TTTGGTTGAA GGTGAAGAAA TCGATGATGG TCACGAAATT ACTGTTAAAG CAACAGATGA TAATTTGAAG TCTTTGCGCG TTATGGTTGA TGGTCGTCCA GTAGCGTATA CAGAAAATTA TCTTCCAACT ACAGCAGTAG AGAACACATT AGTGGATGTT TCTAAGGTTA ACGATAATCC AACTCCAGAT AAGCCAGGTT CTACAACATT AACGTTGAAG ATTCCTACTA AGAAGTTGAC TGATGGTTCT CACACTATCA CCATTGAGGC GACCGATTTT GCCGGTAATA CAGCAAGCGC CACAAGTGAT AATGCAAAGG GAGCTACTGC ATATACATTC AAAGTGAAGC ATGGTGTGAA GCCTGGTGAG GATCCGTCTA AGCCTGGTCA AGATCCAAAG AAACCTGGTG AGGATCCGAA GAAGCCTGGC GAGGATCCGG CTAAGCCAAA GAATCCAGGA ACCAAGGATG AATCAACTCA AACTGATTCT GCCCCAATTC CTCCTGCTTC TACAGAAAAG ATCTCAAAGA CTATTGCTGG CTTAGACACC AAGAATGTTA CAGCTGGAAA GGAAGTAAAG ATTTATATTT CTGGAAAGCC TGCAGCTTTG AAGAATGCTA AAGCCAATGC TCGCATGTAT GCGTTCATTT ACTCTGATCC TAAGGATCTT AAGGGTGAAG ATGGTCATGA ATATGTGACA GTGCGTACTG GTGAGAACGG TAAGTACTAT TTCAAGGTTT TGATTCCTAG 
CGGATATTCA GGTAAGCACA CAATCTTACT TATGGATGCA AATGGTAACC AGATTGCTTC TGGAGAGGTG TCTGTATCTT CCTCTGCTTC TTCTGATAGT GGCAGCCAAA CTGATTCTGA TACTAAGAAT AATCAGAATA ATCAGAACCA GAGTGGTAGC GGTAATACTG GTAACGCTGG TAATACTGGT AATACTGGTA ATAGCGAAGG TGGAAGCTCA GGTCACGGTA AGGATAGTGG CAACAACGGT GGCAGCACTA GCGGTAATGC AAGCGGTGAT ACCAACAATG ATAGCCGCGG TGGTAATCAT GGCGGTAGCC ACAGCTCTAA TCATGGCGGT AGCCACAGCT CTAACCACGG CGGTAATCAT GGTGGAAAAT CCTTTACTAC TGGAGGTAAT TCCAGTAATA ATTCGGAGTC GTCCTCAAAG GATGATGCGG ATTCTACTAA TGCAGGCGCA TCCGCTGCAT CCGGCGATGC CGCTGCCAAT GGCAACGCAA GTTCTTCCAA TGGCAACGCA AGTGCTGCGA ATGGCAACGC TGCTGCAAAG CAGAAGGCTG CCGCTAAGGC TGCCGCCAAG AAGATGACTT CTGATTTGGC GATGACTGGT AGCAGTGTGA TTGGCATGGT TGTGACGTTT ATGGTGTTGC TTGCAAGCGG TGCTGTAACG TTTAAAACTC GCCGCCGTCA TGTGCACGCA AACAAGTAA
|
Protein sequence | MKFKRTSYLT RLAALGVAAA TLMVPLGQVP AAFAKNDNET KKDGNALVSK MLDSSLKANS EIKSIEDLRH VNPTLAKAFE NYSKKDQFNK NSNAQVTVVV TLKNYRSQLT DADEKANIHE QNVLINRVKS KYNMTVRRQV GYLMSAFEAT LPEKHVQDLK REPGVVSVAK ERLYHPMENY ARDLQGVQTV FKKHHLDGTG MLVSIIDTGI DPNHQDMKLD ASAKKNIRLK PTGKGNTTDK VPAGFNYADE NTNFTDANGE QHGMHVAGIV AANGDENGAP ASENHRVDGI APNAQLLAMK VFSNVPGSHG ARDVDIVAAI EDSVKLGADV INMSLGSDNG FGGTSSATSV ALKKAREAGV LPVISAGNSG LNFSESGGID DSLGKWDDAT LGSPSSYPSA FSVASVENSN ITQQSANWVG KDGKSHTLPY SYSIGSLTDA LSQHELVDAK KATKADVKDL DLTGKYALVE RGGISFTEKF NNAISKGAKG VIVYNHDKDS SLFIGMGGLE KIKNCFGASI PRSYALEIKA ALDKGEKVKI AFTEQFVTIA NPDDGKPSSF TSWGPTPEFD FKPQIAGIGG NVWSTQNGNK YTSMSGTSMA APNVSGLSAL VMESYKNRFP ELNKSEMATR VSQALMNTAS IIGHKDSADS KKIPYAPRQI GAGLAQVDKA VATDVIATVN GNSYVALKNV NSNREFTVTL HNYGKKPVKF AVPDQEVVNE SNEVNKDTVT SISNTETLQS ETRNVTVKPG KTADVKFTLS PNRSKGNHYI EGWVRFESLT DTQPDVSVPY LGFVGDWNDE PIMVKPGESY SAKHSDLTTS LLSVAGFMGK LPVNNETKPY VLGEFSPANQ DGWMDYVVPS MAIFRGASDI KYSVLDSHNK TKIVLGTEHN VHRSTIANAG TTDGLAGNFD GSVWNPKTSR FDILPDGWYT YRIQARLGDK FDWQTYDMKV AIDNHGPKVT VSDRDDKGNV KIVISDELSP DPGVPTIHLP GVKDALNYGD EDTQCPLNEA TRTRVCKSIH IGNEAQYIDV LVRDNAMNPT EVKKVFDTYV PNGSGKAEKN KKFIIPNEQK LISHQIKANQ LEESKTNPDI NGKFYLEGYL TKDVASIKAY VTPKDGKKQE VKLAKKRKGE NTFYAFVPLK NGKNTVMLKA FNSNGGRIGN KKLELTFNAQ TPKITVNGLD DKGNLPVGAD GKVTVAGKVT DTPGDTVKLK LTYQKAADNT TPANAVAGSV NNHASGSASG STIKAVSGAN TTSATSIDAN VKKADTAAAS KETEEVVTVG KDGSFKTVIT PAAKAVMVTL VATDSAENTT TIGLALAGRS TPTPKPAAAT KDFVLTNAGS MGAYAWLIHK KNYGPDVIGS DYFVAQGDVS SKVTSVVFTP ASRFDATTKN LVTPNPLRAE IKNGKFSVKL PMHPGINDFR LQVNTKADDE DADEDDETNV IDTPAAFYFD ITPPTAHFDT PKLYGHTLFT NRDTVTFSGN VSDDAFGQSL RINNDSVGDF YTLDTNGKET TRRQFSTDIP VDNGDKLLLH LSDQVESHLL SVIPVVLDET DPTVEAGLVE GEEIDDGHEI TVKATDDNLK SLRVMVDGRP VAYTENYLPT TAVENTLVDV SKVNDNPTPD KPGSTTLTLK IPTKKLTDGS HTITIEATDF AGNTASATSD NAKGATAYTF KVKHGVKPGE DPSKPGQDPK KPGEDPKKPG EDPAKPKNPG TKDESTQTDS APIPPASTEK ISKTIAGLDT KNVTAGKEVK IYISGKPAAL KNAKANARMY AFIYSDPKDL KGEDGHEYVT VRTGENGKYY 
FKVLIPSGYS GKHTILLMDA NGNQIASGEV SVSSSASSDS GSQTDSDTKN NQNNQNQSGS GNTGNAGNTG NTGNSEGGSS GHGKDSGNNG GSTSGNASGD TNNDSRGGNH GGSHSSNHGG SHSSNHGGNH GGKSFTTGGN SSNNSESSSK DDADSTNAGA SAASGDAAAN GNASSSNGNA SAANGNAAAK QKAAAKAAAK KMTSDLAMTG SSVIGMVVTF MVLLASGAVT FKTRRRHVHA NK
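As a spot check that the protein sequence corresponds to the gene sequence, the first and last few codons can be translated by hand. The sketch below is a minimal illustration using only the standard library. It builds the codon table from the usual TCAG-ordered amino acid string, on the understanding that translation table 11 assigns sense codons the same amino acids as the standard code and differs mainly in which start codons are permitted. The helper name `translate` and the two short fragments are chosen for illustration; the full 6129 bp sequence is not repeated here.

```python
from itertools import product

# Codons enumerated in T, C, A, G order with the third base varying fastest,
# paired with the matching amino acid string ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First 30 bases and last 9 bases of the gene sequence in this record.
    first_30 = "ATGAAGTTCA AACGAACGTC ATATCTGACT"
    last_9 = "AACAAGTAA"
    print(translate(first_30))  # MKFKRTSYLT
    print(translate(last_9))    # NK*
```

The first fragment reproduces the opening ten residues of the protein sequence, and the second ends in a stop codon after the final two residues, N and K.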