Gene Information |
Locus tag | HMPREF0424_0029 |
Symbol | |
ID | 8709153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 32916 |
End bp | 34022 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 646482152 |
Product | sortase family protein |
Protein accession | YP_003373300 |
Protein GI | 283782546 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
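The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is a minimal illustration, not part of the record. It assumes that Start bp and End bp are 1-based inclusive positions, that the gene is unspliced (as stated in the record), and that the gene span includes the terminal stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


# Values taken from the record: Start bp 32916, End bp 34022.
length = gene_length(32916, 34022)
print(length)                  # 1107, matching "Gene Length"
print(protein_length(length))  # 368, matching "Protein Length"
```

Both results agree with the Gene Length (1107 bp) and Protein Length (368 aa) fields, which is consistent with the assumed coordinate convention.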
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAATCTG CTAATATGCA GGAAGGAAAA ACGAAGCGTA GCAGCTCGAA TTTCTGGTGG CAAGCCGCTG GAATTATTGG CGAACTGTTG ATAGCTTTTG CAGTTATTTG CGCATTGTAC ATTGCATGGC AAATGTGGTG GACTGGAGAG CAGGCTGAGC ACACTCAAGT AGAGGCACGT CAAGCAATAT CTTGGCAAAA CCCAAAAACT GGTGATAAAA CGAAGATTGC TTTACCACAG AATACTGACC CACCTGCTTT AAGCAAACCA AAGTCTGAAG ACCTTGTAGC TCGTTTATAC ATACCACGTT TTGGAAATCA GTGGGAACGA AATGTTGTAG AAGGCATTAC GCCAGACATC TTAAGCAAGC GTGGTTTAGG TCATTATCCA ACCACACAAA TGCCTGGTGC AATAGGTAAT TTTGCAGTAG CAGGTCACCG AAATGGTTAT GGTCAGCCAT TGGGAGATGT AGACTTACTG AAGCCTGGCG ACGCAATAGT AGTTCGCACA AAGGATTATT GGTTCGTCTA TAAGTACACT TCTTACAAGA TTGTTACACC AGATCACGGT GAAGTTATTG ATGCAAATCC TGATCATCCT GGTGAGAAAC CAACAAAACG TATGCTTACT CTTACAACGT GCGAACCTAA GTATACTGCT GCAACTCATC GCTGGATTAG CTACGCAAAA TTCGCTTATT GGGCAAAGAT TAGCGATGGC ATTCCAAAAG AACTTTCTAC TATTGATGAG CAAGGAAAAG TGAAGTTTAT TAACAATGAG CAACAATCTC TAGTTTCTCG TCTTTCTTCA CTAGTGCCAG TTATGGTGGT TGTTTTATTA GTGTATATGA TGATTTTCGT AGCAGGAGCA ATTGCTTGGC AATGGCCAGA ACTTCGCGCT ATTCATTCTG GATTAAAGAA AAAGCCTGAT GCAAGCATTT TTGGCGGTCT TTTGCGCATA CAACCAGGTA TTACAGCGAT TCGTTGGATT CTTTCGATTC TGATTTATTT CTTCATTGTA TTAGCATTAT TCCAGTGGAT TTTCCCTTGG GGCGCATCTA CAATTCCATT CCTTCAGCAG ATGTCTAATT ACTCAACAAC TGTATGA
|
Protein sequence | MQSANMQEGK TKRSSSNFWW QAAGIIGELL IAFAVICALY IAWQMWWTGE QAEHTQVEAR QAISWQNPKT GDKTKIALPQ NTDPPALSKP KSEDLVARLY IPRFGNQWER NVVEGITPDI LSKRGLGHYP TTQMPGAIGN FAVAGHRNGY GQPLGDVDLL KPGDAIVVRT KDYWFVYKYT SYKIVTPDHG EVIDANPDHP GEKPTKRMLT LTTCEPKYTA ATHRWISYAK FAYWAKISDG IPKELSTIDE QGKVKFINNE QQSLVSRLSS LVPVMVVVLL VYMMIFVAGA IAWQWPELRA IHSGLKKKPD ASIFGGLLRI QPGITAIRWI LSILIYFFIV LALFQWIFPW GASTIPFLQQ MSNYSTTV