Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0427 |
Symbol | |
ID | 8709078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 467417 |
End bp | 468514 |
Gene Length | 1098 bp |
Protein Length | 365 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 646482542 |
Product | sortase family protein |
Protein accession | YP_003373674 |
Protein GI | 283782920 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000168911 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAGCTAG CTGATTTTGC AGAGTCTGCT CGCGCGCTAT TTAAGCGACT GACTCGTATG CGCGCTAATA AGTCCTCCAA AAGTGCGCGA GATTCCGCAA GCTCGCAAGA TTACGCAAGC TCTGAAATCG CTGCATCTTC CGAAAATACT GAATCTGCGC AATCTCCATC ATCGCAAGAG CAGTCTTCCC TAAAACCAAT GTCTACACTT CGCAAGCTGG CTGAGCCGAT TGCGCTTGTT GCTATTGGCA TTTTGTGTTT CAGCTATCCT GTGGTTTCTA CTTTGGTCAA CAATCATGCT GCCAAGGAAC TCTCAATCGA GTACGACAAA TTAAATAAGG AAAAGCCTAA AGAAAATCGC GCGGAAATTT TGCGAAAAGC GCGTGAATAC AACGCTCGCC ACAAGGCGAT TATTAGCGCG GACCCGTATA ACGGCAATAA CGATTACATG GACACTCCCG AATATAAAGA GTACGAAAAA GTGCTTAGTG AGCCGATGGG AATTATGGGC ATCGTGAAAA TACCAAAAAT TGGCGTGAGA CTGCCGATTT ACCACGGAAC TACTCAAGAT ACACTAGCAA TGGGCGCGGG GCATTTGTAC GGCACGGATT TGCCGGTGGG GGGCAAAAGC AGGCACACGG TTGTGACGGC GCATACGGGT ATGCCGGATG CCACGATGTT CGATGATTTA AACACGTTGA AAAAAGGTGA CTACTTCTAT TTTGATGTGC AAGGAAAGAC TCTTCGGTAC AAAGTGTTTC GCATAAATGT GGTGGAGCCG AACGATATTC GTTTGCTGCG GCGTGAGAAG GGGCGCGACT TGGCGACACT GATTACGTGC ACGCCGTATG GAATTAACAC GCACAGGCTG CTCGTGACGG GGTATCGCGT GCTGCCGGAT CCTGCTAACG TGCCGGGTGA CCATATGCAG TGGCCGTTGT GGATGACGCT GTTTGTGATA TCGATGGTGA TGTCTGCGGT GTTGATGGCG ATGATGCTGG TTGCGTCGTT GCGAAAGAAG AATGGTGTGA GTAGTCTGCA AGGCAGGCAT TTGCTGGCGG TTTCGCGCAA AATGCTGCGT AAGTTACGGC GCAAGTAG
|
Protein sequence | MKLADFAESA RALFKRLTRM RANKSSKSAR DSASSQDYAS SEIAASSENT ESAQSPSSQE QSSLKPMSTL RKLAEPIALV AIGILCFSYP VVSTLVNNHA AKELSIEYDK LNKEKPKENR AEILRKAREY NARHKAIISA DPYNGNNDYM DTPEYKEYEK VLSEPMGIMG IVKIPKIGVR LPIYHGTTQD TLAMGAGHLY GTDLPVGGKS RHTVVTAHTG MPDATMFDDL NTLKKGDYFY FDVQGKTLRY KVFRINVVEP NDIRLLRREK GRDLATLITC TPYGINTHRL LVTGYRVLPD PANVPGDHMQ WPLWMTLFVI SMVMSAVLMA MMLVASLRKK NGVSSLQGRH LLAVSRKMLR KLRRK
|
| |