Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HMPREF0424_0422 |
Symbol | |
ID | 8708777 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 454072 |
End bp | 455217 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 646482537 |
Product | sortase family protein |
Protein accession | YP_003373669 |
Protein GI | 283782915 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3764] Sortase (surface protein transpeptidase) |
TIGRFAM ID | [TIGR01076] LPXTG-site transpeptidase (sortase) family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.102959 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAA ACGACGCTGT TGCTAAAGTA TATGCTTTCC TCAACAAAAT CTCCCAAAAT AAGCTTTTCA AAAACCAAAC AAACGCAACC CAAACTAACG AAACCAACTT AACCGAAACC AACTTAACCG AAACCAACTT AACCGAATCA AACGCAACCG AAAATAACGC AACTCAAGCC CACGCAACCG AAGCAAAATC CGCCGAAAGT AATCAAAAAT CTAAGCAAAA AACCAATCGA AAATCTAAGC TCCGCCGCAT CGTCGAGCCA ATCGCATTCG TGTTGGCAGG CATTTTGTGC TTCAGTTACC CTGTTGTTTC AACACTTTGG AACAATCGCG TGTCGAAGGA AATTTCTAAC GCGTACGACA AGTATAACCA CGATCAGGCT GGTGATGTGC GCCGCGCTCA CATTCGCGCA GCGAAGCTTT ACAATAAAAG TCGCAAGAAT ATGCTTACCA CGGATCCGTA TGGTCCGGAT GGTCAAAAAG ACGTAACTAA CACGCCTGAA TACAAGCGTT ATCTTAAGGC ACTTGAGGAG CCTATGGGCA TTATCGGCAT CGTAAAAATT CCGAAAATTG GCGTAAAACT TCCTATTTAT CACGGCAGTT CGCAGGAAGT TTTAGCGCAC GGCGCTGGTC ATTTGTACGG CACAGATTTG CCGGTTGGTG GCAAGAACCG CCACACAGTT ATTACCGCGC ACACGGGTCT TGCGGATGCA ACCATGTTCG ATGATTTGGT GAATTTAAAG AAGGGCGACT ACTTCTACCT CGACGTGCAA GGCGAAACTT TACGATACAA AGTGTTCCGC ATCAGCGTGG TTGAGCCACA CGATATTAGT TTGTTGCAGC GCGAAAAGGG TCGCGACTTG GCGACGCTGC TAACGTGTAC CCCGTATGGT GTGAACTCGC ATAGGCTTTT GGTGACGGGG TATCGTGTGC TGCCGGACCC TGTGAAGCCG CCGGATGACC ACGTGCAATG GCCGCTTTGG ATGACGCTAT TCGTGATTGC AATGGCGTTC TCATTGATTG TTTTGTCCAT GATGATTGCT GCTGCAACGT CTAAGCGAGG GCGACAGCTC GACATTCGCG GCAAGCATTT GCTGATCCTT TCGCGCAAAA TGCTGCGTAA GTTGCGTCGC GAGTAA
|
Protein sequence | MKLNDAVAKV YAFLNKISQN KLFKNQTNAT QTNETNLTET NLTETNLTES NATENNATQA HATEAKSAES NQKSKQKTNR KSKLRRIVEP IAFVLAGILC FSYPVVSTLW NNRVSKEISN AYDKYNHDQA GDVRRAHIRA AKLYNKSRKN MLTTDPYGPD GQKDVTNTPE YKRYLKALEE PMGIIGIVKI PKIGVKLPIY HGSSQEVLAH GAGHLYGTDL PVGGKNRHTV ITAHTGLADA TMFDDLVNLK KGDYFYLDVQ GETLRYKVFR ISVVEPHDIS LLQREKGRDL ATLLTCTPYG VNSHRLLVTG YRVLPDPVKP PDDHVQWPLW MTLFVIAMAF SLIVLSMMIA AATSKRGRQL DIRGKHLLIL SRKMLRKLRR E
|
| |