Gene Information |
Locus tag | HMPREF0424_0005 |
Symbol | |
ID | 8709593 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gardnerella vaginalis 409-05 |
Kingdom | Bacteria |
Replicon accession | NC_013721 |
Strand | + |
Start bp | 5200 |
End bp | 7962 |
Gene Length | 2763 bp |
Protein Length | 920 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 646482128 |
Product | LPXTG-motif cell wall anchor domain protein |
Protein accession | YP_003373279 |
Protein GI | 283782525 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0810] Periplasmic protein TonB, links inner and outer membranes |
TIGRFAM ID | [TIGR02331] Rib/alpha/Esp surface antigen repeat |
| |
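The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal check, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon; both assumptions agree with the values in this record, but neither is stated by it. The function names are hypothetical and chosen for illustration only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(5200, 7962)
print(length)                     # 2763
print(protein_length_aa(length))  # 920
```

Both results match the Gene Length (2763 bp) and Protein Length (920 aa) rows.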
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.0000000417696 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
| |
Sequence |
Gene sequence | ATGAGAAAAG TAAGTACCGT TCACACAATT AGTGGAATTG TATTAACTTT TGGTTTGATC ATTTCATCGC TGGTTTACAT ACCTTCGCTA GCAACAACCG CAGCGGCAGA AGAAACATCA CCAGTGCACG TACCTCTTAC CGCACAAACC GCTCACGACA TTTCTACTGG TCGAATTCGT TTGGAAAATA ATCAGATTGA TTCTTTGCTT TACCATAAAG CCGCGCTAGA TCCAGATCAA GATAGCGACG GCGACGGTCT GAAGAACAGC GAAGAGCTTT ACACTTACGT AAAAAACGGA CGCACATATT ACGGATATAA TTCGCATCCA CTTATTAAGG ACACCGACGG CGACGGATTA GATGACAAAA ACGATAGCAA TCCTCGCAGG TGGGATATTA GCCCTCGCGA TATGGCGCTT TTTATGGAGC TTGTGTATAG GGAAGACAGC TACGTCAAAG AAGTACTTGA CTACTCAAAA CCGCTCAGGA TTATTTATGA AAATCGCAAT GAGTACCGCC TTATGCACAA CGAGCTTGCA CCATTCTGGA AAGTTAAGGA AACCTACCAC TTCGGGAGTG GGTTTGACGC TATCCTTTTT GAAAATGTGA ATCCTGCGTA CCCATTTATT AAGCAAAACG GCGTGCAAGT ATTAGCAATT CGCGGCACTA AAGGCGCAAC AGATGTATGG AATGATGTCT TGCTTGGAAC TGGCTCAAAC CCAGGCCAAG CTGCTGATAT AGATAAATTA ATTGATGAAT ATAAACAAAA GAACACTGTA AGCAATTTAT ATGTTACAGG TCACTCTTTA GGTGGGTACT TAGCACAAAG AGCCTTAATT CGAGCTAAAA GTAGTGGCGC AACTTTTAAT ATTAAAGCTT ATACTTTTAA CGCTCCGAGA ATCAAAGGTA ATATATTCAA CAGATGGCTT TGGGAAACAT CTGATTTTGG CGATAAACTA ACTAAAGAAG GTTATGCCGT TCATTATAAA GTTGATAACG ATACAGTAAT AGGCGGAATA GGAAATTTCG AAGGTGCAAT AAGCGTAGGA AGTTCTGGAG CCGGTCATGG ATCCAGAAGC TACTTTGAGC CTAGAATGAA TGCTTTCCAA GGATTCACAG TTGGCGAACG CAATAAAATT GAGGGAACCG GTCGCAAAAT TGATGCACTC GACGGATTAG TTAACGAACC AGTTATTAAA ACTGACGAAC AAGCTTTCGA ACCTAAAATT ACGAATATTG ACATTATGGA AAATGATCCA TTGCCAAAAG GCACAACAAT AAAGCATCTT CTTAATCCTA ACGATATACC GCAAGGCTCT ACCATTACAG ACATTACAGA ATACGACAAA ATTGACACTT CTCAATGCGG CAACTACGAA GGGAAAATAC TACTCGTACT ATCTGGTAGA ATATTAAAAA CTATTATAGT TCCAATAACC ATTCATAAAC GATTGGCTAA TATCGAGGAA CTTCCAAATG TTACTGCTGA ACCTGGCAAG AAAATAGAAG TCAAAAAAGA CAATAAAATC CCCGATAATT TCGACTTCAA ACCATACTTA AAAAACTTAC CAGAAAACAG CACAGTACAA GTTGTAAAAC AACCAAACAC AACAAACACT GGAGAAACAA CATTTATCAT AGAAATAACC TTTGCAAACA AAGCAAAACA GAAAGTAACT CTTAAAGCTC ACGTTGTAGA AATCATTCCA GCAACACCAC TAAACCCAGC TAAAAAGAGT ATTACGTGGG CCGAGTTAAT TCCTGCAAAG AAATCAACAC CATGGATCGA GCTAACACCA 
GCGACACCAG CTCCAACACC AACTCCCGTA CCTCCGACAC CTGTTCCACC GACACCGGAA CCAACGCCTT CGCCTTCGCC TACGCCGACA CCGACGCCTG ATCAACAACC TGCAGTGCCT CTGCAGCCGA AACCAAACCC GGTGAATCCA ACACCCGTTC CGCCAAAACC TGCGCCTAAA CCAGAGTCAC AACCTGTTCC ACCAACACCA GAATCTCATT CTGACAACAG TGAACACTCT AGCAATGGCG AACCTGATAA GTCTGCAAAT AGCACAACCC CACAGACTAA TAAGTTCGAT CAAGACCATA AGGAAATTCC AAGCATGGGA TTTCCCGAAC CAACACATCC TTCCAAAACT CTACAAACTC CTGCAGTTAG TAACGATTCT ATGCGCGAAA ACCACATAGA GCTTAACGCA AACTCAGAAT CACACACATT ACAGCAACAG ATTCCATCAG CACCAAAGAG TATTGCAGAT TTACTTCCAG CGTTGTGCAA CACTCTCACC GTCGGGAATA ATAACATTGC TTATGCGGGA AAAAATAACC ATATCACTGT TACAGTAAAT GCAACATCAG AATTCATGCG CAGATTACAA CTAGAAAAAG CAGTAAAAGC TTATGCCTAC ATATATTCCA ACCCTAAACT TCTTTATAGC GCAAATGGAA TGAAATATGT AACAGTTCAC ATTAATGAAT ATGGCAGAAT TGTATTCGAC GTAGTTATCC CAGAAGAATA TAAAGGCAAT CATACGATTG TCCTTATTGA CGAAAACGGT AAACAAGTTG CGTGGACGAA TACTTTAGTA AAGAAAGACC CTCATTTACA CAAACAATCC ACAATAGCCA ATAATGCACT CCCTCGTACT GGAATACAAA TAATTCCAAT TGTACTTAGC GTTGCATTAA TGCTGTTTGT TAGCACAATT GTGACAATTG CAAGAAAAAG AAAGTTTAAC TAA
|
Protein sequence | MRKVSTVHTI SGIVLTFGLI ISSLVYIPSL ATTAAAEETS PVHVPLTAQT AHDISTGRIR LENNQIDSLL YHKAALDPDQ DSDGDGLKNS EELYTYVKNG RTYYGYNSHP LIKDTDGDGL DDKNDSNPRR WDISPRDMAL FMELVYREDS YVKEVLDYSK PLRIIYENRN EYRLMHNELA PFWKVKETYH FGSGFDAILF ENVNPAYPFI KQNGVQVLAI RGTKGATDVW NDVLLGTGSN PGQAADIDKL IDEYKQKNTV SNLYVTGHSL GGYLAQRALI RAKSSGATFN IKAYTFNAPR IKGNIFNRWL WETSDFGDKL TKEGYAVHYK VDNDTVIGGI GNFEGAISVG SSGAGHGSRS YFEPRMNAFQ GFTVGERNKI EGTGRKIDAL DGLVNEPVIK TDEQAFEPKI TNIDIMENDP LPKGTTIKHL LNPNDIPQGS TITDITEYDK IDTSQCGNYE GKILLVLSGR ILKTIIVPIT IHKRLANIEE LPNVTAEPGK KIEVKKDNKI PDNFDFKPYL KNLPENSTVQ VVKQPNTTNT GETTFIIEIT FANKAKQKVT LKAHVVEIIP ATPLNPAKKS ITWAELIPAK KSTPWIELTP ATPAPTPTPV PPTPVPPTPE PTPSPSPTPT PTPDQQPAVP LQPKPNPVNP TPVPPKPAPK PESQPVPPTP ESHSDNSEHS SNGEPDKSAN STTPQTNKFD QDHKEIPSMG FPEPTHPSKT LQTPAVSNDS MRENHIELNA NSESHTLQQQ IPSAPKSIAD LLPALCNTLT VGNNNIAYAG KNNHITVTVN ATSEFMRRLQ LEKAVKAYAY IYSNPKLLYS ANGMKYVTVH INEYGRIVFD VVIPEEYKGN HTIVLIDENG KQVAWTNTLV KKDPHLHKQS TIANNALPRT GIQIIPIVLS VALMLFVSTI VTIARKRKFN
|
| |
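The protein sequence corresponds to the gene sequence read under translation table 11, and the opening residues can be spot-checked with a short script. This is a rough sketch rather than the method used to produce the record: the `translate` helper is hypothetical, the codon table is built from the standard NCBI amino-acid string (table 11 assigns the same amino acids as the standard code and differs in its permitted start codons, which does not matter here because the gene begins with ATG), and only the first three 10-base blocks of the gene sequence are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from this record.
first_blocks = "ATGAGAAAAG TAAGTACCGT TCACACAATT"
print(translate(first_blocks))  # MRKVSTVHTI
```

The output matches the first 10-residue block of the protein sequence, MRKVSTVHTI.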