Gene Information |
Locus tag | LGAS_0943 |
Symbol | |
ID | 4439862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Lactobacillus gasseri ATCC 33323 |
Kingdom | Bacteria |
Replicon accession | NC_008530 |
Strand | + |
Start bp | 942839 |
End bp | 945778 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 639672798 |
Product | adhesion exoprotein |
Protein accession | YP_814769 |
Protein GI | 116629597 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01167] LPXTG-motif cell wall anchor domain; [TIGR01168] Gram-positive signal peptide, YSIRK family |
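
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length_bp = gene_length_bp(942839, 945778)
protein_aa = expected_protein_length_aa(length_bp)
print(length_bp, protein_aa)  # 2940 979
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the record describing an unspliced CDS.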
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTATCAA GAAATAATAA TCATTTTAAT GAACTTAAAG AAATAAGTCC TCGTTATTCA ATTCGTAAGT TTACTGTCGG AGCAGCTTCT GTTCTTATAG GGATGTCAAT TTTCGGGCTA AATTCTCAAA CAGCTCAAGC AGATAGTGTT AATGAAAATG GTTCAAATAA ACAAAATCCT GCTGTTGAAC AGGAATCAAG CAAGGCACTT ACAACTAGTC CATCAAGTAA TATTAAAAAC GTGGTTGTAA CTACTAAGAA TGTAGATGCA CAAAATCAAG TATCTGCTGA AAAGTCAAAG GTTAATACAA GCAGTGAACA AAAAGCAACT AATACCAATA AAGAAAGCAA TCAACGTGCC GAGCTACAGA TAGAAAATAC TAAAAAAGTT ATTGCAGCAA ATAAAGATCA AACTAAACAA GTTTCAACTG CTGATCAAGA TCAAGCTAAA CCAGTTGCCA AAGAAAATTA TAGTGTTATC CAGCATGATG TGGTTGCAAA TAACGGAAAT ACCCCACATG ATAGTGGCTA TGTTCAACTA AATTTAGGAT TAAAAATAGA AAATACTAAG AACATTAATC CTGGTGATTA TATTGATATT GATTTAGGCT TGCCACTACA ATCTGGCCAG CAAAAAACGT ATAGTGATGG TTTAGCAGAG AAAGATACAC CAGTAACCGT TAAAGATAAT GCAGGAAAGA CGGATACAAT AGGAAATATT GCCACAGTAG GAAATATTCC GAATGAATTT TACCGATTAA GTTTTAACGA TCATATTCAA AAATATGGTG CCATACTTCT TAATCTTGAT TTAAAGCAAT ATTCTTTAAT TCAGCGTGCT ATTAGTACTG TCGGCTATTC TCATGAAAAG AATGGCCCAA CCAGTTATAG TGCTCAAAAT GATCTAGTTA TTGGGGATGG CGCTTATAAA TTTACTTCTG GGTTAAGCGT TCCTGTTAAA TATATTCCAC AAGAAAGTAG TGGAGCGATT ACTCCGCGGA TTGGAGAAAA CACATGGATT CAAGGAAGAA GTACTGGCAT AAATTCTAGA GTATGGACTA TTTATCCTGA CGGATCATTT ACTGTAAACG ATAAATCTCC GCTTGGGTTA GATGGAATCG TATATTTTGC TAAGAATTTT GGTAATACTG CTACAGTAAC GGTTTATAGT CCAAGTAATA ATCCTTACTT TGATTATAAT TATGCAAGCG ACAACGAGAT TAAAGAACAA ATTGAAAGTG CATTTGCTCA ACTAAAAGGG ACTAACAATT TAGATCAAAT TGCTCAAGAT AATTCAAATG TCGGTTTTAG TTTAAATAAA ATACCAGAAA ATAATATTTC AATTGTTGTT ACTCATAGTG ACAATCCTGA AAAAGCATAT AGCACAGTCT TGAAAGCCGA TGGGTCGAAA TACAGTTCAG ATCAATTAAT GACTTCACGT ACATATCACA TTACTGTTAA TGGAGCTAAT TTAGGTCAAA TATATTCTTT ACCAATTTCT TTTATTTCTG AAATTACTAA AGCAGGTGTT GATGTTTCAA AGCCTGACGA TATTACTAAA CCAGAAGAAG ATAAGGCTCA GATATATCAA GATCAAGATA AGTATGAAGT ACTTAATAAC AATTTGAATA ATGCTTATCC AATTTTATAT AAGGGAATTA GAATCAATAA TCCCGAATTA ATGAATTATA TGAAGAACAA TCTTGCTACT TGGATTGATG TGAAAGATGA TAATAATCCG AATAATGAAT TAGTAGGTCC AGCAGCTTAT ACTCTTAATG TATCTGTTGT AAATCAGCCA 
ACTAACTTAA AGCCACAAAA TATTGCAAAT GGTGATTCTT CAGGGCAACA ATTAGATCAA ACTATTCTTG TTATTTTCCA GGACTTAGAT GAAAATAACA AAAATATCTT AAGTAAAGAT TTAACTGGAT TAAGTGGAGC TGATGCTAGA TATTCAACTT TAAGTGATAT TAAGGCTTTA GAAAATCAAC ATTATGAATT AGTTAGCGAT GATACTAAGG GTCAAAATCT TAAGTTTGGT AATCAAAAAC AAGTATTTTA TGTCAAATTT AGACATGTTC TAAAGAAAGA AAAACAAGAA AGTAAGAGTG TTTCTAGAAA TATTACTTAT GTTGACGATA AAGGAAATCC TGTTAAGGGA TCACCTGATG GCAAAGCTAG CTATGTACAA AGTGCAAGTT TTGTTCGTTT TCCAGTGAAA GATTTAGTGA CTGGTGTGGT CGGCTATAGT ATTAATAATG ATGGTGCAAT TGATACTCAG GATGGTACTC ATGCATGGAA AGCAACTAGC AGTAATTATT TCGAAAAAGT AATTTCAAAA GATCCTGCAT CATTAGGCTT TGAGCATGTT AACTATGCTG TTATTCCAGA AGAAACAGTA GATGCAAATA CTAAAGATCA AGAGATCCAA GTTATTTATA GTGGTACTAC TAAGAAGCCA GAACCAACTA AACCCGACAA GCCAAACAAG CCGGAGAAAC CGGAAACCCC GAATAAACCT GACAAACCGA GTGAACCAAG TAAGCCAGAA ATGCCAAGCA AACCTGATAA ATCAAGTGAA AGCAAGTCAA CTGAACCTGA GAAATTGATT CATCAAGCAA CATCAGACAC TTCAAAAAAT CACAGTGAAT CAACTGAAAA AGAAGATTTA TCAAGATCAA AAGTTGCTTT AGATAAGACA AGTTTTTCAC CTTTGGCTAG AAGAAATACT TCGTTAGTCA ATGAGAAGAT TAACACTTCT GAATCATCAA AAGATTCAAC CAATCTGACT AAAAATAGAA CAGCAAATTC AAATAAAGAA TTGCCACAAA CAAGTAGTAA CGCTCAACAA TCAATTGATG ATGAAGTGAT TGGTTCTGTT GCTTTATCAA TAGGATTAAT TGGTTTAGCT GGTGTTAAAA AGAGAAAAAA GGCAAGATAA
|
Protein sequence | MLSRNNNHFN ELKEISPRYS IRKFTVGAAS VLIGMSIFGL NSQTAQADSV NENGSNKQNP AVEQESSKAL TTSPSSNIKN VVVTTKNVDA QNQVSAEKSK VNTSSEQKAT NTNKESNQRA ELQIENTKKV IAANKDQTKQ VSTADQDQAK PVAKENYSVI QHDVVANNGN TPHDSGYVQL NLGLKIENTK NINPGDYIDI DLGLPLQSGQ QKTYSDGLAE KDTPVTVKDN AGKTDTIGNI ATVGNIPNEF YRLSFNDHIQ KYGAILLNLD LKQYSLIQRA ISTVGYSHEK NGPTSYSAQN DLVIGDGAYK FTSGLSVPVK YIPQESSGAI TPRIGENTWI QGRSTGINSR VWTIYPDGSF TVNDKSPLGL DGIVYFAKNF GNTATVTVYS PSNNPYFDYN YASDNEIKEQ IESAFAQLKG TNNLDQIAQD NSNVGFSLNK IPENNISIVV THSDNPEKAY STVLKADGSK YSSDQLMTSR TYHITVNGAN LGQIYSLPIS FISEITKAGV DVSKPDDITK PEEDKAQIYQ DQDKYEVLNN NLNNAYPILY KGIRINNPEL MNYMKNNLAT WIDVKDDNNP NNELVGPAAY TLNVSVVNQP TNLKPQNIAN GDSSGQQLDQ TILVIFQDLD ENNKNILSKD LTGLSGADAR YSTLSDIKAL ENQHYELVSD DTKGQNLKFG NQKQVFYVKF RHVLKKEKQE SKSVSRNITY VDDKGNPVKG SPDGKASYVQ SASFVRFPVK DLVTGVVGYS INNDGAIDTQ DGTHAWKATS SNYFEKVISK DPASLGFEHV NYAVIPEETV DANTKDQEIQ VIYSGTTKKP EPTKPDKPNK PEKPETPNKP DKPSEPSKPE MPSKPDKSSE SKSTEPEKLI HQATSDTSKN HSESTEKEDL SRSKVALDKT SFSPLARRNT SLVNEKINTS ESSKDSTNLT KNRTANSNKE LPQTSSNAQQ SIDDEVIGSV ALSIGLIGLA GVKKRKKAR
|
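
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the pipeline that produced the record: it builds the codon table by hand, relying on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG. The helper names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record.
first_bases = "ATGTTATCAA GAAATAATAA TCATTTTAAT"
print(translate(first_bases))  # MLSRNNNHFN
```

The output matches the first ten residues of the protein sequence above. Running `gc_fraction` on the full gene sequence, rather than on this short prefix, is how the GC content field could be checked.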
| |