Gene Information |
Locus tag | LGAS_0046 |
Symbol | |
ID | 4440468 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Lactobacillus gasseri ATCC 33323 |
Kingdom | Bacteria |
Replicon accession | NC_008530 |
Strand | + |
Start bp | 57837 |
End bp | 60794 |
Gene Length | 2958 bp |
Protein Length | 985 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 639671908 |
Product | adhesion exoprotein |
Protein accession | YP_813899 |
Protein GI | 116628727 |
COG category | |
COG ID | |
TIGRFAM ID | |
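The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the reported protein length excludes the stop codon. Neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the record for LGAS_0046.
length = gene_length(57837, 60794)
print(length)                  # 2958
print(protein_length(length))  # 985
```

Both results agree with the Gene Length (2958 bp) and Protein Length (985 aa) fields, which supports the inclusive-coordinate assumption for this record.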
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 105 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAATT TAAGTGAAAA AGCCGCATCC GTGGTTATCA AGTATACTGA TTTAGATAAT AACTTAGCCG AGCTTTCAAA TTCTGGAAGT TTGACCGGGA ATATCGGCGA AGTAATTAAT TATAGTACGG CAGATGAAAT AAAAAAATTA GCAAAACAAG GATATGTATT AGTTAATAAT CCTTTTGATA ATAAAGGGAA AGCGCCAGTA TTTAGTGGAG ACCAAGACAG TTACATGGTT ACTTTTAAGC ATGGTAGAGA ACGCGTCACT GCTGATAATC TAAAATATGG CTGCAAACTT GAAGATTTGC AAGTAAAGGG AACACAAACT GTTCATTATG TTGGTGCAGG AAGTCGAACT CCGCGAAATG AAGTATCAAC AATTACTTTT AATCAAATCC TGGTTTATGA TCAAGTAACT GGGAAAAAAA TTGGTAGTAA AGGCTGGGAA AAAGTTGAAC AGTCTTTCCC CGTAGTTGCT GCCCCGAGCA TTTTAGGCTA TATTCCTGAT CAAGTATTAG TCGGAGGTAA GGCAGTTACT GCTGATGCTC CTAACCGGGA ATATACTATT ACTTATAAAG TTAATGAACA TATTTCTAAT AAAGAACAAA AGGCTGAAGT TAAGTATCTT GACATTGATA GTAATAATGA AGAGATTGTG GAATCTGAAC TTTTAACGGG AAAACCAAAT ACTAAAATTA ACTATAGTAC TATCGATCAG CTTAAAAAGT TAGGTGAAAA GGGATATGAA GTTGTAAGTA ATGGCTTTGA CGCTAACGGC GATGTCCAAT TCTTCGATAC TAGTGATGAA TATGTCCAAA CTTTTATCGT TACCTTAAAA CATAAGCAAG TCTTGGTAAA TGCAGAAAAT CCACTGGATG GGATTGATGA AGCAGAATAT CATAAGACAA GCAAACGCGT TGTTTCTTAT GCTGGTGCTG AGGATGAAAC GCCAGAAGAA GTAGTTCAGT TGGTAAATTG GAATCGAAAT CTTACTGTGG ATGCAGCTAC AAAAAGAGTT ATCGCAGACG GAAAATATAC TACTGATTGG AAGCCAGAGC GAGAATCATA TTCAGCAATT TCTGTGCCGG TAGTTAGTGG ATATCATACT CGTATTAAAG AAGTGCCCGA AGAAAAGGCA AGACTGGCTA ATATAACTGA AAAGATTAGG TATGTAAAAA ATGGCTACGT GATTCCAGTT GATGAAAATG GACAAAAAAT CGATAGTTTA CCTAAATTAC GTTTTGCGTC AGATAAAGAT GACCCAACTT TAGTATCCTT GCCAGAGAAT AGTTTAAAGG ATGAAAAGTA TGAGCCGGAA AAGGTTGATC TTACTGAAAT TGATCCAGCA AAAGATTTTG AAGCTAAATA TCTTTTGAAA CATAAATATG TGACTATTAA CAAAGATAAC TCTCATTTTG ACATTAACCC TGGTTCATAT CGTCGTACTG TTACAGCAAT AGTTCGCTAT GAAGGAGCGG GAGATAAAAA TCCTAAGGAT TCTATTCAAA CTGTTCAGTG GAATAGAAGC ATCACTTATG ATGAAGTAAC AAAAGAAATT CTGGAAGACG GAAAATATAC TACTGATTGG AAGCCAGATA AAGAATACTT CGAAGCTGTA GATACACCAG TAATTTCTGG ATTTACTGCT GATATCGGAG TTGTTGCTAA GCATGATGTA ACGCAAAGCG ATCTTTTTGC GACGGTTAAG TACCAAAAGA ATGGTGCAAT TATTCCTGTT GATGAAGGTG GAAAAGAAAT TGCTAAAGCT AAACCAATTC CATTTCTTAC TGATTTAACA 
GATCCAACTA GAGTTTTAGC AACTGAAGAA ATTCCTGAAA TCAAGGGCTA CCGCCGAACT GAAGAATCAG TTTTAATTAA AGATCCCTTA AAAGACATTA AAGTTACATA CATTTTAAAG CCAAACTATG TCTTAGTGGA TAGTGAACAC CCGTATCGAA CAGTTAAGCC GCACAACTAT AGTATTCCTG TTAAAGAGAC TATTCATTAT GTAGGTGCTG ACGAAAAAAC TCCAGCTGAT CGAATTCAGG GTGCACGCTG GCGTAGGTCT TTGACAGTAA ATGATAATAA CGGAAAAGTG ATTGAAGACG GCAAATACAC TACTGATTGG AGCGTTGATA AAAAAGAATA TAGCGCAGCT GTAACGCCAG TCGTTGATGG CTATCATGCA GATCAGTATC AAGTTAAAGC CCATGGGGTT AATAAGGAAG ATATTGATGT AGAGGTAAAA TACCAGAGAA ACGGTCAAAT TGTACCAGTA AACTCTAAAG GTGAAAAAAT TGAACATGCA GATTGTCCAG TTTATATCAC TGATCCAACA GATGCGACAA AGGTTCTGAT GGAGCAGCCT GTGCCACGGC TATTGAACTA TATGGCGCAA GACTCTTCGA TTGTTGTCAA AGATCCAAGT CGTGATACTA AAGTTACTTA TTATACTTTT GCTGAAATTA AAGAACTAAG TTCAGCTAAA AACTTGAAAA CTGAGATCCA ATCAATTGAT GGAAAAACTG CAACTTCGAA TGTTGTATCT CTTCCAGTTA ATGGCAAGAG GAGAAAAGCG GTTGTTACAT TCGTTGATTT GAGTAACAAT GCAACTCAAA TTGCATCTTC TGGTGTTTTA AGCGGAAATG TTGGTGATAA AATTACTGAC TTGTATAACA CGAGTAAGCA AGTTGAAGAG CTTAAAAAGA AAGGTTATGA AGTTGTCTAC AATGGTTTTG ATCCAAAAGG TGCAAGTAAG TACTTTGAAG AAGATCAAAG AAAGGTTGCT ACCTTTACAG TTGCTGTCAA AAAAGTAAAG CAGTTAAAAC CTAAGGAAGA AAAGCAGGCT GAAAAAACTT CTAAAAAAGT AAAAGAAAAA TCAAAAGCAG CTAATTCTGA TGAAAAAAAG AAGAAAAATC ATAAAGTTTT GAAGTATATA TTTCCGTGGA TGAAATAA
|
Protein sequence | MDNLSEKAAS VVIKYTDLDN NLAELSNSGS LTGNIGEVIN YSTADEIKKL AKQGYVLVNN PFDNKGKAPV FSGDQDSYMV TFKHGRERVT ADNLKYGCKL EDLQVKGTQT VHYVGAGSRT PRNEVSTITF NQILVYDQVT GKKIGSKGWE KVEQSFPVVA APSILGYIPD QVLVGGKAVT ADAPNREYTI TYKVNEHISN KEQKAEVKYL DIDSNNEEIV ESELLTGKPN TKINYSTIDQ LKKLGEKGYE VVSNGFDANG DVQFFDTSDE YVQTFIVTLK HKQVLVNAEN PLDGIDEAEY HKTSKRVVSY AGAEDETPEE VVQLVNWNRN LTVDAATKRV IADGKYTTDW KPERESYSAI SVPVVSGYHT RIKEVPEEKA RLANITEKIR YVKNGYVIPV DENGQKIDSL PKLRFASDKD DPTLVSLPEN SLKDEKYEPE KVDLTEIDPA KDFEAKYLLK HKYVTINKDN SHFDINPGSY RRTVTAIVRY EGAGDKNPKD SIQTVQWNRS ITYDEVTKEI LEDGKYTTDW KPDKEYFEAV DTPVISGFTA DIGVVAKHDV TQSDLFATVK YQKNGAIIPV DEGGKEIAKA KPIPFLTDLT DPTRVLATEE IPEIKGYRRT EESVLIKDPL KDIKVTYILK PNYVLVDSEH PYRTVKPHNY SIPVKETIHY VGADEKTPAD RIQGARWRRS LTVNDNNGKV IEDGKYTTDW SVDKKEYSAA VTPVVDGYHA DQYQVKAHGV NKEDIDVEVK YQRNGQIVPV NSKGEKIEHA DCPVYITDPT DATKVLMEQP VPRLLNYMAQ DSSIVVKDPS RDTKVTYYTF AEIKELSSAK NLKTEIQSID GKTATSNVVS LPVNGKRRKA VVTFVDLSNN ATQIASSGVL SGNVGDKITD LYNTSKQVEE LKKKGYEVVY NGFDPKGASK YFEEDQRKVA TFTVAVKKVK QLKPKEEKQA EKTSKKVKEK SKAANSDEKK KKNHKVLKYI FPWMK
|
| |
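The gene and protein sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a listing might be cleaned and checked against the protein, the snippet below strips the whitespace, computes GC fraction, and translates codons. It assumes the amino-acid assignments of translation table 11 (which match the standard code; table 11 differs in its permitted start codons), and the helper names `clean`, `gc_content`, and `translate` are hypothetical. Only short fragments of the record's sequence are used as input.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (amino acids as in table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks from a sequence listing."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons; a stop codon is rendered as '*'."""
    s = clean(seq)
    usable = len(s) - len(s) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[s[i:i + 3]] for i in range(0, usable, 3))


# First 21 bases and last 18 bases of the gene sequence in this record.
print(translate("ATGGATAATT TAAGTGAAAA A"))  # MDNLSEK
print(translate("TTTCCGTGGA TGAAATAA"))      # FPWMK*
print(gc_content("ATGGATAATT TAAGTGAAAA"))   # 0.2
```

The translated fragments match the start (MDNLSEK) and end (FPWMK) of the listed protein sequence, with the final TAA read as the stop codon.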