Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | LGAS_1813 |
Symbol | |
ID | 4438962 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Lactobacillus gasseri ATCC 33323 |
Kingdom | Bacteria |
Replicon accession | NC_008530 |
Strand | + |
Start bp | 1800788 |
End bp | 1802038 |
Gene Length | 1251 bp |
Protein Length | 416 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 639673630 |
Product | major facilitator superfamily permease |
Protein accession | YP_815635 |
Protein GI | 116630354 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.000000665519 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 0.0000000329219 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | TTGTCAGCTT CTTTTTATAC GATAAATCTT AATTTAAAAG GAGGAACACA CATGTTAAAA AAGCATGCGC TAGACCCTAT TAAGTTAATT TGGCTATTTG CTGGTTCTCT AATCATCAAT ACTGGCGTTA GTTTTATTTG GCCACTAACT ACAATTTATA TTCACAATTA TCTTCACGAA ACGCTCACTA TAGCTGGGAT TGTATTATTT ATTAACTCCG CTTTTACAAT GGTAGGAAAT GCACTAGGTG GTTTTTTATT TGATAAGTGG CATCCATATC AAACACTTTT AACTGGCGTA AGCATTTCTA CATTATCCAC TTTCTTATTG GTCCTCTTTC ATGGCTGGCC AGCCTATCCA ATCTTGTTAA TTACTTTAGG GTTAGGTAAT GGGATCGTAG TTACTGGTTT AAATTCTATT GCGACTCTAA TTAGGAGTCG AAATGCTTCT TATGTTTTTA ACGTTTTATA TTTCACACAA AACTTGGGAC TAGTCTTTGG CTCGTTAATG GTTGGATTTA TTTTACCCTT CGGCATTACT TATATTTTCC TACTTGCCTT TATTATGTTT GCATTTTTAA GTATCGTAGT CTTTCTTGAA TATCGCGGTT TAAATCAGGC TCATGCTGCT AAAGGTAAAA AAGTAGCTGA ATCAGAATAT CAAGAACAGA TTCCCTATGG TGCAAAAAAA GCTATTTTTT CTATCTTAGT TTGTGTTCTA GCAGCTTGGA TTGCATATGA ACAATGGAAC TCAAACATTT CTTCCTACAT GTTAGGCTTA CATATGAGTG TACGTTTATA CAGTTTGCTT TGGACTTTAA ATGCTGTTTT AATCGTACTC ATTCAGCCTC TTTTAACTTA CTTCGACGAT TGGCTAACCC AGCACTTACA CGGTCGTTTA TATATTGGTT TCAGTTTATT TGGATTAGCT TTCTTATTGC TAATCGGGGC AAGTCACTAT TTTAGCTTTG TCTTAGCTAT GGCTGTCCTG ACTTGTGGTG AAATTTTAGC CTTTCCTGCT GTTTCCACTT TCGTTAATGA TCGGGCCACT AATAAAGATA AGGGAAAATA TCAAGGAATT GTTCAATCAA TTACTTCAGC TGGTCGTGCT TTAGGGCCAC TAATTGGTGC CTTAGTAATC GATAATTTTT CCTACTTGGT TTTATTTATC TTTTGTACCA TTTTAGTTTT AATTTCCGTT CTTCTATTTG CACTGATTAA TGCTTATAAC AAGAAAAAGA TCGCTAAATA G
|
Protein sequence | MSASFYTINL NLKGGTHMLK KHALDPIKLI WLFAGSLIIN TGVSFIWPLT TIYIHNYLHE TLTIAGIVLF INSAFTMVGN ALGGFLFDKW HPYQTLLTGV SISTLSTFLL VLFHGWPAYP ILLITLGLGN GIVVTGLNSI ATLIRSRNAS YVFNVLYFTQ NLGLVFGSLM VGFILPFGIT YIFLLAFIMF AFLSIVVFLE YRGLNQAHAA KGKKVAESEY QEQIPYGAKK AIFSILVCVL AAWIAYEQWN SNISSYMLGL HMSVRLYSLL WTLNAVLIVL IQPLLTYFDD WLTQHLHGRL YIGFSLFGLA FLLLIGASHY FSFVLAMAVL TCGEILAFPA VSTFVNDRAT NKDKGKYQGI VQSITSAGRA LGPLIGALVI DNFSYLVLFI FCTILVLISV LLFALINAYN KKKIAK
|
| |