Gene LGAS_0343 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLGAS_0343 
Symbol 
ID4440596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLactobacillus gasseri ATCC 33323 
KingdomBacteria 
Replicon accessionNC_008530 
Strand
Start bp367171 
End bp368874 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content33% 
IMG OID639672202 
Productcellobiose-specific PTS system IIC component 
Protein accessionYP_814187 
Protein GI116629015 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1440] Phosphotransferase system cellobiose-specific component IIB
[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00394] phosphotransferase system, lactose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component
[TIGR00853] PTS system, lactose/cellobiose family IIB component 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000105406 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00000748526 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATGGGT TGGTAAAAAG AATTGAAAAA GCGCAGCCAT TCTTTAGAAA AGTTGCGACT 
AATCAATATT TAAGAGCAAT TCGTGATGGC TTCATCGCGT TAATGCCAAT TATTATCTTT
TCAAGTTTTT TCTTGTTAAT TGCTAATGTG CCCCAAATTT GGGGATTTGT TTGGCCTAAA
AAAGTAGCTA CAGCTTTAAA TTTATTTTAT AACTTATCAA TGGGCTTTTT GTCCTTAATG
GCGGCTTCTA CTGTAGCCAA GGCCTTAACT GGAAGTTTAA ATCTAAAGTT GCCAAAAACT
AATCAAATTC CAGTTGCTGG AGTTCAATAT ACGGCTCAAA TTTCTTTTGC ATTTGTGGCA
ATAGATACTT TATTAAATAA AGGAAATTTA TATTTAGCTA CTGATATGAT TGGGACTAAA
GGATTAATTT GTGCCTTCTT AGTAGCATTT ATTGTACCTA ATATTTATTA TTTTTGTTTT
AAACATAATG TCACAATCAC GCTACCTGAT GTAGTACCTC AAAATATTGC GCAAGCCTTT
AAAAATATTG TTCCCTTTGC TATAGCAACT ACTTTCTTTT GGGCATTTAC GTTAATTTTT
AGAGCTTTAA CTAATATGAA TTTGGCAGCT TGGATAATTA AAAGTCTAAC TCCACTTTTT
ACTGCAGCGG ATGGGTATGT TGGATTAGCA ATTATTTATG GTGCAATGGC GTTCTTCTGG
TTTATCGGAA TTCAAGGGCC CTCAATCGTT GAACCAGCAG TTACGGCAAT CTACTTAACT
AATGTTGAAG CTAATTTAAG AGCATTTAAT TCAGGAAATA TTCAAGGCGC TAATCACATT
TTGTCACAAG GAATTCAAAT GTTTGTTGCT ACTTTAGGAG GAACTGGAGC AACTTTAGTA
GTAACGTTTA TGTTTGCCTT TTTGTCTAAA TCTAAACAAT TAAAAGCAGT TGGACGTGCA
TCAACAATTC CAGTTATTTT TGGTGTTAAT GAACCTATTT TATTTGGTGC TCCACTGATC
TTGAATCCAA TTTTCTTTAT CCCTTTTATT TTTGCGCCAA TTCTTAATGT TTGGATCTTT
AAAATTTTTG TTGATGTGCT TCATATGCCG TCATTTATTT ACAACTTACC ATGGACCACG
CCACCAACAT TAGGATTGTG GATGGGAACT GGTTTTAATA TTTTAGCTTT ACTATTAGGA
ATTTTATTAC TAGTAGTTGA TTCATTATTG TATTATCCAT TTTTTAAGGC TTATGATGCT
TCAATGCTAG CCCAAGAAGA GAAAAAGGAA GAACTTGAAG AAAAAGATAA TAAGAAATCT
GTTGTATCAT CTGCTGAGCC TAAATTTAAT GAATCTGAGA TTCCACTAGG CAAAACTAAA
GATCAACAAG CACTAAATGT TTTAGTTTTA TGTGCTGGTG GTGGGACATC TGGAATTTTA
GCTAAGTCAC TTAATAAATT AGCTAAAGAA GACAAGTTAC CTTTAAATGC AGCAGCTGCC
GCATTTGGAT CACATCATGA TTTAATATCT GGAATGGATG TAGTAATTTT AGCGCCGCAA
ATGGATACCA TGAAAGATGA ACTTGCAAAG GAATGTAAGC AAAATAATGC TCGTATGATT
ACCACTACTG GAAAACAATA TATTTACATG ACGCAACATG CTAATGAATG TTTGAAGCTA
TTAATCAATG ATATTAATAA ATGA
 
Protein sequence
MDGLVKRIEK AQPFFRKVAT NQYLRAIRDG FIALMPIIIF SSFFLLIANV PQIWGFVWPK 
KVATALNLFY NLSMGFLSLM AASTVAKALT GSLNLKLPKT NQIPVAGVQY TAQISFAFVA
IDTLLNKGNL YLATDMIGTK GLICAFLVAF IVPNIYYFCF KHNVTITLPD VVPQNIAQAF
KNIVPFAIAT TFFWAFTLIF RALTNMNLAA WIIKSLTPLF TAADGYVGLA IIYGAMAFFW
FIGIQGPSIV EPAVTAIYLT NVEANLRAFN SGNIQGANHI LSQGIQMFVA TLGGTGATLV
VTFMFAFLSK SKQLKAVGRA STIPVIFGVN EPILFGAPLI LNPIFFIPFI FAPILNVWIF
KIFVDVLHMP SFIYNLPWTT PPTLGLWMGT GFNILALLLG ILLLVVDSLL YYPFFKAYDA
SMLAQEEKKE ELEEKDNKKS VVSSAEPKFN ESEIPLGKTK DQQALNVLVL CAGGGTSGIL
AKSLNKLAKE DKLPLNAAAA AFGSHHDLIS GMDVVILAPQ MDTMKDELAK ECKQNNARMI
TTTGKQYIYM TQHANECLKL LINDINK