Gene LGAS_0501 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLGAS_0501 
Symbol 
ID4440397 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLactobacillus gasseri ATCC 33323 
KingdomBacteria 
Replicon accessionNC_008530 
Strand
Start bp525553 
End bp527250 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content36% 
IMG OID639672359 
Productcellobiose-specific PTS system IIC component 
Protein accessionYP_814337 
Protein GI116629165 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1440] Phosphotransferase system cellobiose-specific component IIB
[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00394] phosphotransferase system, lactose specific, IIC component
[TIGR00410] PTS system, lactose/cellobiose family IIC component
[TIGR00853] PTS system, lactose/cellobiose family IIB component 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones96 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGGTT TTACCAAAGC AATGGATAAA ATGAAGCCAA AATTCGAAAA AATAGCGGCT 
AATCCATATG TTTCAGCAGT TCGTGACGGC TTTATTGCTG CCATGCCAAT TATTTTGTTT
TCAAGTTTGT TTACGCTGAT TGCATATGTA CCAAACGCAT GGGGATTTTA TTGGCCGAAG
GCAGTAGAAA ACGCATTGGT TTTGCCTTAT AGCTATTCAA TGGGTTTGCT TGCTTTATAT
GTAACTGCAA CTTGTGCGAA GAATTTAACT GATTATAAGA ACTTAAAGCT TCCTAAGACA
AACCAGATTA ATCCAATGAA CGTTATCTTA GCTGCTGAAA TTTCATTCAT TATTATTGCT
ATTAAAGTTG GTAAAAATGG ATTAGATTTA ACTTACTTAG GTACGCAAGG ATTAATCGCT
TCATATATTG TTGGTTTAAT CGTACCTAAC ATTTACTATG CATGTGTTAA GAACAATGTA
ACTATTAAAA TGCCTGATGT TGTACCACAG AACATTGCTC AAACATTTAA AGACATTTTC
CCAACGACTT TCTCAGTTGC ATTGTTCTGG ATAGTTCAAA TTATTTTAAA CCAACTCTTT
GGTGCAAACT TATCAGAATG TGTAGTAAAG GTTCTTTCAC CATTATTCCA CGCAAGTGAT
ACTTACGGTG GTTTAGCATT AGTTGCTGGT GCGATGGCAT TCTTCTGGTT TGTTGGTGTG
CAAGGTCCTT CAATTGTTGC TCCAGCCGTA GCCGCAATTG AAACCACTAA TGTTGGTTTG
AACCAACAAT TAGTTCATGC TGGAATGCAA GCAAGTCATG CTTTAACTAT TAACTGTCAA
GACTACGTTA TGAACATGGG TGGTACTGGA TCAACTTTCG TAGTTCCATT CATCTTCTTA
CTTTTAGCTA AATCAGCTCA AAACAAAGCC GCTGGTAAAG CCGCTGTTAT TCCTGGATGT
TTCTCAGTTA ACGAACCTAT TTTGTTTGGT GCTCCAATTA TCATGAACCC AGTATTCTTC
ATTCCATTTT TGGTAACACC AATGTTCAAT GTTTGTGCTT ATAAATTCTT TGTTCAAGTT
CTTCACATGA ACGCTTTATA CAACACATTA CCTTGGACTG TTCCAGCTCC AATTGGTATT
ATCGTATCTT CTGGTTTTGC TGGTTTATCA TTTGTTTATG TAATTTTAAC TTTAGTTGTT
GATACTTTAA TTTGGATTCC ATTCTTTAAG TTTTACGACA ACGATTTGTA CAAGCAAGAA
CAAGCTAAGC TTGCTGCAGA AAAAGCTGCT GGTGTAGCTA CTGCTACTGA TGATTCAACT
GCTAGTCTTG CAGCTACTGA CAAAGAAGCT AAGGAAGGTA TTACTAAAGA TACTAACGTA
ATGGTTATCT GTGCCGGTGG TGGTACTTCA GGTATCTTAG CTAAGGCTTT AAACAAGATG
GCTAAAGAAC GTAACTTACC ACTTCACGCA GCTGCTCGTG CTTATGGTCA ACACATGGAT
ATTATTCATG ACATGGACTT GATTATTTTG GCTCCACAGA TGGATAGTAT GAAGGATAAC
TTGAAAGAAA TTGCTGATCA TGATGGTTCT AAATTAGTCA CTACTACTGG ACGTCAGTAC
ATTGAATTAA CTCAAAAACC TGATTTAGCA TTTAAGTTTG TTGTTGATAG CCTTGAAGGC
AAGAATGAAG ATAAGTAA
 
Protein sequence
MNGFTKAMDK MKPKFEKIAA NPYVSAVRDG FIAAMPIILF SSLFTLIAYV PNAWGFYWPK 
AVENALVLPY SYSMGLLALY VTATCAKNLT DYKNLKLPKT NQINPMNVIL AAEISFIIIA
IKVGKNGLDL TYLGTQGLIA SYIVGLIVPN IYYACVKNNV TIKMPDVVPQ NIAQTFKDIF
PTTFSVALFW IVQIILNQLF GANLSECVVK VLSPLFHASD TYGGLALVAG AMAFFWFVGV
QGPSIVAPAV AAIETTNVGL NQQLVHAGMQ ASHALTINCQ DYVMNMGGTG STFVVPFIFL
LLAKSAQNKA AGKAAVIPGC FSVNEPILFG APIIMNPVFF IPFLVTPMFN VCAYKFFVQV
LHMNALYNTL PWTVPAPIGI IVSSGFAGLS FVYVILTLVV DTLIWIPFFK FYDNDLYKQE
QAKLAAEKAA GVATATDDST ASLAATDKEA KEGITKDTNV MVICAGGGTS GILAKALNKM
AKERNLPLHA AARAYGQHMD IIHDMDLIIL APQMDSMKDN LKEIADHDGS KLVTTTGRQY
IELTQKPDLA FKFVVDSLEG KNEDK