Gene LGAS_0195 details


Gene Information

Locus tag: LGAS_0195
Symbol:
ID: 4440333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Lactobacillus gasseri ATCC 33323
Kingdom: Bacteria
Replicon accession: NC_008530
Strand:
Start bp: 227209
End bp: 228639
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 38%
IMG OID: 639672056
Product: cellobiose-specific PTS system IIC component
Protein accession: YP_814044
Protein GI: 116628872
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1455] Phosphotransferase system cellobiose-specific component IIC
TIGRFAM ID: [TIGR00359] phosphotransferase system, cellobiose specific, IIC component
            [TIGR00410] PTS system, lactose/cellobiose family IIC component
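
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the record above.
start_bp = 227209
end_bp = 228639
gene_length_bp = 1431
protein_length_aa = 476


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for the values in this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 228639 - 227209 + 1 gives 1431 bp, and 1431 / 3 - 1 gives 476 aa, which matches the reported values.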


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0622401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value: 0.284293
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGAAC AGAAAAAATC GGGCTTTAGT GTTTTCGTTA ATAAACACAT CTTGCCCCCA 
GTAATGAAAT TTGTTAACAC CAAGGCCATC CAAGCCTTGC AAAATGGTAT GATTTACACT
TTACCATTTA TCTTAATCGG TTCTATCTTC CTTATTCTAG GAAATATTCC AATCAAATCA
GTAGCTGATG CAATCAATGC TTCTGGTTGG GGAGCATTCT TTAACCAAGC CTACACTACT
ACTTTTAGTA TCATGGCTAT GTGGGCATCT GTTGGTATCG CCTATATTTA TGTTAAAAAT
GAAGGCTATG AGCCATTAGC TCCCGGTCTT ACTTCTTTAG CATCATTCTT AATGCTTCAA
ACTTTAACTA TTGACAGTCC ATTAAAGAAT GCCATGGCTA AAGGTATCGA CGGTCAAATG
ACTGCTAAAG CTGTAACCGA AAATATTGAC AAGTTACCAC ATGCTTTACA AAGTTTCTTA
GAATCCCCAG TTACAGGCGT ATTTAATATT ACCTGGCTTG GCGGAGACGG TATGATCGCC
GCAATTATTG TTGGTTTATT AGTCGGCTGG ATTTATTCAG CTATTATGAA AAAAGGCTGG
ACTATTAAGT TGCCTGAACA AGTTCCAGCA GCTGTTTCTA ACCAATTTAC TGCTATGATT
CCATCAGGAA TGATCTTAAT TGGTACTATG CTTATTTACG CAGGCTTCAA GCTAACCACT
GGTTCAGACT TCTTACAATG GACCTACCAA ACCCTTCAAA TTCCACTTCA AGGTATCTCT
GATTCACTTG GTGGTGCAAT TGCCATTGGA TTCTTAGTCC CATTCTTCTG GTTCTTCGGT
GTCCATGGTG GTTTAATTGT TGGATCCTTA GCTGGTCCTA TGCTTCAAGC AAACTCATTT
GATAATGCAC AATTATACAA GGCCGGCAAG TTAACTATTG CTAATGGTGC TCACGTTGTT
ACTAATGAAT TCTACAATAA CTTCATTAAC TTAACTGGTT CAGGGATTAC TATTGGTTTA
ATTATCTTTA TCTTAATTGC TGCTAAATCA GCACAATTAC GTTCAATTGG TAAAGTTGAA
TTAGTTCCTG GTATCTTTAA CATTAACGAA CCATTCCTAT TTGGTTTACC AATTGTTATG
AATCCATTCC TTGCAATACC ATTCTTCTTA ACTCCAGTTG TAGTTGCTAT TTCAACTTAC
TTCGTAATTA AAACTGGTAT TGTTCCTCCT CTAAATGGTT TTGCCTGTCC ATGGACGATG
CCAGCAGTTA TTTCTGGCTT CCTAATTGGC GGCTGGAAGA TGGCAATTTG GCAAGCATGT
ACCTTAGTAA TTTCAACCTT AATTTACTGG CCATTTGCTA AGAAATACGA CAACATTCTT
GTTAAACGTG AAGCTGCTAC TCTCAAGAAA GACGAGGCTG AAAGTAAATA A
 
Protein sequence
MSEQKKSGFS VFVNKHILPP VMKFVNTKAI QALQNGMIYT LPFILIGSIF LILGNIPIKS 
VADAINASGW GAFFNQAYTT TFSIMAMWAS VGIAYIYVKN EGYEPLAPGL TSLASFLMLQ
TLTIDSPLKN AMAKGIDGQM TAKAVTENID KLPHALQSFL ESPVTGVFNI TWLGGDGMIA
AIIVGLLVGW IYSAIMKKGW TIKLPEQVPA AVSNQFTAMI PSGMILIGTM LIYAGFKLTT
GSDFLQWTYQ TLQIPLQGIS DSLGGAIAIG FLVPFFWFFG VHGGLIVGSL AGPMLQANSF
DNAQLYKAGK LTIANGAHVV TNEFYNNFIN LTGSGITIGL IIFILIAAKS AQLRSIGKVE
LVPGIFNINE PFLFGLPIVM NPFLAIPFFL TPVVVAISTY FVIKTGIVPP LNGFACPWTM
PAVISGFLIG GWKMAIWQAC TLVISTLIYW PFAKKYDNIL VKREAATLKK DEAESK
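
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up, with a simple GC-fraction helper. The names `clean_sequence` and `gc_fraction` are illustrative, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAGTGAAC AGAAAAAATC GGGCTTTAGT GTTTTCGTTA ATAAACACAT CTTGCCCCCA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Pasting the full gene sequence into `clean_sequence` in place of `first_line` allows the reported gene length and GC content to be compared with values computed directly from the sequence.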