Gene LGAS_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLGAS_1643 
Symbol 
ID4439153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLactobacillus gasseri ATCC 33323 
KingdomBacteria 
Replicon accessionNC_008530 
Strand
Start bp1605092 
End bp1606525 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content36% 
IMG OID639673468 
Productdipeptidase 
Protein accessionYP_815376 
Protein GI116630204 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0156326 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones74 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAA GTAAAGATAA CTGTACGGCA ATGATTGTCG GAAAAAAAGC TACAATTGAT 
GGTTCAACTA TCATTGCACG TGATGAAGAC GGTTATGGTG GTATTAATGA AAAGCTTTTT
GTCGTTCATG AAGCTAAGGA TTATGATGAA GACTATGTAT CAAAGTACAA TGGTTTGAAG
CTTCACTTAA AGGGGCATGG CTGCAAGTGG ACTGCTACAC CAACAGCAGA TGCTTCAGAA
GGTCGCTGGG ATGAGCAAGG TATTAACGAA TATAACGTGG CAATGTCAGC TACTGAAACT
GAAGCAACTA ATGCTCGCTG CTTAGGACAT GATCCTTTAG TTGAAAATGG TGTGGATGAA
GATTCCATGG TGTATCTTGT TTTACCATTT GTTAAAAGTG CTCGTGAGGG TGTGGCACGT
TTAGGCAAAT TAATTGAAAA ATATGGTACT GGTGAAAGTA ACGGTATCGC TTTTTCTGAT
CATGATGAAG TTTGGTATTT CGAAACTGGC GCTGGCCATC AATGGGTTGC CCAAAGAATT
CCAGAAGATT CTTATGCAAT TTGTCCAAAT ATTATGGTTA TTCAAGATAT TGACTTTGAT
GACCATGAGA ATTTTATGTA TGCTTCTACA ATTCGTGATT TTGTAGAAAA GAACCATTTA
AATCCAAGCA CTGATGGTAA GTGGAGCTTT AGAGATATTT TTGGTACTAA AGCTGAAGCT
GATAGTTATT ACAACACTCC AAGAACTTGG TATGGTCAAA AATTGTTTAA CCCTAGTGTT
GAACAGGATC CTCTAAGTCA AGAAATGCCA TTTATCAGAA AGCCTGAAAA GAAAATCGGC
GTTGAAGATG TAGAGTATTT CTTATCAAGT CACTATAACG GGACTGAATA TGATCCAATG
GGATCTTTTG CTTCTGGGGA TGATAAGGAA CAAAAGATGT TTAGGTCAAT TGCTTTAGAT
AGAAACCAAT CTAGTTGTAT TCTTCAAATT AGAAATGATG TTCCTAAAGA AATGGCTGCT
ATTCAATGGG TTAACTTTGG TTTTTATGCT TATAGTCCTT ATGTACCTTT TTATACCAAT
ATTGATGACA CACCACTTAA CTATCAAAAA GCTAGTCATA TGGTTACACC AGAATCAAGT
GCTTACTGGC TATATAAGAG TTTACAAGTA TTAATAGAAC CAAGGTATCA TCAATTTATT
TACCAAGTTG ATAATTTTAG AGATGAATGT CAAAGCTATG CTGTAAGTCG CGTTTCAGCA
ACTGATGAGA AGGCAAGAGA AATGTCTGGC AAAGAGCAGA CTAAATATTT GACGGCTGCT
AATGCTGAAA CTGCTGCTCA TATTACTGCT GAAACTAAGA AACTGATTAG TGATTTAACT
AGACAAGCAT TAAATACATC TAAATTTCAA TTTGAACGCG GCGATAATTT ATAA
 
Protein sequence
MKKSKDNCTA MIVGKKATID GSTIIARDED GYGGINEKLF VVHEAKDYDE DYVSKYNGLK 
LHLKGHGCKW TATPTADASE GRWDEQGINE YNVAMSATET EATNARCLGH DPLVENGVDE
DSMVYLVLPF VKSAREGVAR LGKLIEKYGT GESNGIAFSD HDEVWYFETG AGHQWVAQRI
PEDSYAICPN IMVIQDIDFD DHENFMYAST IRDFVEKNHL NPSTDGKWSF RDIFGTKAEA
DSYYNTPRTW YGQKLFNPSV EQDPLSQEMP FIRKPEKKIG VEDVEYFLSS HYNGTEYDPM
GSFASGDDKE QKMFRSIALD RNQSSCILQI RNDVPKEMAA IQWVNFGFYA YSPYVPFYTN
IDDTPLNYQK ASHMVTPESS AYWLYKSLQV LIEPRYHQFI YQVDNFRDEC QSYAVSRVSA
TDEKAREMSG KEQTKYLTAA NAETAAHITA ETKKLISDLT RQALNTSKFQ FERGDNL