Gene LGAS_0783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagLGAS_0783 
Symbol 
ID4440032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameLactobacillus gasseri ATCC 33323 
KingdomBacteria 
Replicon accessionNC_008530 
Strand
Start bp765552 
End bp766976 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content36% 
IMG OID639672640 
Productdipeptidase 
Protein accessionYP_814612 
Protein GI116629440 
COG category[E] Amino acid transport and metabolism 
COG ID[COG4690] Dipeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.757e-23 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGCATT TAAGTGCATG TACAACTATC TTAGTTGGTA AAAAAGCCTC AATTGATGGC 
TCAGTAATGA TTTCACGTAA TGATGATACA GCAGGTGCAA TTACACCACA AAAATTTATT
ATTGAACCAG CTGCTCATGG TGAAAAAGGA CGTAAAATTA AGTCTTGGCT TAATAAATTT
GAAATGGATC TTCCCGAAGA TGCTCAACGA GTACCAGCTG TTCCAAACGT TGACTATAAA
AAATTAGGTT ACTATGACGA AAGCGGTATT AATCAAAAAA ATGTCGCCAT GTCATGTACT
GAATCAACTT ATGGTAATGA AAGAACTTTA GCCTTTGATC CATTAGTTAA AGATGGCTTA
GATGAAGATT GTATGCAAAC TGTAGTTTTA CCATACATTG ATTCTGCTAG AGACGGTGTT
AAGCGTCTTG GTGTTTTAAT TAAGAAATAT GGATCTCCTG CTGGGAACTC AGTTTTATTT
GGTGATAAAG ATGAAATCTG GTACATGGAA ATTGTTACTG GTCACCACTG GGTTGCTCAA
CGTATTCCAG ATGATTGTTA TGCAGCAACA GGTAACCGTG TAGCTATCCA ACAAGTTAAT
TTTGATGATC CTGATAACTT TATGTGGAGT GAAGGAATTC AAGAATTTGT TGAAAAGCAC
CACTTAAATC CTGACCACGA AGGTTGGAAT TTCCGTCATA TTTTTGGAAC ATATACTGAA
CAAGACCGTC ACTACAACAC TAGTCGTCAA TGGTATATTC AAAAACTTTT CAACCCAGAA
ATTGAACAAG ATCCTGAAGA TCCAGATATT CCTTTCATTA GAAAAGCTTC TAAGAAATTA
GCTAAAGAAG ATATTGAATT TGCACTAGGT TCTCATTATC AAGATACGCC ATTTGATCCA
TTTGGTCATG GAACTGAAGA AGAAAAGCAC CGTTACCGTC CAATTGGATT AAACAGAACT
CAAAACTCAC ATATCTTGCA AATTAGAAGT GATGTTCCTG AAGAAATGGC GGGTATTATG
TGGTTATGTA TTGGTGGACC AACATTTACT CCATATATTC CATTCTTTGC TAATATGAAT
GAAACTGATC CATCATTTAA CAATACTTCA ATGACTTACA ACAAAGAAGA TGCTTGGTGG
TATTACAAGT CTCTTGCTGC TTTGGTTGAA AGTCATTATC CACAATTTGT TCAACTTGAC
ACTAAATACC TTGAAGAACT TAACCGTTAT TACCGCGGTA GAGTTGAAGA GATTATTGAG
AATGCCCAAG GTTTGAATGG TGAAAAATTA ACTGACTACT TAACTCGTGA GAATCAAAAG
ACTGTTGCTC ATACTAAGAA AGACAGTGAA GAATTGATGG GTCAAATGTT CACGGATGCT
ATTAAGATGT CTAAGTTAAC ATTTAAGATG GATCCAAATC TATAA
 
Protein sequence
MKHLSACTTI LVGKKASIDG SVMISRNDDT AGAITPQKFI IEPAAHGEKG RKIKSWLNKF 
EMDLPEDAQR VPAVPNVDYK KLGYYDESGI NQKNVAMSCT ESTYGNERTL AFDPLVKDGL
DEDCMQTVVL PYIDSARDGV KRLGVLIKKY GSPAGNSVLF GDKDEIWYME IVTGHHWVAQ
RIPDDCYAAT GNRVAIQQVN FDDPDNFMWS EGIQEFVEKH HLNPDHEGWN FRHIFGTYTE
QDRHYNTSRQ WYIQKLFNPE IEQDPEDPDI PFIRKASKKL AKEDIEFALG SHYQDTPFDP
FGHGTEEEKH RYRPIGLNRT QNSHILQIRS DVPEEMAGIM WLCIGGPTFT PYIPFFANMN
ETDPSFNNTS MTYNKEDAWW YYKSLAALVE SHYPQFVQLD TKYLEELNRY YRGRVEEIIE
NAQGLNGEKL TDYLTRENQK TVAHTKKDSE ELMGQMFTDA IKMSKLTFKM DPNL