Gene Mlg_2533 details


Gene Information

Locus tag: Mlg_2533
Symbol:
ID: 4270172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Alkalilimnicola ehrlichii MLHE-1
Kingdom: Bacteria
Replicon accession: NC_008340
Strand:
Start bp: 2875802
End bp: 2877841
Gene Length: 2040 bp
Protein Length: 679 aa
Translation table: 11
GC content: 68%
IMG OID: 638127292
Product: oligopeptidase A
Protein accession: YP_743363
Protein GI: 114321680
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0339] Zn-dependent oligopeptidases
TIGRFAM ID:
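
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative, not part of the record):

```python
# Values copied from the Gene Information fields.
start_bp = 2875802
end_bp = 2877841

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length)     # 2040
print(protein_length)  # 679
```

Both results agree with the Gene Length and Protein Length fields.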


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.437216
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 35
Fosmid unclonability p-value: 0.440356
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACA ATCCGCTGCT GCACGACGAG CCGCTGCCGC CCTTCCCCGA GATCCAGCCC 
GAGCACGTGG AGCCGGCCAT CGACGAGTTG CTGGCCCACT GCCGGCAGAC CCTGCGTGAG
GTGCTGGAGC GGGGCGACTG GACCTGGGAC GGGCTGGTGG CGCCGCTGGA GGCCGCCGAC
GAGCGCCTGA GCCGGGCTTG GTCGCCGGTC TCGCATATGA ACGCGGTGGT TAACAGCGAG
GCGCTGCGCG CCGCCTACAA TGCCTGTCTG CCCAAGCTCA GCGCTTACGC CACCGAAGTG
GGGCAGAACG CCGAACTGTG CGCCGCCTTC CACGCCCTGC GCGACAGTGA GGAGTACCAG
GCGCTGGATA GTGCCCAGCA GCGCACCATC GACAATGCCC TGCGCGACTT CCGCCTCTCA
GGCGTGGACC TGCCGGCGGA CCAGAAGCAA CGCTACGGAG AGATCGCCCA ACGCCTGTCC
GAGCTCTCCG CCAAGTTCGG CGAGAACGTA CTCGACGCCA CCAACGCCTG GCACAAGGAC
CTGTCGGACG CGGAGGTCCT GTCCGGCCTG CCCGACTCCT CCCTGGCGCT GGCCCGGCAG
ACCGCTGAGC GCGCCGGAGT CGAGGGCTAC CGGATCAACC TGGAGTTCCC CAGCTTCTTC
GCGGTCATCA CCTACGCCGA CGACCGCGCG CTGCGCCGCG AGGTTTACGA GGCCTGGAGC
ACCCGGGCCT CCGAGCGGGG GCCCCACGGC GGCCAATGGG ACAACCTGCC GCTGATGGAG
GAGATCCTGG CCCTGCGCCA CGAGAAGGCG CGGCTGCTGG GGTACGACAA CTTCGCCGAA
CTCTCCCTGG CCAAGAAGAT GGCCGGCTCC ACCGATGAGG TCCTGGGCTT CCTGAATGAC
CTGGCTGAGC GCGCCCGGCC CCGCGCCGAG GATGAGCTGG CCGAGCTGCG CCGCTTTGCC
GGCGAGGAGC TGGGTCTGAC CGACCTCCAG GCCTGGGATA TCCCCTATGC CTCGGAGAAG
CTGCGCCAGG CCCGCTTCCA ACTCTCGGAC GAGGACCTTC GTCCCTATTT CCCGGCCGAG
CGGGTGATGG CCGGGCTCTT TGAGGTGGTG CAGCGGCTCT ACGGCCTGCA TATTGAGGAG
CGCCAGGGCG TGCCCGTCTG GCACGAGGAC GTCCGCTACT ACGAGATCCG CGACCGGGAC
GGCGATCTGC GCGGGGCCTT CTACACCGAT CTCTATGCCC GCCCCCACAA GCGCGGCGGC
GCTTGGATGG ACGAGTGCCG GGCGCGGATG CGCCAGGGGG AGCGGGTGCA GGTGCCGGTG
GCTTATCTCA CCTGTAACTT CACGCCGGCG GTAGGCGACC AGCCGGCACT GCTCACCCAC
GGCGAGGTGA CCACGCTGTT CCACGAGTTT GGCCACGGGC TGCACCACAT GCTGACCCGG
GTGGAGGCGC CGGCGGTGGC CGGGATCCGC GGGGTTGCCT GGGATGCGGT GGAGTTGCCC
AGTCAGTTCA TGGAGAACTG GTGCTGGGAG CGCGAGGCGC TGGATCTGTT CGCGGCGCAC
TATCAGACCG GGGCCCGGAT CCCGGAGGAT CTCTTCCGGC GTATGCGGGC GGCGCGCAAT
TTCCAGTCGG CCATGCAGAT GGTGCGCCAG CTCGAGTTCT CGCTGTTCGA TTTCCGCCTG
CACGCGGGGT ATGACCCGGA GCGGGGCGCG CGCATCTATC CGCTGTTGGA GGAGGTGCGC
GAGCAGGTGG CGGTGGTCCG GCCGCCGGAG TGGAACCGCT TTGCCAACAG CTTCGGGCAT
ATCTTTGCCG GCGGTTATGC GGCCGGCTAT TATAGCTACA AGTGGGCGGA GGTGCTGTCG
GCGGATGCCT ATTCGCGGTT TGAGGAGGAG GGCATCTTCA ACCAACAGGC CGGGCATGAG
TTCATGACCC ACATCCTGGA GAAGGGCGGC TCCGAGGACC CCATGGTGCT GTTCCGCAAC
TTCCGCGGGC GGGCGCCGCG GATCGACGCC CTGTTGCGGC ATTCCGGGCT GGCGGCGTGA
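
The gene sequence is printed in space-separated blocks of 10 bases, 60 bases per row. A minimal sketch of stripping that layout and computing GC content follows. It uses only the first row as sample input, so its result describes that row and not the whole gene; the helper names `clean` and `gc_count` are hypothetical.

```python
def clean(blocked_seq):
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked_seq.split()).upper()


def gc_count(seq):
    """Count the G and C bases in an already cleaned sequence."""
    return sum(1 for base in seq if base in "GC")


# First row of the gene sequence, copied from the record.
first_row = "ATGAGCGACA ATCCGCTGCT GCACGACGAG CCGCTGCCGC CCTTCCCCGA GATCCAGCCC"

seq = clean(first_row)
gc_percent = 100.0 * gc_count(seq) / len(seq)

print(len(seq))              # 60
print(seq.startswith("ATG")) # True
print(round(gc_percent, 1))
```

Passing all of the sequence rows to `clean` as a single string would give the GC content of the full gene.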
 
Protein sequence
MSDNPLLHDE PLPPFPEIQP EHVEPAIDEL LAHCRQTLRE VLERGDWTWD GLVAPLEAAD 
ERLSRAWSPV SHMNAVVNSE ALRAAYNACL PKLSAYATEV GQNAELCAAF HALRDSEEYQ
ALDSAQQRTI DNALRDFRLS GVDLPADQKQ RYGEIAQRLS ELSAKFGENV LDATNAWHKD
LSDAEVLSGL PDSSLALARQ TAERAGVEGY RINLEFPSFF AVITYADDRA LRREVYEAWS
TRASERGPHG GQWDNLPLME EILALRHEKA RLLGYDNFAE LSLAKKMAGS TDEVLGFLND
LAERARPRAE DELAELRRFA GEELGLTDLQ AWDIPYASEK LRQARFQLSD EDLRPYFPAE
RVMAGLFEVV QRLYGLHIEE RQGVPVWHED VRYYEIRDRD GDLRGAFYTD LYARPHKRGG
AWMDECRARM RQGERVQVPV AYLTCNFTPA VGDQPALLTH GEVTTLFHEF GHGLHHMLTR
VEAPAVAGIR GVAWDAVELP SQFMENWCWE REALDLFAAH YQTGARIPED LFRRMRAARN
FQSAMQMVRQ LEFSLFDFRL HAGYDPERGA RIYPLLEEVR EQVAVVRPPE WNRFANSFGH
IFAGGYAAGY YSYKWAEVLS ADAYSRFEEE GIFNQQAGHE FMTHILEKGG SEDPMVLFRN
FRGRAPRIDA LLRHSGLAA
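
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative only. It builds the codon table from the usual compact 64-letter encoding, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). The function name `translate` is hypothetical, and only the first and last sequence rows are used as input.

```python
from itertools import product

# Codons in TCAG order paired with their amino acids ("*" marks a stop codon).
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(blocked_dna):
    """Translate a blocked DNA string in frame 1, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Each row holds 60 bases (20 codons), so every row starts in frame.
first_row = "ATGAGCGACA ATCCGCTGCT GCACGACGAG CCGCTGCCGC CCTTCCCCGA GATCCAGCCC"
last_row = "TTCCGCGGGC GGGCGCCGCG GATCGACGCC CTGTTGCGGC ATTCCGGGCT GGCGGCGTGA"

print(translate(first_row))  # MSDNPLLHDEPLPPFPEIQP
print(translate(last_row))   # FRGRAPRIDALLRHSGLAA
```

The two outputs match the first 20 and the last 19 residues of the protein sequence; the final TGA codon is the stop.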