Gene Mlg_0558 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0558 
Symbol 
ID4270313 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp605659 
End bp607152 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content69% 
IMG OID638125299 
ProductPepA aminopeptidase 
Protein accessionYP_741402 
Protein GI114319719 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0260] Leucyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.548083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0000733214 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGACCACA CCGTCAAGAG CAAGACCGCC GATACCGTCA GCAGCCCCTG CGCCGTGGTG 
GGCGTGTTCG AGCGCCGCCG CCTGTCCCCT GCCGCCAAGG CCGTGGACGA GGCCAGCGGC
GGCGCCATAA CCGCGGCCCT GAAGCGGGGC GATATCGAGG CCAAGCCCGG TCAGACCCGC
CTGCTCACCG ATCTGGACAA CGTCAAGGCC GCCCGGGTTC TGCTGGTGGG CCTGGGCGTG
GAGCGCGACC TGGACGAGCG CACCTACCGC AAGGCCGTCA CCGCCGCCGC CCAGGCGGCG
CAGGACTGCG GCGCCGGTGA GGCGACCTTC TACCTGCCGG AGGTCGAGGT GAAGACCCGC
GACCTGGCCT GGCGGGTCCA GCAGCTCGCC ATCGGCGTGA CCGGCGCCCT CTACCGCTTC
GACGACATGA AGAGCGAGGC GGAAACGCCC AGGAAGCCGC TGAAGAAGCT CGCCCTGGGG
GTGGCGGACA AGGCCGAGGC CAAAGTCGCC GATGAGGCCC TGGCGCAGGG GATGGCGGTC
GGCCGGGGCA TGAGCCTGGC CCGCGACCTC GGCAACCTGC CCGGTAACGT CTGCACCCCC
AGCTACCTGG CCGACCAGGC CAAGGCGCTG GGCAAGCGGT TCGACAAACT CAAGGTGCAG
AGCCTGGACC GCAAGGACAT GAAGAAGCTG GGCATGGGCG CGCTGCTGGC GGTGGCCCAG
GGCAGCCAGG AGGAGCCGCG CCTGATCGCC ATGGAGTGGA ACGGCGGCAA GAAGGACGAG
CAGCCCTACG TGCTGGTGGG CAAGGGGATC ACCTTCGACA CCGGCGGCAT CTCCCTGAAA
CCCGGGGCGG CCATGGACGA GATGAAGTTC GACATGTGCG GCGCCGCCAG CGTGTTCGGC
ACCCTGCAGG CGGTGGCCGA GATGAACCTG CCCATCAACG TGGTGGGCGT GGTGCCGGCC
AGCGACAACA TGCCCGACGG CAAGGCCACC CGCCCGGGGG ACATTATCCA GACCCTGTCC
GGGCAGACAG TGGAGGTGCT CAACACCGAC GCCGAGGGCC GGCTGGTGCT GTGCGACGCC
CTTACCTGGT CCGAGCGCTT CAAGCCCAAG GAGATCGTCG ACATCGCCAC CCTCACCGGG
GCCTGCATCA TCGCCCTGGG TCACCACCGG AGTGCGGTGC TGGGCAACCA TCCGCCCCTG
GTCAAGGCCC TGCAGGATGC CGGTGAGACC AGCGGCGACA AATGCTGGGA ACTGCCCCTG
GACCCGGAGT ACGACGAGCA GCTCAAGAGC AACTTCGCCG ACATGGCCAA CATCGGTGGG
CGACCGGCCG GCACCATCAC CGCCGCCAGC TTCCTGGCCC GCTTCACCAA GCGCTACCAG
TGGGCCCACC TGGATATCGC CGGCACCGCC TGGCTCACCG GTGAGCAGAA AGGCGCCACC
GGCCGGCCGG TGCCGCTGCT CACCCGGTAC CTCATGGACC GGGCCGGGGC CTGA
 
Protein sequence
MDHTVKSKTA DTVSSPCAVV GVFERRRLSP AAKAVDEASG GAITAALKRG DIEAKPGQTR 
LLTDLDNVKA ARVLLVGLGV ERDLDERTYR KAVTAAAQAA QDCGAGEATF YLPEVEVKTR
DLAWRVQQLA IGVTGALYRF DDMKSEAETP RKPLKKLALG VADKAEAKVA DEALAQGMAV
GRGMSLARDL GNLPGNVCTP SYLADQAKAL GKRFDKLKVQ SLDRKDMKKL GMGALLAVAQ
GSQEEPRLIA MEWNGGKKDE QPYVLVGKGI TFDTGGISLK PGAAMDEMKF DMCGAASVFG
TLQAVAEMNL PINVVGVVPA SDNMPDGKAT RPGDIIQTLS GQTVEVLNTD AEGRLVLCDA
LTWSERFKPK EIVDIATLTG ACIIALGHHR SAVLGNHPPL VKALQDAGET SGDKCWELPL
DPEYDEQLKS NFADMANIGG RPAGTITAAS FLARFTKRYQ WAHLDIAGTA WLTGEQKGAT
GRPVPLLTRY LMDRAGA