Gene Mlg_0439 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_0439 
Symbol 
ID4270383 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp493783 
End bp495084 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID638125174 
Productputative aminopeptidase 2 
Protein accessionYP_741283 
Protein GI114319600 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1362] Aspartyl aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.773817 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.534448 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGAAG TACAGGATAT GCAGCAGGCG CAGGAACTAC TGGACTTCAT CGACGACAGC 
CCGAGCCCAT GGCATGCGGT CGCCAATATG GCGGAAATGC TGCAGGCGGC GGGCTTTGTC
GAGCTGCGCG AGGATGAGCC CTGGCATCTG AATCCCGGGG ATGCGGCCTA TGTGATCCGC
GAGGAGGCGA GCCTGGTGGC CTTCCGGGTC GGCAGCCGGG CGCCGGAGGC GGCCGGCTTT
CGGGTGTTGG CGGCGCATAC CGATTCCCCT GGGCTGCGGG TCAAGCCGGG GGGCGCTCAC
CGCGCCGGAC CCTTTCTGCG CCTTGGGGTG GAGGTGTATG GCGGCCCGAT CCTCGCGACC
TTCGCCGACC GGGACCTGAC CCTGGCCGGG CGTGTGGCGG TGCGGGGCGA GGTCGGTATC
GACTCGGTGC TTGTGGACTT TCCCGAGGCC CTGGCGCGTC TGCCCACGCC AGCGATCCAC
CTGAATCGGG AGGTGAACGA GCAGGGCCTG AAATTCGACC GTCAGAAGGA GCTGCCGCTG
ATCTTCAGCC TGCCGGACGA CGATGAGCCC TCACCGGAGG CGTTCCGGCA GCTGCTGGCC
ACGCGCGCTG GCGTCGAGCT GGACGACCTG CTGGGCTGGG ATCTGGCGGT GAGCGATACC
CAGCCGGGGG CCTTCTTCGG CGCGGACCGG GAGTTCCTGG CCGCCCCCCG AATCGATAAT
CTCGCCTCCT GCCATGCCGC GATCAAGGCC CTGTTGGCCG TCGAGCAGCC GACGGCGACG
GCGGTGTGTG CGCTCTTTGA CCACGAGGAG ATCGGCAGCA CCACCTATCG GGGAGCGGCC
GGCACGTTGC TGCCCAATGT GTTGGAGCGC CTGGGCGGTG CCGGTGAAGA ATTGCACCAG
GCCAAGGCGC GCAGTCGGCT GGTCAGCGTG GATATGGCCC ATGCCTGGCA CCCGAACTTT
CCCCATTTCT ACGAGGACGA GCACAAGGCG CACGTCAACC ACGGACCGGT GATCAAGGTG
AACGCCAACC AGCGCTACAC CAGTGAGTCC ACCGGCGGGG CCTGGTTCGC CGAGCTTTGC
CGCGGGGCGG GGGTGCCCTG GCAGACCTAT GTGCACCGGA CCGATCTGCC GTGCGGGAGT
ACGATCGGTC CGGTCACCGC GGCTCGGCTC GGGCTACCGG TGATTGATGT GGGCAACGCC
ATCTGGTCCA TGCACAGTGC GCGCGAGAGC GCGGGGGCGA AGGACCACGC CTGGATGACG
GGCGCCCTGT CGGCCTTCCT GGCCGTGCCA CAGTTGCCAT GA
 
Protein sequence
MSEVQDMQQA QELLDFIDDS PSPWHAVANM AEMLQAAGFV ELREDEPWHL NPGDAAYVIR 
EEASLVAFRV GSRAPEAAGF RVLAAHTDSP GLRVKPGGAH RAGPFLRLGV EVYGGPILAT
FADRDLTLAG RVAVRGEVGI DSVLVDFPEA LARLPTPAIH LNREVNEQGL KFDRQKELPL
IFSLPDDDEP SPEAFRQLLA TRAGVELDDL LGWDLAVSDT QPGAFFGADR EFLAAPRIDN
LASCHAAIKA LLAVEQPTAT AVCALFDHEE IGSTTYRGAA GTLLPNVLER LGGAGEELHQ
AKARSRLVSV DMAHAWHPNF PHFYEDEHKA HVNHGPVIKV NANQRYTSES TGGAWFAELC
RGAGVPWQTY VHRTDLPCGS TIGPVTAARL GLPVIDVGNA IWSMHSARES AGAKDHAWMT
GALSAFLAVP QLP