Gene Mlg_2542 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_2542 
Symbol 
ID4270930 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp2883927 
End bp2885240 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content70% 
IMG OID638127301 
Productaminopeptidase P 
Protein accessionYP_743372 
Protein GI114321689 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.158286 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0283016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCCCTG CCGAGTACGC CGCCCGCCGC CGGGAGCTCA TGCAACTGAT CGGGGACGAG 
GGCATCGCCA TCATCCCCGC CGCCACCGAA AAGGTGCGCA ATCGCGACGT GCACTACCCC
TTCCGCCAGG ACAGCGACTT TCGCTACCTC ACCGGCTTTC CTGAGCCGGA CGCGGTGGCC
GTGCTGGTGC CGGGACGGGA ACAGGGCGCC TACCTCCTCT TCTGCCGCGA GCGCAACCCC
GAGCGCGAGG TGTGGGACGG CCCCCGCGCC GGTCAGGAGG GCGCCGTGCG CGACTACGGC
GCCGACGATG CCTTCCCCAT CGACGACATC GACGACATCC TCCCCGGGCT GATGGAGGGC
CGCGAGCGGG TCCACTACAC CATGGGGCTG GACAAGGTCT TCGACCAGCG GGTGATCGAT
TGGGCCCGGC AGGTACGCGG CCGCACCCGC GGCGCCCGCC GCGGACCGGA CGAGTTCATC
GCCCTGGAGC ACCACCTGCA CGAGATGCGC CTGATTAAAC GCCCGGCGGA GCTGGATTGC
ATGCGCCGCG CCGCCCGGGT CACCGGCAAG GCCCACCGCC GGGCCATGCA GGCCTGCCGG
CCGGGCATGA TGGAGTACGA ACTGGAGGCG GAGTTCCTGG CCGCCTTCCG GCGCGCCGGG
GGCGAACCGG CCTACCCCAG CATCGTGGGG GGTGGGGGCA ACGGCTGCGT GCTGCACTAC
ATCCTCAACC GGGACAAGCT GCGCGACGGC GACCTGGTGC TGATCGACGC CGGCTGCGAG
CTGGACGGCT ATGCCGCCGA TGTCACCCGC ACCTTCCCGG TCAACGGCCG CTTCAGCGCC
GAGCAGCGCG CCCTCTACGA GGTGGTACTG GCCGCCCAGG AGGCCGCCAT TGCGGCGGTG
ACCCCGGGGG TGAGCTGGAA CCTCGCCCAC GAGCGCGCCA CCGAGACCCT GGTGGACGGC
CTGCTGGAAC TGGGCATCCT CGATGGCAGC CGCGAGCAAA TCCTGGAAGA AGAGAGCTAC
AAGCGCTTTT TCATGCACCG CACCGGTCAC TGGCTGGGCA TGGACGTGCA CGATGTGGGC
GACTACCGCA TCGACGGCCA GTGGCGGGAA CTGGAGCCGG GGATGACCCT AACCATCGAG
CCCGGGCTCT ATATCGCCCC GGAGAGCGAC GGGGTGGCGG AGCGCTGGCG GGGTATCGGC
GTGCGCATTG AGGACGACCT GCTGGTCACC CGGGAGGGCC ACGAGAACCT GACCCCCGAC
ATCCCCAAGG CCCCGGACGC CATTGAGGCC CTGATGGTGG AGGCTCGGTC ATGA
 
Protein sequence
MPPAEYAARR RELMQLIGDE GIAIIPAATE KVRNRDVHYP FRQDSDFRYL TGFPEPDAVA 
VLVPGREQGA YLLFCRERNP EREVWDGPRA GQEGAVRDYG ADDAFPIDDI DDILPGLMEG
RERVHYTMGL DKVFDQRVID WARQVRGRTR GARRGPDEFI ALEHHLHEMR LIKRPAELDC
MRRAARVTGK AHRRAMQACR PGMMEYELEA EFLAAFRRAG GEPAYPSIVG GGGNGCVLHY
ILNRDKLRDG DLVLIDAGCE LDGYAADVTR TFPVNGRFSA EQRALYEVVL AAQEAAIAAV
TPGVSWNLAH ERATETLVDG LLELGILDGS REQILEEESY KRFFMHRTGH WLGMDVHDVG
DYRIDGQWRE LEPGMTLTIE PGLYIAPESD GVAERWRGIG VRIEDDLLVT REGHENLTPD
IPKAPDAIEA LMVEARS