Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_2542 |
Symbol | |
ID | 4270930 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 2883927 |
End bp | 2885240 |
Gene Length | 1314 bp |
Protein Length | 437 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 638127301 |
Product | aminopeptidase P |
Protein accession | YP_743372 |
Protein GI | 114321689 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.158286 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.0283016 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCCCTG CCGAGTACGC CGCCCGCCGC CGGGAGCTCA TGCAACTGAT CGGGGACGAG GGCATCGCCA TCATCCCCGC CGCCACCGAA AAGGTGCGCA ATCGCGACGT GCACTACCCC TTCCGCCAGG ACAGCGACTT TCGCTACCTC ACCGGCTTTC CTGAGCCGGA CGCGGTGGCC GTGCTGGTGC CGGGACGGGA ACAGGGCGCC TACCTCCTCT TCTGCCGCGA GCGCAACCCC GAGCGCGAGG TGTGGGACGG CCCCCGCGCC GGTCAGGAGG GCGCCGTGCG CGACTACGGC GCCGACGATG CCTTCCCCAT CGACGACATC GACGACATCC TCCCCGGGCT GATGGAGGGC CGCGAGCGGG TCCACTACAC CATGGGGCTG GACAAGGTCT TCGACCAGCG GGTGATCGAT TGGGCCCGGC AGGTACGCGG CCGCACCCGC GGCGCCCGCC GCGGACCGGA CGAGTTCATC GCCCTGGAGC ACCACCTGCA CGAGATGCGC CTGATTAAAC GCCCGGCGGA GCTGGATTGC ATGCGCCGCG CCGCCCGGGT CACCGGCAAG GCCCACCGCC GGGCCATGCA GGCCTGCCGG CCGGGCATGA TGGAGTACGA ACTGGAGGCG GAGTTCCTGG CCGCCTTCCG GCGCGCCGGG GGCGAACCGG CCTACCCCAG CATCGTGGGG GGTGGGGGCA ACGGCTGCGT GCTGCACTAC ATCCTCAACC GGGACAAGCT GCGCGACGGC GACCTGGTGC TGATCGACGC CGGCTGCGAG CTGGACGGCT ATGCCGCCGA TGTCACCCGC ACCTTCCCGG TCAACGGCCG CTTCAGCGCC GAGCAGCGCG CCCTCTACGA GGTGGTACTG GCCGCCCAGG AGGCCGCCAT TGCGGCGGTG ACCCCGGGGG TGAGCTGGAA CCTCGCCCAC GAGCGCGCCA CCGAGACCCT GGTGGACGGC CTGCTGGAAC TGGGCATCCT CGATGGCAGC CGCGAGCAAA TCCTGGAAGA AGAGAGCTAC AAGCGCTTTT TCATGCACCG CACCGGTCAC TGGCTGGGCA TGGACGTGCA CGATGTGGGC GACTACCGCA TCGACGGCCA GTGGCGGGAA CTGGAGCCGG GGATGACCCT AACCATCGAG CCCGGGCTCT ATATCGCCCC GGAGAGCGAC GGGGTGGCGG AGCGCTGGCG GGGTATCGGC GTGCGCATTG AGGACGACCT GCTGGTCACC CGGGAGGGCC ACGAGAACCT GACCCCCGAC ATCCCCAAGG CCCCGGACGC CATTGAGGCC CTGATGGTGG AGGCTCGGTC ATGA
|
Protein sequence | MPPAEYAARR RELMQLIGDE GIAIIPAATE KVRNRDVHYP FRQDSDFRYL TGFPEPDAVA VLVPGREQGA YLLFCRERNP EREVWDGPRA GQEGAVRDYG ADDAFPIDDI DDILPGLMEG RERVHYTMGL DKVFDQRVID WARQVRGRTR GARRGPDEFI ALEHHLHEMR LIKRPAELDC MRRAARVTGK AHRRAMQACR PGMMEYELEA EFLAAFRRAG GEPAYPSIVG GGGNGCVLHY ILNRDKLRDG DLVLIDAGCE LDGYAADVTR TFPVNGRFSA EQRALYEVVL AAQEAAIAAV TPGVSWNLAH ERATETLVDG LLELGILDGS REQILEEESY KRFFMHRTGH WLGMDVHDVG DYRIDGQWRE LEPGMTLTIE PGLYIAPESD GVAERWRGIG VRIEDDLLVT REGHENLTPD IPKAPDAIEA LMVEARS
|
| |