Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mlg_1592 |
Symbol | pepN |
ID | 4268563 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Alkalilimnicola ehrlichii MLHE-1 |
Kingdom | Bacteria |
Replicon accession | NC_008340 |
Strand | + |
Start bp | 1818823 |
End bp | 1821468 |
Gene Length | 2646 bp |
Protein Length | 881 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 638126349 |
Product | aminopeptidase N |
Protein accession | YP_742429 |
Protein GI | 114320746 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.491814 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGATC ACAAGCAGCG TGCGGCGGTG AAGCTCAGCG ACTACACCCC GCCCGCGTTC CGGGTGGAGC ATATCGACCT GTACTTCTCC CTCGGCGAGG AGTACACCCG GGTGCGCTCG CGGATGCGGC TGTGCCGTGC CGAGGGCACC GACCCGCACA CCCCACTGCA CCTGGATGGC GAGGCCCTGG AGTTGGAGGC GCTTTACCTG GACGGCCAGG CGCTGGCGAT CGACGACTAC CTGCTTACCC GGCAGGGGCT GACCATTGAG CGCGTGCCCG ACCGGTTCGA GCTGGAGGTG CACACCCTCC TCCGGCCCCA GGACAACACC GCGCTGTCCG GCCTGTATCG CTCCGGTGGC ATGTTCTGCA CCCAATGCGA GGCGGAGGGC TTCCGGCGAA TCACCTACTA CCCCGACCGT CCGGACGTGC TCAGCCGCTT TACCACCACC ATCGAGGCCG ACCGCGAGCG CTACCCGGTG CTTCTATCCA ATGGCAACGC CGTGGACCGC GGCGAGAGCG AGGGGGGCCG CCACTGGGTC CGCTGGGACG ACCCCTGGCC CAAGCCCAGC TACCTCTTCG CGCTGGTGGC GGGGGACCTG CACTGCCACC GCGACCGCTA CCGCACCGCC TCCGGTCGCG ACGTGCAACT GGCCTTCTAC GTGGAGCACG CAAACGCCGG ACGCACCGGC CACGCCATGG AGAGCCTCAA GCGCGCCATG CGCTGGGACG AGGAGACCTA CGGCCTGGAG TACGACCTGG ACATCTACAT GGTCGTGGCC GTGGGCGACT TCAACATGGG CGCCATGGAG AACAAAGGGC TCAACATCTT CAACACCCAG TACGTCCTGG CCAGCCCGGA GACCGCCACC GACGCCGATT TCGAGGCGGT GGAGGCGGTG ATCGGCCACG AGTATTTTCA CAACTACACC GGCAACCGGG TGACCTGCCG CGACTGGTTC CAGCTCAGTC TCAAGGAAGG TCTGACGGTC TTCCGCGAGC AGCAGTTCAG CGAGGCGATG GGCTCACCGG CGGTGCAGCG CATCCAGCAG GTGCGCCTGC TGCGCTCGGC CCAATTCCCG GAAGACGCCA GCCCCATGGC CCATCCGGTC CGGCCCGACG CCTACGTCGA GATCAACAAC TTCTACACCG CCACGGTCTA CATCAAAGGC GCGGAAGTGA TCCGCATGTA CCACACCCTG CTGGGCGACG GCGCCTTCCG CAACGGTGTG CGCCGTTACC TGGAGCGCTT TGACGGCCAA GCAGTGACCA TCGAGGACTT CCTCCAGACC ATGGCCGAGA CCGGTGGCCG GGACCTCACC CAGTTCGGGC GCTGGTACAC CCAGGCCGGC ACACCCCGCC TGCGAGTGCG CGATGCCTTC GACGCGGACG CCGGGCGTTA CACCCTGCGC GTAGTGCAGG AGTGCCCGCC CACCCCTGGT CAGCCGGCAA AGCAGCCGTT CCACATCCCG CTGGCGATGG GCCTGGTGGG ACGGGACGGG CAGGCGCTCC CCCTGCGGCT GGCCGGCGAG CCCGAGGGCC AGGCCACGAC CCGGGTGCTG GAGTTGACCC GCTCGGAGCA GGCTTTCGTC TTCGAGGGCC TGGCCGAGCG GCCGGTGCCC TCCCTGCTGC GGGGCTTTTC CGCGCCAGTC ATCCTCGACT ACGACTACAG CGACGACGAC CTGGCCTTCC TGCTCACCCA CGACAGCGAT GCCTTTGCCC GCTGGGAGGC CGGTCAGCAG CTCGCCGTGC GGGTGATCCT GCGCCGGGTG CACGGCGAGC GTCGGGAGGA GGACCTGGAC AAGCTCCGGC GCGCCTTTGC GGGGGTGTTG GAGCATCCCG GGCTCGACGC CTCATTGGCG GCCGAGGCCC TCGCCCTGCC TCAGGAGACC TACCTGGCGC AGCAGCTCGA GCAGGCCGAT CCCGTGGCCA TCCGGGCAGC GCGCGAGGGC GTGCGCGCGG AACTGGCGGG CGCCCTGGCC GCCACCTGGC GGACCGTCTA TAGCCGCTAC CGGCCCCAGG GCCCATGGCG GCTCGAACCG GCGGCTATTG CCGGACGGCG CTTGGCCAAT CTCGCCCTCG GCTACCTGGC GGCCACCGGC GCGGCCGAGG ACGATGAGCG GGTCGAGCGT CAATACCACC AAGCCGACAA CATGACCGAC AGCCTAGCCG CCCTCAGCCT GTTGGCCGAC CGCGACGACG CCGCAGCGCA GGCCGCCCTG GACGACTTCC ATCAGCGCTG GCGTCACGTG CCGCTGGTGC TGGACAAGTG GTTCCGGGTG CAGGCCATGT CGCGTCATCC CGGCGCACTG GCACGGGTCC AGTCCCTGCT CCGGCATCCC GATTTCGACC TGCACAACCC GAACCGGGTC CGCTCGGTCA TCGGGGCCTT CGCCCAGGGC AACCCCGCGG CCTTTCACGA CTCCAGCGGC GAGGGGTACC GCCTGCTGGC CGACCACATC CTGAGACTGG ATACTCTCAA TCCCCAAGTC GCCGCGCGGA TGGCGCTGCC CTTGAGCAAG TGGCAACGTT ACGATCTGCC GAGGCAGCAG ATAATGAAAA CCGAACTCCA ACGGATCGCT GAGGCCCCTT CGCTGTCGAA CGACGTGTAC GAGGTGGTCT CCCGCAGTCT GGAGAACATC GCCTGA
|
Protein sequence | MTDHKQRAAV KLSDYTPPAF RVEHIDLYFS LGEEYTRVRS RMRLCRAEGT DPHTPLHLDG EALELEALYL DGQALAIDDY LLTRQGLTIE RVPDRFELEV HTLLRPQDNT ALSGLYRSGG MFCTQCEAEG FRRITYYPDR PDVLSRFTTT IEADRERYPV LLSNGNAVDR GESEGGRHWV RWDDPWPKPS YLFALVAGDL HCHRDRYRTA SGRDVQLAFY VEHANAGRTG HAMESLKRAM RWDEETYGLE YDLDIYMVVA VGDFNMGAME NKGLNIFNTQ YVLASPETAT DADFEAVEAV IGHEYFHNYT GNRVTCRDWF QLSLKEGLTV FREQQFSEAM GSPAVQRIQQ VRLLRSAQFP EDASPMAHPV RPDAYVEINN FYTATVYIKG AEVIRMYHTL LGDGAFRNGV RRYLERFDGQ AVTIEDFLQT MAETGGRDLT QFGRWYTQAG TPRLRVRDAF DADAGRYTLR VVQECPPTPG QPAKQPFHIP LAMGLVGRDG QALPLRLAGE PEGQATTRVL ELTRSEQAFV FEGLAERPVP SLLRGFSAPV ILDYDYSDDD LAFLLTHDSD AFARWEAGQQ LAVRVILRRV HGERREEDLD KLRRAFAGVL EHPGLDASLA AEALALPQET YLAQQLEQAD PVAIRAAREG VRAELAGALA ATWRTVYSRY RPQGPWRLEP AAIAGRRLAN LALGYLAATG AAEDDERVER QYHQADNMTD SLAALSLLAD RDDAAAQAAL DDFHQRWRHV PLVLDKWFRV QAMSRHPGAL ARVQSLLRHP DFDLHNPNRV RSVIGAFAQG NPAAFHDSSG EGYRLLADHI LRLDTLNPQV AARMALPLSK WQRYDLPRQQ IMKTELQRIA EAPSLSNDVY EVVSRSLENI A
|
| |