Gene Mlg_1592 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMlg_1592 
SymbolpepN 
ID4268563 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAlkalilimnicola ehrlichii MLHE-1 
KingdomBacteria 
Replicon accessionNC_008340 
Strand
Start bp1818823 
End bp1821468 
Gene Length2646 bp 
Protein Length881 aa 
Translation table11 
GC content68% 
IMG OID638126349 
Productaminopeptidase N 
Protein accessionYP_742429 
Protein GI114320746 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.491814 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGATC ACAAGCAGCG TGCGGCGGTG AAGCTCAGCG ACTACACCCC GCCCGCGTTC 
CGGGTGGAGC ATATCGACCT GTACTTCTCC CTCGGCGAGG AGTACACCCG GGTGCGCTCG
CGGATGCGGC TGTGCCGTGC CGAGGGCACC GACCCGCACA CCCCACTGCA CCTGGATGGC
GAGGCCCTGG AGTTGGAGGC GCTTTACCTG GACGGCCAGG CGCTGGCGAT CGACGACTAC
CTGCTTACCC GGCAGGGGCT GACCATTGAG CGCGTGCCCG ACCGGTTCGA GCTGGAGGTG
CACACCCTCC TCCGGCCCCA GGACAACACC GCGCTGTCCG GCCTGTATCG CTCCGGTGGC
ATGTTCTGCA CCCAATGCGA GGCGGAGGGC TTCCGGCGAA TCACCTACTA CCCCGACCGT
CCGGACGTGC TCAGCCGCTT TACCACCACC ATCGAGGCCG ACCGCGAGCG CTACCCGGTG
CTTCTATCCA ATGGCAACGC CGTGGACCGC GGCGAGAGCG AGGGGGGCCG CCACTGGGTC
CGCTGGGACG ACCCCTGGCC CAAGCCCAGC TACCTCTTCG CGCTGGTGGC GGGGGACCTG
CACTGCCACC GCGACCGCTA CCGCACCGCC TCCGGTCGCG ACGTGCAACT GGCCTTCTAC
GTGGAGCACG CAAACGCCGG ACGCACCGGC CACGCCATGG AGAGCCTCAA GCGCGCCATG
CGCTGGGACG AGGAGACCTA CGGCCTGGAG TACGACCTGG ACATCTACAT GGTCGTGGCC
GTGGGCGACT TCAACATGGG CGCCATGGAG AACAAAGGGC TCAACATCTT CAACACCCAG
TACGTCCTGG CCAGCCCGGA GACCGCCACC GACGCCGATT TCGAGGCGGT GGAGGCGGTG
ATCGGCCACG AGTATTTTCA CAACTACACC GGCAACCGGG TGACCTGCCG CGACTGGTTC
CAGCTCAGTC TCAAGGAAGG TCTGACGGTC TTCCGCGAGC AGCAGTTCAG CGAGGCGATG
GGCTCACCGG CGGTGCAGCG CATCCAGCAG GTGCGCCTGC TGCGCTCGGC CCAATTCCCG
GAAGACGCCA GCCCCATGGC CCATCCGGTC CGGCCCGACG CCTACGTCGA GATCAACAAC
TTCTACACCG CCACGGTCTA CATCAAAGGC GCGGAAGTGA TCCGCATGTA CCACACCCTG
CTGGGCGACG GCGCCTTCCG CAACGGTGTG CGCCGTTACC TGGAGCGCTT TGACGGCCAA
GCAGTGACCA TCGAGGACTT CCTCCAGACC ATGGCCGAGA CCGGTGGCCG GGACCTCACC
CAGTTCGGGC GCTGGTACAC CCAGGCCGGC ACACCCCGCC TGCGAGTGCG CGATGCCTTC
GACGCGGACG CCGGGCGTTA CACCCTGCGC GTAGTGCAGG AGTGCCCGCC CACCCCTGGT
CAGCCGGCAA AGCAGCCGTT CCACATCCCG CTGGCGATGG GCCTGGTGGG ACGGGACGGG
CAGGCGCTCC CCCTGCGGCT GGCCGGCGAG CCCGAGGGCC AGGCCACGAC CCGGGTGCTG
GAGTTGACCC GCTCGGAGCA GGCTTTCGTC TTCGAGGGCC TGGCCGAGCG GCCGGTGCCC
TCCCTGCTGC GGGGCTTTTC CGCGCCAGTC ATCCTCGACT ACGACTACAG CGACGACGAC
CTGGCCTTCC TGCTCACCCA CGACAGCGAT GCCTTTGCCC GCTGGGAGGC CGGTCAGCAG
CTCGCCGTGC GGGTGATCCT GCGCCGGGTG CACGGCGAGC GTCGGGAGGA GGACCTGGAC
AAGCTCCGGC GCGCCTTTGC GGGGGTGTTG GAGCATCCCG GGCTCGACGC CTCATTGGCG
GCCGAGGCCC TCGCCCTGCC TCAGGAGACC TACCTGGCGC AGCAGCTCGA GCAGGCCGAT
CCCGTGGCCA TCCGGGCAGC GCGCGAGGGC GTGCGCGCGG AACTGGCGGG CGCCCTGGCC
GCCACCTGGC GGACCGTCTA TAGCCGCTAC CGGCCCCAGG GCCCATGGCG GCTCGAACCG
GCGGCTATTG CCGGACGGCG CTTGGCCAAT CTCGCCCTCG GCTACCTGGC GGCCACCGGC
GCGGCCGAGG ACGATGAGCG GGTCGAGCGT CAATACCACC AAGCCGACAA CATGACCGAC
AGCCTAGCCG CCCTCAGCCT GTTGGCCGAC CGCGACGACG CCGCAGCGCA GGCCGCCCTG
GACGACTTCC ATCAGCGCTG GCGTCACGTG CCGCTGGTGC TGGACAAGTG GTTCCGGGTG
CAGGCCATGT CGCGTCATCC CGGCGCACTG GCACGGGTCC AGTCCCTGCT CCGGCATCCC
GATTTCGACC TGCACAACCC GAACCGGGTC CGCTCGGTCA TCGGGGCCTT CGCCCAGGGC
AACCCCGCGG CCTTTCACGA CTCCAGCGGC GAGGGGTACC GCCTGCTGGC CGACCACATC
CTGAGACTGG ATACTCTCAA TCCCCAAGTC GCCGCGCGGA TGGCGCTGCC CTTGAGCAAG
TGGCAACGTT ACGATCTGCC GAGGCAGCAG ATAATGAAAA CCGAACTCCA ACGGATCGCT
GAGGCCCCTT CGCTGTCGAA CGACGTGTAC GAGGTGGTCT CCCGCAGTCT GGAGAACATC
GCCTGA
 
Protein sequence
MTDHKQRAAV KLSDYTPPAF RVEHIDLYFS LGEEYTRVRS RMRLCRAEGT DPHTPLHLDG 
EALELEALYL DGQALAIDDY LLTRQGLTIE RVPDRFELEV HTLLRPQDNT ALSGLYRSGG
MFCTQCEAEG FRRITYYPDR PDVLSRFTTT IEADRERYPV LLSNGNAVDR GESEGGRHWV
RWDDPWPKPS YLFALVAGDL HCHRDRYRTA SGRDVQLAFY VEHANAGRTG HAMESLKRAM
RWDEETYGLE YDLDIYMVVA VGDFNMGAME NKGLNIFNTQ YVLASPETAT DADFEAVEAV
IGHEYFHNYT GNRVTCRDWF QLSLKEGLTV FREQQFSEAM GSPAVQRIQQ VRLLRSAQFP
EDASPMAHPV RPDAYVEINN FYTATVYIKG AEVIRMYHTL LGDGAFRNGV RRYLERFDGQ
AVTIEDFLQT MAETGGRDLT QFGRWYTQAG TPRLRVRDAF DADAGRYTLR VVQECPPTPG
QPAKQPFHIP LAMGLVGRDG QALPLRLAGE PEGQATTRVL ELTRSEQAFV FEGLAERPVP
SLLRGFSAPV ILDYDYSDDD LAFLLTHDSD AFARWEAGQQ LAVRVILRRV HGERREEDLD
KLRRAFAGVL EHPGLDASLA AEALALPQET YLAQQLEQAD PVAIRAAREG VRAELAGALA
ATWRTVYSRY RPQGPWRLEP AAIAGRRLAN LALGYLAATG AAEDDERVER QYHQADNMTD
SLAALSLLAD RDDAAAQAAL DDFHQRWRHV PLVLDKWFRV QAMSRHPGAL ARVQSLLRHP
DFDLHNPNRV RSVIGAFAQG NPAAFHDSSG EGYRLLADHI LRLDTLNPQV AARMALPLSK
WQRYDLPRQQ IMKTELQRIA EAPSLSNDVY EVVSRSLENI A