Gene Mpe_A2774 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A2774 
SymbolpepN 
ID4784671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp2953563 
End bp2956259 
Gene Length2697 bp 
Protein Length898 aa 
Translation table11 
GC content71% 
IMG OID640091345 
Productaminopeptidase N 
Protein accessionYP_001021963 
Protein GI124267959 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0308] Aminopeptidase N 
TIGRFAM ID[TIGR02414] aminopeptidase N, Escherichia coli type 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGAAG GCACCGCTTC CCTCGTCCGC CGCGAGGACT ACAGCGCCCC GGCCTACTGG 
ATCCGCAGCG TCGACCTCAC GTTCGACCTG GACCCGCAGA AGACGCTGGT CATCAACCGC
ATGGAGGTCG AGCGCAACCC CGACCTGCCG ATGCAGCCGC TGCGCCTGGA CGGCGAGGAC
ATCAACCTCA CGCGGGTGCT GGTCAACGGC GAGAGCATCT CCTTCCGCGT CGAGGGCGCG
CAACTGGTGC TCGACGGCCT GCCGGACGGC CGCTTCACGC TCGAGCTGCG CAACACCTGC
GCACCGGCGA AGAACACCCA GCTGTCCGGC CTGTACACCT CCGGCGGCGG CTTCTTCACG
CAGTGCGAGG CGCAGGGCTT CCGCCGCATC ACCTACTTCC TCGACCGGCC CGACGTGATG
GCGGTCTACA CCGTCACGCT GAAGGCCGAT GCCAAGGCCT ACCCGGTGCT CCTGTCGAAC
GGCAACCTCG TGGAGCAGGC CGAACTCGAC GGCGGCAAGC ACTGCGCGAA GTGGCACGAC
CCCTTCCCGA AGCCGAGCTA CCTGTTCGCG CTGGTCGCGG CCGACCTGGT GATGCGCGAG
CAGAAGATCC GCAGCCGTTC GGGCAAGGAC CACCTGCTGC AGGTCTATGT GCGGGCCGGT
GACCTCGACA AGACCGAGCA CGCGATGAAC TCGCTGATCG CCTCGGTGGC CTGGGAAGAG
GCGCGCTTCG GCCTGTCGCT CGACCTGGAG CGCTTCATGA TCGTCGCGGT CAGCGACTTC
AACATGGGCG CGATGGAGAA CAAGGGCCTG AACATCTTCA ACACGAAGTT CGTGCTGGCC
AGCGCCGCCA CCGCCACCGA CTTCGATTTC GCGCACGTCG AGAGCGTGGT CGGCCACGAG
TACTTCCACA ACTGGACCGG CAACCGCGTG ACCTGCCGCG ACTGGTTCCA GCTGTCGTTG
AAGGAGGGGC TCACGGTCTT CCGCGACCAG GAGTTCAGCA TGGACATGGC GGGCGCCGCC
AGCGCCCGCG CCGTGCAGCG CATCCAGGAC GTGCGCCTGC TGCGTCAGGT GCAGTTCTCC
GAGGACGCCG GCCCGATGGC GCACCCGGTG CGGCCCGACC AGTACCAGGC GATCGACAAC
TTCTACACCG CCACCGTCTA TGAAAAGGGC GCCGAGGTGG TGCGGATGAT GCACACGCTC
GTCGGTCGCG ACGGCTTCGC CGCCGGCATG AAGCGCTACT TCGAGCGCCA CGACGGCCAG
GCCGTGACCT GCGACGATTT CGCGCAGGCG ATCGCCGACG CCAACCCCGG CAGCGCCCTG
GCCGGGCGAC TGGACGCCTT CAAACGCTGG TACGCGCAGG CCGGCACGCC GCAGCTCGCC
GCGAGCGGCC GCTATGACGC CGAGTCTCGC CGCTACACGC TCGAACTCAC GCAGCGCGGC
CTGCCCACGC CGGGCCAGCC CGACAAGCAG CCGTGGGTGA TCCCGGTCGC AATCGGCCTG
GTGGCCGCCG ACGACGGCCG TGCGTTGCCG CTGCAGCTCG AGGGCGAACC GGCCGCGGGC
GGCACCGACC GCGTGCTGGT GCTCGACCAG CCGAGCGCCG CCTTCACCTT CGTCGGCGTC
GAGTCCGAAC CCGTGCCCTC GCTGCTGCGC GGCTTCTCGT CGCCGGTGGT CCTGGACGCC
GCACTGACGG ACGCCCAGCT GCTGGTGCTG CTGGCGCATG ACCAAGACGC CTTCAACCGC
TGGGAGGCCG GCCAGCAGCT CGCGTTGCGC CGCCTGCTCG CGGCGGCCCG ACGGCCGGAC
GACGGCACGC CGGTGCTCGA CGAGCCCTTC GTCGCGGCAA TGCACAGCCT GCTGCGCGAT
CCCTCACTCG ACGCCGCCTT CAAGGAGCTC GCGCTCACGC TGCCGACCGA AACCTATGTG
GGCGAGCGGC TGGGTGCGGA GATCGACCCA CAGCGTGTGC ATGCGGTGCG CGAGGCCGCG
CGCCTGCAGC TCGCGCAGGC GCTGCGCGAC GACTGGGTCT GGGCCTACGA GCACCACCAG
CCCGCGGGCG GCTACTCGCC CGACGCACAC AGCGCCGGCC AGCGCGCGCT GGCCAACCTG
GCTCTGGCGA TGCTGGTGCT CGACGCCACC GGCAGCGGCG ACACCGTCTG GCCCGGCCGG
GCCTACCAGC GCTTCAAGGA CGCCGGCAAC ATGACCGACC GCTTCGGCGC GCTGTCGGCT
CTGGTGAACG CGCACGCCGA GCTGGCGGTG CCGGCGTTGG AGCGCTTTCA CGCGCTGTTC
AAGGGCGAGG CGCTGGTGAT CGACAAGTGG TTCGCGCTGC AGGCCGGCGC GAGCGAGCCG
GTCGGCGAGC ACGCCGGCCG CGTGTTCACG CGGGCCAAGG CCCTGCTGCA GCATGCCGAC
TTCTCGCTGC GCAACCCGAA CCGGGCGCGC AGCGTGCTGG CGGCGCTGTT CCTCAACAAC
CCGGCCGCCT TCCACCGCCG CGACGCGGCC GGCTACGTGT TCTGGGCCGA GCGCGTGTGC
GAGGTCGACG CCATCAACCC GCAGCTCGCC TCGCGGCTGG CGCGCGCGCT CGACCGCTGG
CGTGCGCTGG CCGAGCCTTA CCGCAGCGCC GCGCGCGAGG CGATCGCCCG CGTGGCGGCC
AAGCCGGAGC TGTCCGAGGA CACGCGCGAG ATCGTCACGC GCGCGCTGCA GGACTGA
 
Protein sequence
MREGTASLVR REDYSAPAYW IRSVDLTFDL DPQKTLVINR MEVERNPDLP MQPLRLDGED 
INLTRVLVNG ESISFRVEGA QLVLDGLPDG RFTLELRNTC APAKNTQLSG LYTSGGGFFT
QCEAQGFRRI TYFLDRPDVM AVYTVTLKAD AKAYPVLLSN GNLVEQAELD GGKHCAKWHD
PFPKPSYLFA LVAADLVMRE QKIRSRSGKD HLLQVYVRAG DLDKTEHAMN SLIASVAWEE
ARFGLSLDLE RFMIVAVSDF NMGAMENKGL NIFNTKFVLA SAATATDFDF AHVESVVGHE
YFHNWTGNRV TCRDWFQLSL KEGLTVFRDQ EFSMDMAGAA SARAVQRIQD VRLLRQVQFS
EDAGPMAHPV RPDQYQAIDN FYTATVYEKG AEVVRMMHTL VGRDGFAAGM KRYFERHDGQ
AVTCDDFAQA IADANPGSAL AGRLDAFKRW YAQAGTPQLA ASGRYDAESR RYTLELTQRG
LPTPGQPDKQ PWVIPVAIGL VAADDGRALP LQLEGEPAAG GTDRVLVLDQ PSAAFTFVGV
ESEPVPSLLR GFSSPVVLDA ALTDAQLLVL LAHDQDAFNR WEAGQQLALR RLLAAARRPD
DGTPVLDEPF VAAMHSLLRD PSLDAAFKEL ALTLPTETYV GERLGAEIDP QRVHAVREAA
RLQLAQALRD DWVWAYEHHQ PAGGYSPDAH SAGQRALANL ALAMLVLDAT GSGDTVWPGR
AYQRFKDAGN MTDRFGALSA LVNAHAELAV PALERFHALF KGEALVIDKW FALQAGASEP
VGEHAGRVFT RAKALLQHAD FSLRNPNRAR SVLAALFLNN PAAFHRRDAA GYVFWAERVC
EVDAINPQLA SRLARALDRW RALAEPYRSA AREAIARVAA KPELSEDTRE IVTRALQD