Gene Information |
Locus tag | M446_0267 |
Symbol | pepN |
ID | 6134582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Methylobacterium sp. 4-46 |
Kingdom | Bacteria |
Replicon accession | NC_010511 |
Strand | + |
Start bp | 326384 |
End bp | 329068 |
Gene Length | 2685 bp |
Protein Length | 894 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641640594 |
Product | aminopeptidase N |
Protein accession | YP_001767272 |
Protein GI | 170738617 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
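The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon; the function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(326384, 329068)
print(length)                     # 2685
print(protein_length_aa(length))  # 894
```

Both values match the Gene Length and Protein Length rows of this record.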
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.341125 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0226017 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCACCG AGTCCCCGCC GATCGTCCGC CTCGCCGATT ACCGGCCGAG CGACTACCTG ATCGACCGGG TCGATCTCAA CGTGCGGCTG CACCCGACCG AGACCCGGAT CTCCGCCACC CTGGCGCTGC GGCCCAACCC GCGCGGCGAG GCCGGTGCGC CGCTCCACCT CGACGGCGAC GACCTCGCGC TGCTGGCGGT CGCCCTCGAC GGCCAGCCGA CCGCGCCGGG GGCGATCGAA CTCGGCCCCC TGGGCCTCAC CCTGCACCGG CCGCCGCAGC GCCCCTTCGT GCTCAGCCTC GAGACGCAGG TGAACCCGAG CGCCAACACC AAGCTGATGG GGCTCTACCG CTCGAACGGC GTCTACTGCA CCCAGTGCGA GGCGGACGGG TTCCGGCGCA TCACCTACTT CCTCGACCGG CCGGACGTGC TCGCGGTCTA CACCACCCGG ATCGAGGCCG ACCGGGACGA GGCGCCGGTG CTCCTCGGCA ACGGCAACCC GGTCGAGGCG GGCGAGGCCG GGCCGGGGCG CCACTACGCC GTCTGGCACG ATCCCCACCC GAAGCCCGCC TACCTGTTCG CCCTGGTCGG CGGCCGGCTC GGCCGCATCG CCACCCGCTT CACCACCATG GAGGGCCGCG ACGTCGCGGT CGCGGTCTAC GTCGAGCCGG GCAAGGAGGC GCGCGCGCCC TACGCCCTCG ACGCCGTCAC CCGCGCCATG GCCTGGGACG AGCGGCGCTT CGGCCGCGCC TACGACCTCG ACGTCTTCAA CGTCGTCGCG GTCTCCGACT TCAACATGGG CGCGATGGAG AACAAGGGCC TCAACATCTT CAACGACAAG TACGTGCTGG CCAGCCCCGA CACCGCCACC GACACGGACT ACGCCAACAT CGAGGCGATC ATCGCCCACG AGTATTTCCA CAACTGGTCG GGCAACCGGG TGACCTGCCG CGACTGGTTC CAGCTCTGCC TCAAGGAGGG CCTGACGGTC TTCCGCGACC AGGAATTCTC CTCCGACGAG CGCTCGCGCC CGGTCCACCG CATCGCCGAG GTGCGCACCC TGCGCGCGCG CCAGTTCCCG GAGGATGCGG GGCCGCTCGC CCATCCGGTC CGGCCCCAGG CCTACCGGGA GATCAACAAC TTCTACACGG CGACGGTCTA CGAGAAGGGC GCCGAGATCG TCAGGATGCT GGGCACCCTG CTCGGCCCGG CGGCGTTCCG GGCCGGCATG GACCTGTTCT TCGCGCGCTG CGACGGCACC GCCGCCACCG TGGAGGATTT CCTGGCCGCC TTCGCGCAGG TGAGCGGGCG CGACCTCGCC GCCTTCTCGC GCTGGTACGC GCAGGCCGGC ACCCCGACCG TGTCGGTGGC CGGGCGCTAC GACCCGGCCC AGCGCAGCTA CACCCTCGAC TTCCACCAGA GCCTGCCGGC CGTCGCCACG GAGGCGGCCG GCGGCGGCCC GCCCCAGCCC CTGGTGATCC CGGTCGCCCT CGGCCTCGTC GGCCGCGGCG GCGGCGCCCT CGAGGCCCGC TCGGACCGGG TGCGGGACGG CGTCTTCGTG CTGGAGGGCG AGAGCGACCG CCTCGTCTTC ACGGAGGTGG CGGCCGAGCC GGTGCCCTCC CTGTTCCGGG GCTTCTCGGC CCCCGTGAAG GTCGCCCATT CCCTCGACAC GGCCCAGCGC CTGACCCTGC TCGCCCACGA CAGCGACCCG TTCAACCGCT GGCAGGCGGC CCAGAACCTC GCCCTCGACC TGGTCACCGC GCGGGCGCGG CTCGGTGCCC CGATCGAGGC CGATGCGGGC 
CCGGAGGCGG CCGCCCTCGC GGAGGCGCTC GGCGCCTTCC TGGACGGGGA GGCCCTGCGG GACCCGGCCT TCGCGGCCCT GGTCCTGGCG ATCCCGGGCG ACCAGGAGGC CGCGCAGGAG ATCAGCACCA ACGTCGACCC GGACGCGATC CACCGCGCCC GCTGGACCCT GCGGGCCCAT CTCGGCCGCG CCCTGCTGCC GCGGCTCGTC GCCCTGCGCG ACGCGCTCGC GGCCCCGCCC GGCAGCCCGT TCAGCCCCGA CGCCGCGAGC GCCGGGCGCC GGTCCCTGCG CAACGCCGCC CTGGACCTGA TCGCGGCGGC CGATCCCGCC CGCGGGACGG CGCTCGCCGA GGCGCAGCTG CGCGAGGCCG ACAACATGAC CGACCGCCTC GCCGCCCTGG CGGTGCTGAC GCTCCTGCCC GGCGAGGCGC GCGAGCGCGC CCTCGCGGCG TTCGGGGAGA CCTACCGGGG CGAGCCCCTC GTCCTCGACA AGTGGTTCGC CCTCCAGGCG ATGATCCCCG AGGCCGGCAC CGTGGCGCGG GTGCGGGCGC TGATGCGCCA CGAGGGCTTC TCGGCCTCGA ACCCGAACCG GGTGCGGGCG CTGGTCGGCA GCTTCAGCCT CAACAACCCG ACCCAGTTCC ACCGGGCGGA CGGTGCCGGC TACGAACTCC TCGCCGAGAC GGTGCTCGAC GTCGACTCGC GCAACCCCCA GGTCGCGGCA CGTTTGCTCA CGGCCTTCAA CACGTGGCGG ATGATGGAGC CGACCCGGCG CGCCCGCGCG GAGGCGCAGC TGCGCGCCAT CGCGGCCGCC CCCGGCCTGT CTCCCGACGC GGGCGACATC GCCAGCCGCT CCCTGGCGCC GGCCCGGCAC CACATAAACA CCTGA
|
Protein sequence | MRTESPPIVR LADYRPSDYL IDRVDLNVRL HPTETRISAT LALRPNPRGE AGAPLHLDGD DLALLAVALD GQPTAPGAIE LGPLGLTLHR PPQRPFVLSL ETQVNPSANT KLMGLYRSNG VYCTQCEADG FRRITYFLDR PDVLAVYTTR IEADRDEAPV LLGNGNPVEA GEAGPGRHYA VWHDPHPKPA YLFALVGGRL GRIATRFTTM EGRDVAVAVY VEPGKEARAP YALDAVTRAM AWDERRFGRA YDLDVFNVVA VSDFNMGAME NKGLNIFNDK YVLASPDTAT DTDYANIEAI IAHEYFHNWS GNRVTCRDWF QLCLKEGLTV FRDQEFSSDE RSRPVHRIAE VRTLRARQFP EDAGPLAHPV RPQAYREINN FYTATVYEKG AEIVRMLGTL LGPAAFRAGM DLFFARCDGT AATVEDFLAA FAQVSGRDLA AFSRWYAQAG TPTVSVAGRY DPAQRSYTLD FHQSLPAVAT EAAGGGPPQP LVIPVALGLV GRGGGALEAR SDRVRDGVFV LEGESDRLVF TEVAAEPVPS LFRGFSAPVK VAHSLDTAQR LTLLAHDSDP FNRWQAAQNL ALDLVTARAR LGAPIEADAG PEAAALAEAL GAFLDGEALR DPAFAALVLA IPGDQEAAQE ISTNVDPDAI HRARWTLRAH LGRALLPRLV ALRDALAAPP GSPFSPDAAS AGRRSLRNAA LDLIAAADPA RGTALAEAQL READNMTDRL AALAVLTLLP GEARERALAA FGETYRGEPL VLDKWFALQA MIPEAGTVAR VRALMRHEGF SASNPNRVRA LVGSFSLNNP TQFHRADGAG YELLAETVLD VDSRNPQVAA RLLTAFNTWR MMEPTRRARA EAQLRAIAAA PGLSPDAGDI ASRSLAPARH HINT
|
| |
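The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch in plain Python, not the tool used by the database. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard genetic code; it does not handle the alternative start codons that table 11 permits, which is not an issue here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored, so the 10-base blocks used in this record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate("ATGCGCACCG AGTCCCCGCC GATCGTCCGC"))  # MRTESPPIVR
# Last 15 bases of the gene sequence, including the TGA stop codon.
print(translate("CACATAAACA CCTGA"))  # HINT
```

The two outputs agree with the first ten and last four residues of the protein sequence listed above.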