Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Dtpsy_2249 |
Symbol | pepN |
ID | 7384841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidovorax ebreus TPSY |
Kingdom | Bacteria |
Replicon accession | NC_011992 |
Strand | + |
Start bp | 2390739 |
End bp | 2393450 |
Gene Length | 2712 bp |
Protein Length | 903 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 643655557 |
Product | aminopeptidase N |
Protein accession | YP_002553687 |
Protein GI | 222111423 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.328453 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAGAG AAGGACAAGC CATCGCCATC CACCGTGCCG ACTACGCGGC CCCCGCGTAC TGGATCGACA CGGTGGATCT CACCTTTGAC CTGGACCCGG CCAAGACCCG CGTGCTGAGC CGCCTGCGCG TGCGCCGCAA TGCCGACGTA CCCGCGCAGC CGTTGCGCCT GGATGGCGAC GAGCTGAACC TGGCGCGCGT GATGGTCAAC GGCGCCGGCA CCTCGTTCAA GCTGGACGGC GGCCAGCTGG TGCTGGAAAA CCTGCCGGAG GGCACCGGGC CCTTCGAGCT GGAGATTTTC ACCACCTGCG CGCCCGCCAA GAACACCCAG CTGTCGGGCC TGTACGTGAG CCAGGGCACC TTCTTCACGC AGTGCGAGGC CGAGGGCTTT CGGCGCATCA CCTACTTCCT GGACCGCCCC GACGTCATGG CCAGCTACAG CGTGCTGCTG CGCGCCAGCA AGGCCGACTA CCCCGTGCTG CTGTCCAACG GCAACCTGGT GGACAGCGGC GAGCTGGAAG ACGGCCGCCA CTTCGCCAAG TGGGTGGACC CGCACAAGAA GCCCAGCTAC CTGTTCGCCC TGGTGGCCGG CAAGCTGGTC GCGCGCGAGC AGAAGATCAA GAGCCGCGCG GGGCGCGAGC ATCTGCTGCA GGTGTACGTG CGCCCGGGCG ACCTGGAGAA GACCGAGCAC GCCATGAACT CCCTCATGGC CAGCATCGCG TGGGACGAGG CGCGCTTCGG CCTGAGCCTG GACCTGGACC GCTTCATGAT CGTCGCCACC AGCGACTTCA ACATGGGCGC CATGGAAAAC AAGGGCCTGA ACATCTTCAA CACGAAGTAC GTTCTGGCCA GCCAGGCCAC GGCGACGGAC GTGGACTTCG CCAACATCGA GTCCGTGGTG GGCCACGAGT ACTTCCACAA CTGGACAGGC AACCGCGTCA CCTGCCGCGA CTGGTTCCAG CTGTCGCTCA AGGAAGGCCT CACCGTCTTT CGCGACCAGG AATTCAGCAT GGACATGGCG GGCAGCCCGT CCGCGCGCGC CGTCAAGCGC ATCGAGGACG TGCGCGTGCT GCGCACCGTG CAGTTCCCCG AGGATGCGGG CCCCATGGCC CACCCCGTGC GGCCCGACAG CTATGTCGAG ATCAACAACT TCTACACCGT CACGATCTAT GAGAAGGGCT CCGAGGTCGT GCGCATGCAG CACAACCTGG TCGGGCGCGA CGGTTTTGCC AAGGGGCTGA AGCTGTACTT CGAGCGCCAC GACGGCCAGG CCGTGACCTG CGACGACTTC GCCCAGGCCA TGGCCGACGC CAACCCCGGC AGCCCGCTGG CCCAGCACCT GGAGCAATTC AAGCGCTGGT ACTCCCAGGC AGGCACGCCG CGCGTGAAGG CCGTTGGCCA GTACGACGCC GACGCACGCC GCTACACGCT CACGCTGTCC CAAAGCTGCC CCGCCACGCC GGGCCAGCCC GACAAGCTGC CCTTTGTGAT CCCCGTGACG CTGGGCCTGG TGGGCACCGA CGGCCGCGCC CTGGCACTGC AGCTGGACGG CGAAGCCGCG GCGGGCGCTC CGGAGCGCAC CGTGGTGCTG ACCGAGGGCG AGATGACGCT CACCTTCACA GGGGTGGACG TGCCGCCCGT GCCCTCGCTG CTGCGCTCCT TCAGCGCACC GGTGGTCCTG GACTGCGAAT ACAGCGACGC CGAGCTGCTC ACCCTGCTGG CGCACGACAG CGATGCCTTC AACCGCTGGG AAGCCGGCCA GCGCCTGATG CTGCGCATCG CTATCAATGC GATAGCTGAT GGCGCTTTAC TGGCGGGCGT AAACGGCCAG ATCGGCCAGA ACCTCCTGCC CCAGCCGCTG GTGCAGGCCA TGCGCGACGT ACTGCGCCAC CCCACGCTGG ACGCGGCCTT CAAGGAACTG GTGCTGACCC TGCCGTCCGA GGGCTACATC GCCGAGCAGC TGGACACGGT GGACCCGCAG CGCGTGCATG CCGTGCGCGA GGCGCTGCGC GAGCAGTTGG CCCTGGCCCT GCGTGACGAC TGGGTCTGGG CCTGGGAGGA GCACCACGCC ACGGGCGGCT ACCGCCCGGA TGCCGTGTCG GCGGGCCGCC GCGCCCTGGC CGGCCTGGCC CTGGGCATGC TGTGCCTGGC CGCGCGTTCC AGCGGCGACA TCGTGTGGCC CGGCAAGACC TACCAGCGCT TCAAGGACGC GGGCAATATG ACCGACCGCT TCAACGCGCT GGCGGCGCTG GTCGGCAGCG GCCACGAGTT GGCGGCGCCC GCGCTGGCGC GCTTTCATGC GCTGTTCAAG GACGACGCCT TGGTGCTGGA CAAGTGGTTC GCGCTGCAGG CGGGCGCGCC CGACCGCGGC GGCCAGGTGC TGCCGGCCGT GCGCCAGCTC ATGAAGCACC CGGACTTCCA CATCAAGAAC CCCAACCGCG CGCGCAGCGT GATCTTCAGT TACTGCAACG GCAACCCCGG GGGCTTTCAC CGCACCGACG CCGCCGGCTA CGTCTTCTGG GCCGACCGCG TGCTGGAGCT GGACAGTCTC AACCCCCAGG TGGCCGCCCG CCTGGCGCGT GCGCTGGACC GCTGGAAGAA GCTGGCCGAG CCTTACCGCA CGGCCGCCCA CGAAGCCATC TCCCGCGTGG CCGCCAAGTC CGACCTCTCC AACGACGTGC GCGAAGTCGT CACCCGCGCC CTTGCCGATT AA
|
Protein sequence | MMREGQAIAI HRADYAAPAY WIDTVDLTFD LDPAKTRVLS RLRVRRNADV PAQPLRLDGD ELNLARVMVN GAGTSFKLDG GQLVLENLPE GTGPFELEIF TTCAPAKNTQ LSGLYVSQGT FFTQCEAEGF RRITYFLDRP DVMASYSVLL RASKADYPVL LSNGNLVDSG ELEDGRHFAK WVDPHKKPSY LFALVAGKLV AREQKIKSRA GREHLLQVYV RPGDLEKTEH AMNSLMASIA WDEARFGLSL DLDRFMIVAT SDFNMGAMEN KGLNIFNTKY VLASQATATD VDFANIESVV GHEYFHNWTG NRVTCRDWFQ LSLKEGLTVF RDQEFSMDMA GSPSARAVKR IEDVRVLRTV QFPEDAGPMA HPVRPDSYVE INNFYTVTIY EKGSEVVRMQ HNLVGRDGFA KGLKLYFERH DGQAVTCDDF AQAMADANPG SPLAQHLEQF KRWYSQAGTP RVKAVGQYDA DARRYTLTLS QSCPATPGQP DKLPFVIPVT LGLVGTDGRA LALQLDGEAA AGAPERTVVL TEGEMTLTFT GVDVPPVPSL LRSFSAPVVL DCEYSDAELL TLLAHDSDAF NRWEAGQRLM LRIAINAIAD GALLAGVNGQ IGQNLLPQPL VQAMRDVLRH PTLDAAFKEL VLTLPSEGYI AEQLDTVDPQ RVHAVREALR EQLALALRDD WVWAWEEHHA TGGYRPDAVS AGRRALAGLA LGMLCLAARS SGDIVWPGKT YQRFKDAGNM TDRFNALAAL VGSGHELAAP ALARFHALFK DDALVLDKWF ALQAGAPDRG GQVLPAVRQL MKHPDFHIKN PNRARSVIFS YCNGNPGGFH RTDAAGYVFW ADRVLELDSL NPQVAARLAR ALDRWKKLAE PYRTAAHEAI SRVAAKSDLS NDVREVVTRA LAD
|
| |