Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acry_2104 |
Symbol | pepN |
ID | 5161715 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Acidiphilium cryptum JF-5 |
Kingdom | Bacteria |
Replicon accession | NC_009484 |
Strand | - |
Start bp | 2330552 |
End bp | 2333215 |
Gene Length | 2664 bp |
Protein Length | 887 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 640554026 |
Product | aminopeptidase N |
Protein accession | YP_001235222 |
Protein GI | 148261095 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0562697 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCGAAC GCCCCGTCCC GCTGCCCGAC GCCGCCCCGC GCCGCCTCGA CGAATACCGG CCGCCGGCCT TTCTGGTCGA TACGGTCGAT CTCATCTTCG ATCTCGATCC GGCGGCAACA CGGGTGCGCG CCAGCCTGAC GCTGCGGCGC AGCGCGGCGC ACGGGATCGC GGATGCGCCG CTGGAACTCG ATGGCGAGGG GCTCGACCTG CGATCGGTGC TGCTCGATGG CGAGGCGCTC GGCGCCAACC GCTACGCGCT CGGCGCGCAT GGCCTCGTCA TTCGCGACGT GCCGGACGAA TTCACCCTCG ATACCGAGGT GATCATCGCG CCGGAGCGCA ATTCCGAGCT TTCGGGACTG TATGTCTCGG GCGGGGATTT CTTCACCCAG TGCGAGGCCG AGGGGTTTCG CCGCATCACC TTCTTCCCCG ACCGGCCGGA TGTGATGGCC CGCTACACGG CGACGCTGAT CGCCGATCCG GCGCGCTGTC CGCTGCTGCT GTCCAACGGC AATCCGGTCG ATCGCGGCAC CCTGCCGGAC GGGCGGCACT GGGCGAAATG GGAGGATCCG CACCCCAAGC CGTCCTATCT GTTCGCGCTG GTCGCGGGCG ACCTGGTTTC GGTGCATGAC GCGTTCGTCA CCCGCTCGGG GCGGCAGGTG CAGCTCGGCA TCCATGTCCG CCGCGGCGAC GAGGACAAGG TCGACCACGC GATGGCCTGC CTCAAGCGGG CGATGCGCTG GGACGAGGAG GCGTTCGGCC TCGAATACGA CCTCGACATC TTCAACATCG CCGCCGTGTC CGACTTCAAC ATGGGGGCGA TGGAGAACAA GGGGCTCAAC ATCTTCAACA CCGCCCTCGT CCTCGCCCGG CCGGATACCG CGACGGATGC CGACTACCAG CGGATCGACC GGGTGATCGC GCATGAATAT TTCCACAACT GGACCGGCGA CCGGGTGACC TGCCGGGACT GGTTCCAGCT CTCGCTCAAG GAGGGACTGA CCGTCTTTCG CGACCAGGAA TACGGGGCTG CGACGACCGA TGCGGCGCTG TCGCGGATCG ACGACGTCAA GATGCTGCGC GCCCGGCAAT TCGCCGAGGA TGCCGGGCCG CTGGCGCACC CCGTCCGCCC CGCCGCCTAT CGCAAGATCG ACAATTTCTA CACCGCGACC GTCTATGAGA AGGGGGCGGA GGTGGTGCGG ATGATCCGCA CCATCATCGG GCATGAGGCG TTCCGCCGCG GCTTCGATCG CTACATCGCG CGCAACGACA ATTCGGCGGC GACGATCGAG GATTTCGTGG CCGCGATGCG CGAGGAATCG GGGTTCGATT TTTCCCAGTT CATGCGCTGG TACGAGCAGG CGGGCACGCC GCAGCTGCGA TTCGAGGGCG AATACGACGC GGCGGCGCGG TGCTACACGC TCACGCTGCG GCAGGTGACG GCGCCGACGC CGGGGCAGGC GCATAAGGAG CCGTTCCTGA TCCCGGTGGC GATGGGGCTG ATCGGGCCGG ATGGGGCGGA GCTTGAAGCT CGGCTGGAGG GCGAGGCGGA CGCGCATCGC GGCACTCGGG TGCTGCTGCT GCGCGCGGCC GAGCAGCGTT TCGTGTTCGA GGAGGTCGCG GCTCCGCCGG TGCCCTCGCT GTTGCGCGGG TTCTCGGCGC CGGTGAAGCT CTCGGGGCAC GATGCGGCCG GGCTCGCGCA CCTTGCCGCG CACGATACCG ACGGGTTCAG CCGCTGGGAG GCCGGGCAGG AATATGCGAC GCGCGCGCTG CTCGACGCGA TCGCCGCATT GCAGCGCGGC GAGGCGGTGG CGACCGATCC GGCGCTGGTG GCGGCGATGG CGGCGGCGCT CGACCTCGCG CAGGAGGCGC CGGCGCTCGC CGCGCGGATA TTGTCGCTGC CGGGCAAGGA CGTGTTGGCG GACCGGATGG CGGTGATCGA CCCGGATGCG ATTCACGAGG CGCTGGAGAC GACGCGGGCG GCGATCGGCC GGCAGTTGCG CGACCGCTTC GCCGCGCTGT GCGGGGCGGA GGACCCGTCG GCGCCGTTCA GCCTCGACCC GGCGAGCATG GGGCGGCGGG TGCTCGGCAA TGTCGCGCTG GCCTATCTGA TGGCGGCGGA CCCGGCGGCG GGGCTGGACG CGGCACAGCG CCGCTTCGAG ACCGCGCCGA CGATGACCGG GCGGCTCGGC GCGCTGGCGC TGCTCGCCGA TACCGCGACG CCGGCGCGCG ATGCGGCACT CGCGGCGTTT CACGCGCGCT GGCGCGGCGA TGCGCTGGTG GGCGACAAGT GGTTCCGCAT CCAGGCGATG GCGGCCGCCC CGGACACGCT GGACCGCGTC GAGGCGCTGA CCCGGCACGC CGATTTCGAC CTGCGCAACC CCAACCGCTT CCGCGCCCTG GTGCAGGCGT TCGGCGCGGG GAACCAGCGC TGGTTCCACG ATGCGTCGGG GCGGGGCTAT GCGCTGGTGG CGCGGATGAT CGGCGAGGTC GACCGGGTGA ACCGGCAGAT CGCGGCGCGC TGCATCGACG TGTTCGCGAG CTGGCAGCGC TTCGATCCGG CGCGGCAGGC GCTGATGCGC GGCGCGCTGG ACGCGCTGCT CGCCGATCCC GCTTTGTCGG CCAATGCGCG GGAGATGGCC GAGCGGGCGC GGGCGGGGTC GTGA
|
Protein sequence | MTERPVPLPD AAPRRLDEYR PPAFLVDTVD LIFDLDPAAT RVRASLTLRR SAAHGIADAP LELDGEGLDL RSVLLDGEAL GANRYALGAH GLVIRDVPDE FTLDTEVIIA PERNSELSGL YVSGGDFFTQ CEAEGFRRIT FFPDRPDVMA RYTATLIADP ARCPLLLSNG NPVDRGTLPD GRHWAKWEDP HPKPSYLFAL VAGDLVSVHD AFVTRSGRQV QLGIHVRRGD EDKVDHAMAC LKRAMRWDEE AFGLEYDLDI FNIAAVSDFN MGAMENKGLN IFNTALVLAR PDTATDADYQ RIDRVIAHEY FHNWTGDRVT CRDWFQLSLK EGLTVFRDQE YGAATTDAAL SRIDDVKMLR ARQFAEDAGP LAHPVRPAAY RKIDNFYTAT VYEKGAEVVR MIRTIIGHEA FRRGFDRYIA RNDNSAATIE DFVAAMREES GFDFSQFMRW YEQAGTPQLR FEGEYDAAAR CYTLTLRQVT APTPGQAHKE PFLIPVAMGL IGPDGAELEA RLEGEADAHR GTRVLLLRAA EQRFVFEEVA APPVPSLLRG FSAPVKLSGH DAAGLAHLAA HDTDGFSRWE AGQEYATRAL LDAIAALQRG EAVATDPALV AAMAAALDLA QEAPALAARI LSLPGKDVLA DRMAVIDPDA IHEALETTRA AIGRQLRDRF AALCGAEDPS APFSLDPASM GRRVLGNVAL AYLMAADPAA GLDAAQRRFE TAPTMTGRLG ALALLADTAT PARDAALAAF HARWRGDALV GDKWFRIQAM AAAPDTLDRV EALTRHADFD LRNPNRFRAL VQAFGAGNQR WFHDASGRGY ALVARMIGEV DRVNRQIAAR CIDVFASWQR FDPARQALMR GALDALLADP ALSANAREMA ERARAGS
|
| |