Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pnuc_0668 |
Symbol | pepN |
ID | 5052364 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Polynucleobacter necessarius subsp. asymbioticus QLW-P1DMWA-1 |
Kingdom | Bacteria |
Replicon accession | NC_009379 |
Strand | + |
Start bp | 659278 |
End bp | 661887 |
Gene Length | 2610 bp |
Protein Length | 869 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640470822 |
Product | aminopeptidase N |
Protein accession | YP_001155450 |
Protein GI | 145588853 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACTG ATCTGCCACA GAGCTTTCGT CGGCTCGAAT ATCGTCCCCC TGTATATACC TTTGACCAAG TAGAACTGGA TATTGCTCTA GATCCGGCGC GAACGATAGT TAAAAGTCGT ATCGATGTAT TGCCAGGCAA AGGCTTTGTC GCTGGTGCGC CATTGGTATT GGTAGGTCAA GATTTAGAGT TTGTGAGTTT GCGAATCAAT GGTGATGCAC ATCGCCATTT TGAACTCACT CCTGAAACCC TCTCGATCCA TTCCTTACCA AATGAAGGTA AAGACCCATT CATCTTGGAA ATTATTTCTA TTTGTGTTCC TGAGAAAAAT ACTACCTTAA TGGGTTTGTA TGTTTCTAAT GGAAACTTTT TTACGCAATG TGAGGCAGAA GGATTCAGAA AGATTACCTA CTTTTTAGAT CGTCCAGATG TCATGGCGCG TTATCGCGTG ACTCTTCAGG CGCGAGAAGC GCAGTGCCCA GTTCTTCTAT CGAATGGTAA TTTGTTGAAA ACAGAAAAGC TGCCGAACGG TTGGCATAGC GCTACTTGGG AAGACCCATT TCCAAAGCCT TCTTACTTAT TCGCATTAGT CGCCGGTAAA TTAGAGTGTA TAGAAGAAAC CATTACCACT AGTAGCGGCG CTAAGAAGCT ACTGCAAATT TGGGTTGAGC CTCATGACTT GAAGAAAACT CGTCATGCGA TGGATTCTTT AATTGCTTCG ATTCACTGGG ATGAGAAGCG CTTTGGTTTG GAGTTGGATT TAGAGCGTTT CATGATTGTG GCCGTGAGTG ATTTCAATAT GGGTGCAATG GAGAACAAGG GATTAAATGT TTTTAATACC AAGTTTGTTC TTGCTCAACC TGAGACTGCA ACTGATGCTG ATTTTGCCAA TATTGAAAGT GTCGTTGCCC ATGAATACTT CCATAACTGG ACGGGTAATC GAGTCACTTG CCAAGACTGG TTTCAACTAT CCCTCAAAGA AGGGTTAACC GTATTTAGGG ACCAAGAGTT TTCTGCTGAT CAAATGGGCA GCGAGTCTGG ACGGGCGGTT AAGCGCATCG AAGATGTTAG ATTGCTGCGT CAAATGCAAT TTCCTGAGGA TGCGGGTCCA ATGGCCCATC CGATCCGCCC GGATGAATAT CAAGAAATTA ATAACTTTTA TACCGTAACG GTTTATGAAA AGGGTTCCGA GGTTGTTCGT ATGTATCAAA CCCTCCTGGG GCGGGAAGGT TTCCGCAAGG GTATGGACCT ATATTTCAAA CGCCACGATG GTCAAGCGGT AACCTGCGAT GACTTTCTTG CAGCAATGGC CGATGCCAAT CATCGAGACC TTACTCAATT TAAAAATTGG TATAGCCAAG CTGGCACCCC GCGTGTGAAG GTGCAAGAGC ATTACGACGC TACTAAAAAA CAATATCAAC TCACATTAAG CCAATCCTGT GCTCCAAGCC CAGGCCAGCC TGATAAAAAA CCTTTTCATA TTCCATTGAA GATACGTTTG ATTACTTCTG GCAATGAGCA AGTTGAGAAA TTGCTTGAAT TAACAGAGCC TGAGCAGTCT TGGACCTTTG ATAATATTTC TGAACGGCCA GTGCTTTCGA TTAACCGTGA TTTTTCTGCC CCCATTCAGC TCGAATTTGA CCAGAGTGAA GCAGATCTAT TGACCTTGTT CTCGGGCGAT GATGATCCTT TTAGCCGCTG GGAAGCGGGT CAAAAGTTGG CCATGCAAAT GATATTGGGT AACCGCTTGC CCGATGCAAA ACTCATTGAG GCATATCGCA CTCTCTTGAT GGATCCCCAG TTAGATCCTG CCTTTAAAGA GTTGGCATTA ACCTTGCCGG CAGAAACCTA TTTGTATGAG CAATGTAAAA GCGTAGACCC ACAACAAATC TTTACTGCTC GCCGCGCTTT TCGTAAGGAG ATCGCTAAGC AATTGCAGTT GGAGTGGGCT GCCCTGTATC AGCAAAACCA AACACCAGGA CCGTTTAAGC CAGATGCAGT CAGTGCCGGA AAGCGTGCAC TGAAAAATCT CGCCTTGAGC ATGCTTTTGG AGGCGGATCC TAAAACTTGG GCGCCAATGG CAGTGAACCA GTATCAAACT GCTGACAATA TGACAGATCG CTATGCAGCT TTGTCAGCAC TCGTGATTCA TGGTGCAAAA GAAGCAAACG AATGTCTTGC AGACTTCTTT AATCGCTTTG CGAATGATGC TTTGGTAATC GATAAATGGT TTGCATTGCA ATCTAGTCGG CCACCAGTAG AGGGCTTGGA ATTGACTTTA AGTGATGTAA AGCATTTACG CGAACATCCA GCCTTTAAGT TAAATAATCC AAATCGTGCC CGTAGTGTTA TTCATGCTTT CTGCGCCAAT AATCCAGCGA GCTTTCATCA GCCCGACGGC AGTGGTTATG CGTTTTGGGT GGAGAGCGTA TTAGCCTTAG ATGCGATCAA CCCGCAGGTA GCCGCTCGTC TTGCCAGAGC TTTGGATCGC TGGCGTTTAT TTGCTGAGCC CTACCAAAGC AAGATGAAAG CCGCTCTAGA GCAGGTTGCA GCTTGTCAAA CCTTGTCTCC CGACGTCAGA GAGGTGATTG GCAAGGCTTT GGGTAATTGA
|
Protein sequence | MKTDLPQSFR RLEYRPPVYT FDQVELDIAL DPARTIVKSR IDVLPGKGFV AGAPLVLVGQ DLEFVSLRIN GDAHRHFELT PETLSIHSLP NEGKDPFILE IISICVPEKN TTLMGLYVSN GNFFTQCEAE GFRKITYFLD RPDVMARYRV TLQAREAQCP VLLSNGNLLK TEKLPNGWHS ATWEDPFPKP SYLFALVAGK LECIEETITT SSGAKKLLQI WVEPHDLKKT RHAMDSLIAS IHWDEKRFGL ELDLERFMIV AVSDFNMGAM ENKGLNVFNT KFVLAQPETA TDADFANIES VVAHEYFHNW TGNRVTCQDW FQLSLKEGLT VFRDQEFSAD QMGSESGRAV KRIEDVRLLR QMQFPEDAGP MAHPIRPDEY QEINNFYTVT VYEKGSEVVR MYQTLLGREG FRKGMDLYFK RHDGQAVTCD DFLAAMADAN HRDLTQFKNW YSQAGTPRVK VQEHYDATKK QYQLTLSQSC APSPGQPDKK PFHIPLKIRL ITSGNEQVEK LLELTEPEQS WTFDNISERP VLSINRDFSA PIQLEFDQSE ADLLTLFSGD DDPFSRWEAG QKLAMQMILG NRLPDAKLIE AYRTLLMDPQ LDPAFKELAL TLPAETYLYE QCKSVDPQQI FTARRAFRKE IAKQLQLEWA ALYQQNQTPG PFKPDAVSAG KRALKNLALS MLLEADPKTW APMAVNQYQT ADNMTDRYAA LSALVIHGAK EANECLADFF NRFANDALVI DKWFALQSSR PPVEGLELTL SDVKHLREHP AFKLNNPNRA RSVIHAFCAN NPASFHQPDG SGYAFWVESV LALDAINPQV AARLARALDR WRLFAEPYQS KMKAALEQVA ACQTLSPDVR EVIGKALGN
|
| |