Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_10001 |
Symbol | pepN |
ID | 4777945 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 912412 |
End bp | 915030 |
Gene Length | 2619 bp |
Protein Length | 872 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640086508 |
Product | aminopeptidase N |
Protein accession | YP_001017014 |
Protein GI | 124022707 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0308] Aminopeptidase N |
TIGRFAM ID | [TIGR02414] aminopeptidase N, Escherichia coli type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.458523 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACAACAG CAACCACCCG ATTAGCGGAT TACAGGCCAT ATCCCTTCCA GATTCCTAAC ATTGAACTGG ATGTTGTTGT CGAAGAACAG CACATCGTGA TCTCCAGCTC CATGCAGATC GAGCCTGCCT TGACAACCAA AGTTCCTTTA GTCCTTCAGG GGTTAGACCT GGAGTTGGAC TCCATTGTTA TAAACGGCAG TTCTGTACCA ACGGATGCGT ACAGCCTATC GAGCCGTGAG TTAGTTCTCC ATCAGCCACC AATTCATCCT TTTGAGTTGA AGATCATCTG CCAAATTGAT CCCTTCAGCA ACACCTCACT AGAAGGGCTT TATGCCAGCG AGAGCATGCT CACAAGCCAA TGTGAGGCAG AAGGCTTCCG TCGCATCTGC TTCCATCCTG ATCGTCCTGA TGTGCTCAGT CGCTATCGAG TACGAATTGA GGCGGATCGG ACGCGCTATC CAGTGTTGCT CTCTAATGGA AATCTTGTGA GCAAAGGGCC CTTACCCAAA GATCCAATGC GACATGAGGC AATTTGGGAC GATCCCTATC CCAAGCCTGC CTATCTCTTT GCTTTGGTTG CAGGGGCACT TCAGGAAGTT CAGGCTCATT TCACAACCAC ATATGGACGC TCAGTTCTAT TGCGTTTGCA TGTTGAGAAC GGAGATGAAC CCTACACCTC TCACGCCTTG GATTCTCTAA AAAAAGCTAT GGCCTGGGAT GAAAAGGTCT ATGGGCTGGA GTACGACCTG GATGAATACA ACATTGTGGC AGTGCGTCAC TTCAACATGG GCGCAATGGA AAATAAGAGC TTAAATATAT TCAACTCAAA ACTTGTCTTG GCTGATTCCG AATGTGCCAC AGATAGTGAA CTTGAACGAA TCGAAAGTGT TGTTGCTCAT GAATATTTTC ACAACTGGAC TGGCAACCGT ATCACCTGCA GGGACTGGTT TCAGCTCTCT CTAAAAGAAG GTCTCACTGT ATTTCGGGAT CAGAGCTTCA CAGCTGATCT GCATTCAAAA GCAGTGAAAC GCATCGAGGA TGTATCAATG CTGCGAAACA CCCAATTTCG TGAAGATTCA GGCCCTACAT CTCATCCAGT TAAACCTAGT GAATACAAGG CAATTGATAA TTTCTACACC ACAACTATTT ATGAAAAAGG TGCAGAGTTA ATCCGCATGC TTCACACCAT GCTTGGGCAG CAACGCTTCA TGGCAGGAAT GGCGTTATAT GTGCAGCGTT TTGATGGCAC TGCTGCTACC ACAGACGATT TCATCGACTC CATTGCTGAA GGAGCCTGCG CAAATGGGGA ACAACTTGGT TTTGATCTTG ATCAATTTCA GCGTTGGTAT CACCAAAGCG GAACTCCTCA AGTATGCGTC AAACGCCACT GGGATTCTCA AGTCGGAACA CTCACACTTG AGGTCAGCCA GTTCACACCA CCAACTCCTG GTCAACCCTC AAAAGAACCT CTAGTCATTC CTATGGCCCT CGCCGTGATC GGCCCAAATG GACGAGTCGG CGAAGAGAAA CTGGTTATTT TGGACCAAGA CACGCAGAAC GTCTCACTAA GGGATCTACC AAGACAAGCA AAGCCTCCAG CTCTATCGAT ATTCCGTCGT TTTTCAGCTC CGATCACGTT GCAGATGGAC GTCTCAGTCG ACGAATCATT GCAGCTACTT GCTCTTGATG ACGATCCCGT TGCTCGATGG GAAGCAGGCC AGAGATTATG GAGAAAGATC CTTTTAGCAA GGGCTCGCAA ACAAACTGAC AATCCACTTG AAGAACGTTT AGCTCTAGCG TTAAATCAAC TGATCACCAG TGGAGGGGAG AGTGATCCCT CTTTTCTGGC GATGTTATTT GGCATGCCAG GACTAGCTGA ACTGGAGGCC GCCCAGGATG TCGCCGATCC ATTGATGCTC TATCAGGTGT ATCAATCACT CAAGTCATGG TTGGGAGTCA AACTCTCAGA TCCGCTCCAG AATCTGCTTG AACGTAGTCG ATTGAATTGG GGGGCTCAAT GGCCAGCTGG ACAAGGTGAA CGCATGCTCA CAGGACTGAC CTGGAGCTGG TTAGCTGCTG CTGGTGATTG CGAAGTTCGC AAGGAAGCAG TTGAAGCAGT CAACGGACCC TCGATGTCTT TGGCTCGAGC GGCCCTTCGG GCACTACAAC CCGTTGAATG TCCTGAGCGA GACGAAGCTC TCAAAAGCTT TTACGAGCGC TGGCAAGATC GACCAGTGAT CCTAGATACC TGGTTTGCTC TGGAGGCATC CACTCCACGG TCAGATGGGC TTGAACGAAT CAAACAGTTA CTCGATCATC CCCGCTTTGA CCCAATGGCA CCCAACGCGA TAAGGGCTGT CCTTGGTGGT CTCGCTTCAA ATCCTCCTGT ATTTCACGCT ATTGATGGAA GTGGCTACAA CTTCATGGCT GATCAGCTCA TCGCCATTGA TCAACGCAAC CCAATCACTG CTTCACGCAT GGTGAAAGTG TTCAGTCGTT GGCAAACTTA TGCTCCCTCA AGGAAAGAGG CGATGCACCG GGCCATCGAT CAACTGGCCA GTGCAGAGCT GTCTGCAAAC ACCAGAGAAG TGGTCACGCT CATGCAGCCA GAACAGTAG
|
Protein sequence | MTTATTRLAD YRPYPFQIPN IELDVVVEEQ HIVISSSMQI EPALTTKVPL VLQGLDLELD SIVINGSSVP TDAYSLSSRE LVLHQPPIHP FELKIICQID PFSNTSLEGL YASESMLTSQ CEAEGFRRIC FHPDRPDVLS RYRVRIEADR TRYPVLLSNG NLVSKGPLPK DPMRHEAIWD DPYPKPAYLF ALVAGALQEV QAHFTTTYGR SVLLRLHVEN GDEPYTSHAL DSLKKAMAWD EKVYGLEYDL DEYNIVAVRH FNMGAMENKS LNIFNSKLVL ADSECATDSE LERIESVVAH EYFHNWTGNR ITCRDWFQLS LKEGLTVFRD QSFTADLHSK AVKRIEDVSM LRNTQFREDS GPTSHPVKPS EYKAIDNFYT TTIYEKGAEL IRMLHTMLGQ QRFMAGMALY VQRFDGTAAT TDDFIDSIAE GACANGEQLG FDLDQFQRWY HQSGTPQVCV KRHWDSQVGT LTLEVSQFTP PTPGQPSKEP LVIPMALAVI GPNGRVGEEK LVILDQDTQN VSLRDLPRQA KPPALSIFRR FSAPITLQMD VSVDESLQLL ALDDDPVARW EAGQRLWRKI LLARARKQTD NPLEERLALA LNQLITSGGE SDPSFLAMLF GMPGLAELEA AQDVADPLML YQVYQSLKSW LGVKLSDPLQ NLLERSRLNW GAQWPAGQGE RMLTGLTWSW LAAAGDCEVR KEAVEAVNGP SMSLARAALR ALQPVECPER DEALKSFYER WQDRPVILDT WFALEASTPR SDGLERIKQL LDHPRFDPMA PNAIRAVLGG LASNPPVFHA IDGSGYNFMA DQLIAIDQRN PITASRMVKV FSRWQTYAPS RKEAMHRAID QLASAELSAN TREVVTLMQP EQ
|
| |