Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_04951 |
Symbol | pepP |
ID | 4778181 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 489138 |
End bp | 490415 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640085999 |
Product | putative aminopeptidase P |
Protein accession | YP_001016512 |
Protein GI | 124022205 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAAT TGGGGGGAGT GGCAGCGGTA ATTCCAGCGG CTTCGTTCGT GACGCATCAC GCCGATTGCG AGTATCCGTT TCGTCAAAAC AGTGACTTCT GGTATCTCAC GGGTCTGGAT GAGCCTGATG CGGTGGCTTT GCTTTTGCCC TACCGGCCCG AAGGAGAGCG TTTTGTGCTG TTTGTGCAAC CCAAAGAGCC AACGACCGAG GTATGGAATG GCTTTCGTTG GGGTGTCGAG GGAGCAGTGG CGCAGTTCGG TGCTGATCGT GCTCATCCAA TAGCTGAATT GCCTCAGCTG TTGGGCGATT ACCTCAAAGA CGCCGAAGCG ATTGGCTTCA GGGTGGGGAA GCACCCAAAG GTGGAGTCGC TGGTTCTTGA GGCCTGGGCT AAGCAACTCG ATCAGGCCCC ACGCAGCGGC GTATCGGCGC TTGGTCTTGT CGCCCCTTGC CCTTTCATCC ATCAGTTGAG GTTGCGCAAG GGAGCAGAGG AAATCGAGCG CATGCGTGAG ACGGCTCGGA TTTCTGCGCA AGCTCATCAG CTGGTTCGGA ATACGGCGCG TCCGGGCATG AATGAGCGAC AGTTGCAGGC TGTAATCGAG CAGCATTTTC TTGAGCAAGG CGCGCGCGGC CCTGCCTATG GATCGATCGT GGCCGGCGGC GATAACGCCT GCGTGTTGCA TTACACCGCT AACAATGCTC CGCTTGTTGA TGGCGAGCTG GTGCTGATCG ATGCAGGCTG TTCGCTCGTT GATTACTACA ACGGAGATAT CACTAGGACA TTCCCTGTTA ACGGACGCTT TAGCGCAGAG CAACGCGCTC TGTATGAGCT GGTTTTAGCA GCGCAGCAGG CAGCGATCGC TGAGGTAAGA CCGGGTGGCA CTGCTGATCG CGTGCATGAT CTTGCGGTGC GGGTGCTCGT TGAAGGGCTT GTGGAGTTGG GTTTATTGCT TGGCAGTGTT GATGGACTGA TTGAACAAGG GGCCTACCGA CACCTTTATA TGCATCGCAC TGGCCATTGG CTGGGCCTTG ATGTTCATGA TGTAGGCGCC TATCGCCTTG GTGAGCATCC GGTGGATCTT GAGCCCGGCA TGGTGTTAAC AGTGGAACCG GGTTTGTATG TCAGTGACCG TTTGCCAGTG CCGGATGGGC AGCCTGCGAT TGCTGATCGT TGGAAGGGGA TCGGCATTCG TATTGAAGAT GATGTGTTGG TAAGTGAGCA GGGAAATGAG GTGCTCACAT CGTTAGCGGA GAAAAGTGTT GAAGCAATGC AGCGATAA
|
Protein sequence | MAQLGGVAAV IPAASFVTHH ADCEYPFRQN SDFWYLTGLD EPDAVALLLP YRPEGERFVL FVQPKEPTTE VWNGFRWGVE GAVAQFGADR AHPIAELPQL LGDYLKDAEA IGFRVGKHPK VESLVLEAWA KQLDQAPRSG VSALGLVAPC PFIHQLRLRK GAEEIERMRE TARISAQAHQ LVRNTARPGM NERQLQAVIE QHFLEQGARG PAYGSIVAGG DNACVLHYTA NNAPLVDGEL VLIDAGCSLV DYYNGDITRT FPVNGRFSAE QRALYELVLA AQQAAIAEVR PGGTADRVHD LAVRVLVEGL VELGLLLGSV DGLIEQGAYR HLYMHRTGHW LGLDVHDVGA YRLGEHPVDL EPGMVLTVEP GLYVSDRLPV PDGQPAIADR WKGIGIRIED DVLVSEQGNE VLTSLAEKSV EAMQR
|
| |