Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_15611 |
Symbol | pepP |
ID | 5730012 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 1392746 |
End bp | 1394065 |
Gene Length | 1320 bp |
Protein Length | 439 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641285939 |
Product | putative aminopeptidase P |
Protein accession | YP_001551446 |
Protein GI | 159904102 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGTAACC AGATTATTTT CCAAGAGCGG AGAACTCGCT TTCTTGACCA ATTAGGGGGT CATGCAGCAA TTATTCCAGC TGCTTCTAGT GTTATTCATC ATGCTGATTG TGAATATCCA TTTAGGCAGG ACAGTGATTT TTGGTATTTA ACTGGTTTTG ATGAGCCTGA TGCAGTGGCT TTGTTCTTGC CTCATCGACC GCATGGAGAA AGGTATGTGT TGTTTGTTTT ACCTAAAGAC TCATCAATTG AAATTTGGAA TGGTTTTCGT TGGGGAGTAG AAGGGGCTGT AAATCATTTT CATGCCGATA TAGCCCACCC TTTACAGCAA TTACCTAATC TTCTTGAGGA ATATATTGTT GGTGCAGAAG GGCTAGCTTT TCGCATTGGT AAATACGAAA AAATAGAACC AATAGTTCTT AAAGTTTGGT CTGAACAAAT AAATAAATCT TCTCGAAGTG GATATGCTCC ATTGTCATTA AAACCTCCTT GTTCGTTACT GCACAAACTA AGACTTCGTA AAGACTCTTG GGAAGTAGAT CGATTACGTG AGGCTGCTTA TATTTCTGCA GGAGCGCATG AATTAGCGAG AAATGTTACA CAGCCTGGAA TGAATGAAAG AGCAATCCAA GCTGAAATTG AAAAATTCTT TTTAGAAAAA GGAGCTAGAG GACCTGCTTA TGGATCAATA GTGGCTAGTG GAGATAATGC CTGCATACTT CATTACACTG CTAATTGTGC TCCTCTTAGT GATGGAAAGT TGTTACTTAT AGATGCTGGA TGTTCTTTGG TTGATTATTA CAATGGTGAT ATTACTAGAA CTTTTCCAAT TGGTGGAAAG TTTACAAGTG AACAAAGAGC TCTATATGAG ATTGTTTTAA TTGCTCAGAA AGCTGCTATT GAATCGGTAG TCTCTGGGAA TAATACTGAG GAAGTACATC TTACTGCTGT AAGGGTTTTG ATTGAAGGCT TGATTACCCT TGGACTACTT AAAGGAAAGG TTGATTCTCT AATTGAGCAA GGTGCTTACA GGCACCTTTA TATGCATAGG ACAGGCCATT GGCTTGGTTT GGATGTGCAT GATGTAGGTT CATATCGACT TGGTGATTAT CAGGTCGCGT TAGAACCTGG GATGGTCTTA ACAGTGGAGC CAGGCCTCTA TATAAGTGAT CGACTGCCTA TCCCTGAAGG GCAACCTTCT ATTCATGAAA GATGGAAGGG AATAGGTATA CGAATTGAAG ATGATGTATT GGTAACTGAA TTTGAACCAG AAGTTTTGAG CTCTAATGCT TTGAAGTCTA TAGAAGCAAT GGAAAGATAA
|
Protein sequence | MSNQIIFQER RTRFLDQLGG HAAIIPAASS VIHHADCEYP FRQDSDFWYL TGFDEPDAVA LFLPHRPHGE RYVLFVLPKD SSIEIWNGFR WGVEGAVNHF HADIAHPLQQ LPNLLEEYIV GAEGLAFRIG KYEKIEPIVL KVWSEQINKS SRSGYAPLSL KPPCSLLHKL RLRKDSWEVD RLREAAYISA GAHELARNVT QPGMNERAIQ AEIEKFFLEK GARGPAYGSI VASGDNACIL HYTANCAPLS DGKLLLIDAG CSLVDYYNGD ITRTFPIGGK FTSEQRALYE IVLIAQKAAI ESVVSGNNTE EVHLTAVRVL IEGLITLGLL KGKVDSLIEQ GAYRHLYMHR TGHWLGLDVH DVGSYRLGDY QVALEPGMVL TVEPGLYISD RLPIPEGQPS IHERWKGIGI RIEDDVLVTE FEPEVLSSNA LKSIEAMER
|
| |