Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9515_16211 |
Symbol | pepP |
ID | 4720495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9515 |
Kingdom | Bacteria |
Replicon accession | NC_008817 |
Strand | - |
Start bp | 1416011 |
End bp | 1417336 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640081313 |
Product | putative aminopeptidase P |
Protein accession | YP_001011935 |
Protein GI | 123966854 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.878711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAA CTAATACAAA AATTTTTGAA GAAAGAAGAG AAATTTTCCT AAAAAAATTA AATGGTAAAG CTGCCATCAT TTCAAGCTCA AGTTTAGTTA ATCATCATGC TGATTGTGAA TACCCTTTTA GACAAGATAG TAATTTTTGG TATCTTACTG GATTTGATGA GCCTGATTCA ATTGCTCTTT TCTTGTCTAA TAAGCCAAAA GGTGAAAGGT ACATCTTGTT TGTTGCCCCA AAAGACATTA TTAGCGAGGT CTGGCATGGC TTTAGATGGG GTGTAGAGGG CGCTGAAAGA GAATTCAAGG CAGATAAAGC CCACTCAATT AGTGATTTTA AGCGTTTACT TCCAGTTTAT ATTAGTGATT CAGAAGACAT CGTATATTCA CAAAACAAAC ATTCCAAATT TGAAAAAATA GTTTTGGAAA TATTCTCAAA ACAAATTGAA GCTCGTTCAA AAGAAGGAAA AGAAGAAATT AATATAGAAT CCCCAGAAAT TTATCTCAAT GAGATGCGCT TAATTAAAAG TGATTTTGAG ATCAGTAGGA TGAGAGAGGC AACTCAAATT TCTGCCGAAG CTCATGAATT AGTTAGGGAA TCTATTTCAT TAAAAAAAAA TGAAAGACAA ATTCAGGGAT TAATAGAAGG ATTCTTTTTA GAAAAAGGTG CAAGAGGACC AGCTTATAAC TCAATAGTTG CTTCAGGCGA TAATGCCTGC ATTTTGCATT ACACATTAAA TAACTCTGAC TTAAATAAAG GAGATCTATT ATTGGTGGAC GCAGGATGTT CATTAATGGA TTATTACAAT GGGGACATAA CAAGAACTAT TCCGATAGGA GGGAAGTTTT CTAAAGAGCA GAAAATTATA TATGAAATTG TTTTAGAAGC ACAAAAAAAC GCAATTAAGC ATTCTGTAAA AGGTTCTAAT ACTACTAATG TTCATAATGT TGCTTTGAGA ATTTTGGTAG ATGGATTAAA GGAAATTGGA CTGTTGAGAG GAGATACTGA TGGAATAATT GAAAACGGAT CTTATAAACA TCTTTATATG CATAGAACTG GACATTGGCT TGGTTTAGAC GTTCATGATG TTGGGGCTTA CAGGATGGGA GAATATGATG TTCCATTACA GAATGGCATG ATACTTACTG TTGAACCGGG TATCTATATA AGTGATAGGA TCCCAGTCCC CGAAGGACAA CCTAGTATTG ATGAAAAATG GAAAGGTATT GGAATAAGAA TAGAAGATGA CATTCTTGTA AAAGAAAAAG AACCAGAAAT TCTTAGCATC GCTGCGCTGA AAGAAATTTC TGATTTAGAG TATTGA
|
Protein sequence | MFKTNTKIFE ERREIFLKKL NGKAAIISSS SLVNHHADCE YPFRQDSNFW YLTGFDEPDS IALFLSNKPK GERYILFVAP KDIISEVWHG FRWGVEGAER EFKADKAHSI SDFKRLLPVY ISDSEDIVYS QNKHSKFEKI VLEIFSKQIE ARSKEGKEEI NIESPEIYLN EMRLIKSDFE ISRMREATQI SAEAHELVRE SISLKKNERQ IQGLIEGFFL EKGARGPAYN SIVASGDNAC ILHYTLNNSD LNKGDLLLVD AGCSLMDYYN GDITRTIPIG GKFSKEQKII YEIVLEAQKN AIKHSVKGSN TTNVHNVALR ILVDGLKEIG LLRGDTDGII ENGSYKHLYM HRTGHWLGLD VHDVGAYRMG EYDVPLQNGM ILTVEPGIYI SDRIPVPEGQ PSIDEKWKGI GIRIEDDILV KEKEPEILSI AALKEISDLE Y
|
| |