Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9301_16321 |
Symbol | pepP |
ID | 4911302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 1364524 |
End bp | 1365849 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640161229 |
Product | putative aminopeptidase P |
Protein accession | YP_001091856 |
Protein GI | 126696970 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAC CTGAAAATAA AGTTTTTGAA GAAAGAAGAG AAATCTTCCT AAATAAATTA AATGGAAAAG CTGCTATTAT CCCTGGTGCT AGGCTTGTAA AGCATCATGC TGATTGCGAA TATCCTTTTA GACAAGATAG TAATTTTTGG TATTTAACCG GTTTCGATGA GCCAGATGCA ATTGCTCTTT TCTTATCGCA TAAGCCCAAG GGAGAGAGAT TTATTATGTT TGTTGCTCCT AAAGATGTTA TAAGCGAAGT CTGGCATGGC TTTAGATGGG GTTTAGAAGG TGCTGAAAAA GAGTTTAACG CTGATAAGGC TCACTCAATA AACGAATTAA AAGATTTACT CCCATCCTAT ATAAATGGTT CTGATGAACT CGTTTTTTCT ATTGGTAAGC ATCCATCAGT TGAGAAAATA ATATTGGAAA TTTTTTCACA ACAACTTGAA AACCGCTCAA GATTAGGCAT TGGTGGAAAT TCTATAAAAT CTCCAGAAAT TTTTTTAAAT GAGATGAGAT TAATTAAAAG TGAATTTGAA ATAAAAAGAA TGAGAGAGGC TATTCAAATC TCAGCAGAAG CTCATGAATT AGTTAGAGAA TCAATCTCAT CAAAGAAAAA TGAAAGACAA ATTCAGGGTC TTCTAGAGGG ATTTTTTCTG GAAAAAGGGG CAAGAGGACC AGCTTATAAC TCTATTGTTG CATCAGGAGA TAACGCTTGT ATTTTACATT ACACTTCAAA TAACTCGCCC TTGAAGAAGG AAGATTTATT GTTAGTGGAT GCTGGTTGCT CACTAATTGA TTATTACAAT GGAGACATAA CAAGAACTAT TCCAATTGGT GGTAAATTTT CTAATGAGCA AAAAGTTATC TATGAAATCG TATTAAGTGC GCAGAAAAAT GCAATTAAAA GTGCTGTAAA AGGATCCAAT TCTAGTCTTG TTCATAATGT TGCTTTAACA ATTCTTATAG AAGGATTAAA AGAAATTGGT TTATTGTCAG GCAGTACTGA GGAGATAATT GAGAATCAAT CTTATAAACA TCTTTACATG CATAGAACTG GACATTGGCT TGGTTTAGAT GTTCATGATG TTGGAGCATA CAGAATGGGG GACTATGAAG TGCCATTACA GAATGGAATG ATTCTTACGG TAGAACCTGG GATCTACATA AGTGATAGGA TCCCAGTACC TGAGGGACAA CCCCCTATAG ATGAGAAATG GAAAGGCATA GGGATAAGAA TTGAAGACGA TGTCCTTGTC AAAGATGAAA ACCCAGAAGT TTTAAGTATT GCTGCACTAA AAGAAATTTC TGATTTAGAA TTTTGA
|
Protein sequence | MFKPENKVFE ERREIFLNKL NGKAAIIPGA RLVKHHADCE YPFRQDSNFW YLTGFDEPDA IALFLSHKPK GERFIMFVAP KDVISEVWHG FRWGLEGAEK EFNADKAHSI NELKDLLPSY INGSDELVFS IGKHPSVEKI ILEIFSQQLE NRSRLGIGGN SIKSPEIFLN EMRLIKSEFE IKRMREAIQI SAEAHELVRE SISSKKNERQ IQGLLEGFFL EKGARGPAYN SIVASGDNAC ILHYTSNNSP LKKEDLLLVD AGCSLIDYYN GDITRTIPIG GKFSNEQKVI YEIVLSAQKN AIKSAVKGSN SSLVHNVALT ILIEGLKEIG LLSGSTEEII ENQSYKHLYM HRTGHWLGLD VHDVGAYRMG DYEVPLQNGM ILTVEPGIYI SDRIPVPEGQ PPIDEKWKGI GIRIEDDVLV KDENPEVLSI AALKEISDLE F
|
| |