Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_16441 |
Symbol | pepP |
ID | 4718374 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1388760 |
End bp | 1390085 |
Gene Length | 1326 bp |
Protein Length | 441 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640079370 |
Product | putative aminopeptidase P |
Protein accession | YP_001010034 |
Protein GI | 123969176 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.689084 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTAAAC CTGAAAATAA AGTTTTTGAA GAAAGAAGAG AAGTCTTCCT CAATAAATTA AATGGAAAAG CTGCTATTAT CCCCGGTGCT AATCTCGTAA AGCATCATGC TGATTGTGAA TATCCTTTTA GACAAGATAG TAATTTTTGG TATTTAACCG GTTTTGATGA GCCGGATGCA ATCGCCCTTT TCTTATCGCA TAAGCCAAAG GGAGAGAGAT TTATTATGTT TGTTGCTCCT AAAGATGTTA TCAGCGAAGT CTGGCATGGC TTTAGATGGG GTTTAGAAGG CGCTGAAAAA GAGTTTAATG CGGATAAGGC TCACTCAGTA AATGAATTCA GAGATTTACT CCCAGCTTAT ATAAATGGTT CCGATGAACT TGTTTTTTCT ATTGGTAAGC ATCCATCAGT TGAGAAAATA ATACTTGAAA TTTTTTCACA ACAACTTGAA AATCGCTCAA GATTAGGCAT TGGTGGAAAT TCTATAAAAT CTCCAGAAAT TTACTTAAAT GAGATGAGAT TAATTAAAAG TGAATTTGAA ATTAAGAGAA TGAGAAAGGC TATACAAATC TCAGCCGAAG CTCATGAACT AGTTAGAGAA TCTATCTCAT CAAAGAAAAA TGAAAGACAA ATTCAGGGTC TACTAGAGGG ATTCTTTCTG GAAAAAGGGG CGAGAGGTCC AGCTTATAAC TCAATTGTTG CATCAGGAGA TAATGCGTGT ATTTTGCATT ACACTTCAAA TAATTCACCA CTGAAGAAGG AAGATTTATT ATTGGTTGAT GCTGGCTGCT CACTAATTGA TTATTACAAT GGAGACATAA CAAGAACTAT ACCAATAGGT GGCAAATTTT CTAATGAGCA AAAAGTTATC TATGAAATTG TATTGAGAGC GCAGAAAAAT GCAATTAAAA GTGCTGTAAA GGGATCGAAT TCTAGTGCTG TTCATAATGT CGCTTTGACA ATTCTTATAG AAGGATTAAA AGAAATTGGG TTATTGTCGG GCAGTACTGA GGAGATAATT GATAATCAAT CTTATAAGCA TCTTTACATG CATAGAACTG GACATTGGCT AGGCTTAGAT GTTCATGATG TTGGAGCATA CAGAATGGGA GACTATGAAG TGCCATTACA GAATGGAATG ATTCTTACGG TTGAACCTGG GATCTACATA AGTGATAGGA TCCCAGTCTC TGAGGGACAA CCCCCTATAG ATGAGAAATG GAAAGGCATA GGGATAAGAA TTGAAGACGA TGTCCTTGTA AATGATACAA ACCCAGAAGT TTTAAGTATT GCAGCACTAA AAGAAATTTC TGATTTAGAA TTTTGA
|
Protein sequence | MFKPENKVFE ERREVFLNKL NGKAAIIPGA NLVKHHADCE YPFRQDSNFW YLTGFDEPDA IALFLSHKPK GERFIMFVAP KDVISEVWHG FRWGLEGAEK EFNADKAHSV NEFRDLLPAY INGSDELVFS IGKHPSVEKI ILEIFSQQLE NRSRLGIGGN SIKSPEIYLN EMRLIKSEFE IKRMRKAIQI SAEAHELVRE SISSKKNERQ IQGLLEGFFL EKGARGPAYN SIVASGDNAC ILHYTSNNSP LKKEDLLLVD AGCSLIDYYN GDITRTIPIG GKFSNEQKVI YEIVLRAQKN AIKSAVKGSN SSAVHNVALT ILIEGLKEIG LLSGSTEEII DNQSYKHLYM HRTGHWLGLD VHDVGAYRMG DYEVPLQNGM ILTVEPGIYI SDRIPVSEGQ PPIDEKWKGI GIRIEDDVLV NDTNPEVLSI AALKEISDLE F
|
| |