Gene P9301_16321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9301_16321 
SymbolpepP 
ID4911302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9301 
KingdomBacteria 
Replicon accessionNC_009091 
Strand
Start bp1364524 
End bp1365849 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content34% 
IMG OID640161229 
Productputative aminopeptidase P 
Protein accessionYP_001091856 
Protein GI126696970 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAC CTGAAAATAA AGTTTTTGAA GAAAGAAGAG AAATCTTCCT AAATAAATTA 
AATGGAAAAG CTGCTATTAT CCCTGGTGCT AGGCTTGTAA AGCATCATGC TGATTGCGAA
TATCCTTTTA GACAAGATAG TAATTTTTGG TATTTAACCG GTTTCGATGA GCCAGATGCA
ATTGCTCTTT TCTTATCGCA TAAGCCCAAG GGAGAGAGAT TTATTATGTT TGTTGCTCCT
AAAGATGTTA TAAGCGAAGT CTGGCATGGC TTTAGATGGG GTTTAGAAGG TGCTGAAAAA
GAGTTTAACG CTGATAAGGC TCACTCAATA AACGAATTAA AAGATTTACT CCCATCCTAT
ATAAATGGTT CTGATGAACT CGTTTTTTCT ATTGGTAAGC ATCCATCAGT TGAGAAAATA
ATATTGGAAA TTTTTTCACA ACAACTTGAA AACCGCTCAA GATTAGGCAT TGGTGGAAAT
TCTATAAAAT CTCCAGAAAT TTTTTTAAAT GAGATGAGAT TAATTAAAAG TGAATTTGAA
ATAAAAAGAA TGAGAGAGGC TATTCAAATC TCAGCAGAAG CTCATGAATT AGTTAGAGAA
TCAATCTCAT CAAAGAAAAA TGAAAGACAA ATTCAGGGTC TTCTAGAGGG ATTTTTTCTG
GAAAAAGGGG CAAGAGGACC AGCTTATAAC TCTATTGTTG CATCAGGAGA TAACGCTTGT
ATTTTACATT ACACTTCAAA TAACTCGCCC TTGAAGAAGG AAGATTTATT GTTAGTGGAT
GCTGGTTGCT CACTAATTGA TTATTACAAT GGAGACATAA CAAGAACTAT TCCAATTGGT
GGTAAATTTT CTAATGAGCA AAAAGTTATC TATGAAATCG TATTAAGTGC GCAGAAAAAT
GCAATTAAAA GTGCTGTAAA AGGATCCAAT TCTAGTCTTG TTCATAATGT TGCTTTAACA
ATTCTTATAG AAGGATTAAA AGAAATTGGT TTATTGTCAG GCAGTACTGA GGAGATAATT
GAGAATCAAT CTTATAAACA TCTTTACATG CATAGAACTG GACATTGGCT TGGTTTAGAT
GTTCATGATG TTGGAGCATA CAGAATGGGG GACTATGAAG TGCCATTACA GAATGGAATG
ATTCTTACGG TAGAACCTGG GATCTACATA AGTGATAGGA TCCCAGTACC TGAGGGACAA
CCCCCTATAG ATGAGAAATG GAAAGGCATA GGGATAAGAA TTGAAGACGA TGTCCTTGTC
AAAGATGAAA ACCCAGAAGT TTTAAGTATT GCTGCACTAA AAGAAATTTC TGATTTAGAA
TTTTGA
 
Protein sequence
MFKPENKVFE ERREIFLNKL NGKAAIIPGA RLVKHHADCE YPFRQDSNFW YLTGFDEPDA 
IALFLSHKPK GERFIMFVAP KDVISEVWHG FRWGLEGAEK EFNADKAHSI NELKDLLPSY
INGSDELVFS IGKHPSVEKI ILEIFSQQLE NRSRLGIGGN SIKSPEIFLN EMRLIKSEFE
IKRMREAIQI SAEAHELVRE SISSKKNERQ IQGLLEGFFL EKGARGPAYN SIVASGDNAC
ILHYTSNNSP LKKEDLLLVD AGCSLIDYYN GDITRTIPIG GKFSNEQKVI YEIVLSAQKN
AIKSAVKGSN SSLVHNVALT ILIEGLKEIG LLSGSTEEII ENQSYKHLYM HRTGHWLGLD
VHDVGAYRMG DYEVPLQNGM ILTVEPGIYI SDRIPVPEGQ PPIDEKWKGI GIRIEDDVLV
KDENPEVLSI AALKEISDLE F