Gene A9601_16441 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagA9601_16441 
SymbolpepP 
ID4718374 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. AS9601 
KingdomBacteria 
Replicon accessionNC_008816 
Strand
Start bp1388760 
End bp1390085 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content36% 
IMG OID640079370 
Productputative aminopeptidase P 
Protein accessionYP_001010034 
Protein GI123969176 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0006] Xaa-Pro aminopeptidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.689084 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTAAAC CTGAAAATAA AGTTTTTGAA GAAAGAAGAG AAGTCTTCCT CAATAAATTA 
AATGGAAAAG CTGCTATTAT CCCCGGTGCT AATCTCGTAA AGCATCATGC TGATTGTGAA
TATCCTTTTA GACAAGATAG TAATTTTTGG TATTTAACCG GTTTTGATGA GCCGGATGCA
ATCGCCCTTT TCTTATCGCA TAAGCCAAAG GGAGAGAGAT TTATTATGTT TGTTGCTCCT
AAAGATGTTA TCAGCGAAGT CTGGCATGGC TTTAGATGGG GTTTAGAAGG CGCTGAAAAA
GAGTTTAATG CGGATAAGGC TCACTCAGTA AATGAATTCA GAGATTTACT CCCAGCTTAT
ATAAATGGTT CCGATGAACT TGTTTTTTCT ATTGGTAAGC ATCCATCAGT TGAGAAAATA
ATACTTGAAA TTTTTTCACA ACAACTTGAA AATCGCTCAA GATTAGGCAT TGGTGGAAAT
TCTATAAAAT CTCCAGAAAT TTACTTAAAT GAGATGAGAT TAATTAAAAG TGAATTTGAA
ATTAAGAGAA TGAGAAAGGC TATACAAATC TCAGCCGAAG CTCATGAACT AGTTAGAGAA
TCTATCTCAT CAAAGAAAAA TGAAAGACAA ATTCAGGGTC TACTAGAGGG ATTCTTTCTG
GAAAAAGGGG CGAGAGGTCC AGCTTATAAC TCAATTGTTG CATCAGGAGA TAATGCGTGT
ATTTTGCATT ACACTTCAAA TAATTCACCA CTGAAGAAGG AAGATTTATT ATTGGTTGAT
GCTGGCTGCT CACTAATTGA TTATTACAAT GGAGACATAA CAAGAACTAT ACCAATAGGT
GGCAAATTTT CTAATGAGCA AAAAGTTATC TATGAAATTG TATTGAGAGC GCAGAAAAAT
GCAATTAAAA GTGCTGTAAA GGGATCGAAT TCTAGTGCTG TTCATAATGT CGCTTTGACA
ATTCTTATAG AAGGATTAAA AGAAATTGGG TTATTGTCGG GCAGTACTGA GGAGATAATT
GATAATCAAT CTTATAAGCA TCTTTACATG CATAGAACTG GACATTGGCT AGGCTTAGAT
GTTCATGATG TTGGAGCATA CAGAATGGGA GACTATGAAG TGCCATTACA GAATGGAATG
ATTCTTACGG TTGAACCTGG GATCTACATA AGTGATAGGA TCCCAGTCTC TGAGGGACAA
CCCCCTATAG ATGAGAAATG GAAAGGCATA GGGATAAGAA TTGAAGACGA TGTCCTTGTA
AATGATACAA ACCCAGAAGT TTTAAGTATT GCAGCACTAA AAGAAATTTC TGATTTAGAA
TTTTGA
 
Protein sequence
MFKPENKVFE ERREVFLNKL NGKAAIIPGA NLVKHHADCE YPFRQDSNFW YLTGFDEPDA 
IALFLSHKPK GERFIMFVAP KDVISEVWHG FRWGLEGAEK EFNADKAHSV NEFRDLLPAY
INGSDELVFS IGKHPSVEKI ILEIFSQQLE NRSRLGIGGN SIKSPEIYLN EMRLIKSEFE
IKRMRKAIQI SAEAHELVRE SISSKKNERQ IQGLLEGFFL EKGARGPAYN SIVASGDNAC
ILHYTSNNSP LKKEDLLLVD AGCSLIDYYN GDITRTIPIG GKFSNEQKVI YEIVLRAQKN
AIKSAVKGSN SSAVHNVALT ILIEGLKEIG LLSGSTEEII DNQSYKHLYM HRTGHWLGLD
VHDVGAYRMG DYEVPLQNGM ILTVEPGIYI SDRIPVSEGQ PPIDEKWKGI GIRIEDDVLV
NDTNPEVLSI AALKEISDLE F